Unveiling novel genetic variants in 370 challenging medically relevant genes using the long read sequencing data of 41 samples from 19 global populations.

IF 2.3 3区 生物学 Q3 BIOCHEMISTRY & MOLECULAR BIOLOGY
Yanfeng Ji, Junfan Zhao, Jiao Gong, Fritz J Sedlazeck, Shaohua Fan
{"title":"Unveiling novel genetic variants in 370 challenging medically relevant genes using the long read sequencing data of 41 samples from 19 global populations.","authors":"Yanfeng Ji, Junfan Zhao, Jiao Gong, Fritz J Sedlazeck, Shaohua Fan","doi":"10.1007/s00438-024-02158-x","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>A large number of challenging medically relevant genes (CMRGs) are situated in complex or highly repetitive regions of the human genome, hindering comprehensive characterization of genetic variants using next-generation sequencing technologies. In this study, we employed long-read sequencing technology, extensively utilized in studying complex genomic regions, to characterize genetic alterations, including short variants (single nucleotide variants and short insertions and deletions) and copy number variations, in 370 CMRGs across 41 individuals from 19 global populations.</p><p><strong>Results: </strong>Our analysis revealed high levels of genetic variants in CMRGs, with 68.73% exhibiting copy number variations and 65.20% containing short variants that may disrupt protein function across individuals. Such variants can influence pharmacogenomics, genetic disease susceptibility, and other clinical outcomes. We observed significant differences in CMRG variation across populations, with individuals of African ancestry harboring the highest number of copy number variants and short variants compared to samples from other continents. Notably, 15.79% to 33.96% of short variants were exclusively detectable through long-read sequencing. While the T2T-CHM13 reference genome significantly improved the assembly of CMRG regions, thereby facilitating variant detection in these regions, some regions still lacked resolution.</p><p><strong>Conclusion: </strong>Our results provide an important reference for future clinical and pharmacogenetic studies, highlighting the need for a comprehensive representation of global genetic diversity in the reference genome and improved variant calling techniques to fully resolve medically relevant genes.</p>","PeriodicalId":18816,"journal":{"name":"Molecular Genetics and Genomics","volume":null,"pages":null},"PeriodicalIF":2.3000,"publicationDate":"2024-07-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Molecular Genetics and Genomics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1007/s00438-024-02158-x","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Background: A large number of challenging medically relevant genes (CMRGs) are situated in complex or highly repetitive regions of the human genome, hindering comprehensive characterization of genetic variants using next-generation sequencing technologies. In this study, we employed long-read sequencing technology, extensively utilized in studying complex genomic regions, to characterize genetic alterations, including short variants (single nucleotide variants and short insertions and deletions) and copy number variations, in 370 CMRGs across 41 individuals from 19 global populations.

Results: Our analysis revealed high levels of genetic variants in CMRGs, with 68.73% exhibiting copy number variations and 65.20% containing short variants that may disrupt protein function across individuals. Such variants can influence pharmacogenomics, genetic disease susceptibility, and other clinical outcomes. We observed significant differences in CMRG variation across populations, with individuals of African ancestry harboring the highest number of copy number variants and short variants compared to samples from other continents. Notably, 15.79% to 33.96% of short variants were exclusively detectable through long-read sequencing. While the T2T-CHM13 reference genome significantly improved the assembly of CMRG regions, thereby facilitating variant detection in these regions, some regions still lacked resolution.

Conclusion: Our results provide an important reference for future clinical and pharmacogenetic studies, highlighting the need for a comprehensive representation of global genetic diversity in the reference genome and improved variant calling techniques to fully resolve medically relevant genes.

Abstract Image

利用来自全球 19 个人群的 41 个样本的长读测序数据,揭示 370 个具有挑战性的医学相关基因中的新型基因变异。
背景:大量具有挑战性的医学相关基因(CMRGs)位于人类基因组的复杂或高度重复区域,阻碍了利用新一代测序技术对基因变异进行全面鉴定。在这项研究中,我们采用了广泛应用于复杂基因组区域研究的长线程测序技术,对来自全球19个种群的41个个体的370个CMRGs中的遗传变异进行了表征,包括短变异(单核苷酸变异、短插入和短缺失)和拷贝数变异:我们的分析表明,CMRGs 中存在大量基因变异,其中 68.73% 存在拷贝数变异,65.20% 包含短变异,这些变异可能会破坏不同个体的蛋白质功能。这些变异可影响药物基因组学、遗传病易感性和其他临床结果。我们观察到不同人群的 CMRG 变异存在明显差异,与来自其他大洲的样本相比,非洲血统的个体携带的拷贝数变异和短变异数量最多。值得注意的是,15.79% 到 33.96% 的短变异只能通过长读测序检测到。虽然T2T-CHM13参考基因组大大改善了CMRG区域的组装,从而促进了这些区域的变异检测,但一些区域仍然缺乏分辨率:我们的研究结果为未来的临床和药物遗传学研究提供了重要参考,强调了在参考基因组中全面体现全球遗传多样性和改进变异调用技术以全面解析医学相关基因的必要性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Molecular Genetics and Genomics
Molecular Genetics and Genomics 生物-生化与分子生物学
CiteScore
5.10
自引率
3.20%
发文量
134
审稿时长
1 months
期刊介绍: Molecular Genetics and Genomics (MGG) publishes peer-reviewed articles covering all areas of genetics and genomics. Any approach to the study of genes and genomes is considered, be it experimental, theoretical or synthetic. MGG publishes research on all organisms that is of broad interest to those working in the fields of genetics, genomics, biology, medicine and biotechnology. The journal investigates a broad range of topics, including these from recent issues: mechanisms for extending longevity in a variety of organisms; screening of yeast metal homeostasis genes involved in mitochondrial functions; molecular mapping of cultivar-specific avirulence genes in the rice blast fungus and more.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信