罕见病队列中深度学习模型与临床级变体致病性分类之间的不一致。

IF 4.7 2区 医学 Q1 GENETICS & HEREDITY
Sek Won Kong, In-Hee Lee, Lauren V Collen, Michael Field, Arjun K Manrai, Scott B Snapper, Kenneth D Mandl
{"title":"罕见病队列中深度学习模型与临床级变体致病性分类之间的不一致。","authors":"Sek Won Kong, In-Hee Lee, Lauren V Collen, Michael Field, Arjun K Manrai, Scott B Snapper, Kenneth D Mandl","doi":"10.1038/s41525-025-00480-w","DOIUrl":null,"url":null,"abstract":"<p><p>Genetic testing is essential for diagnosing and managing clinical conditions, particularly rare Mendelian diseases. Although efforts to identify rare phenotype-associated variants have focused on protein-truncating variants, interpreting missense variants remains challenging. Deep learning algorithms excel in various biomedical tasks<sup>1,2</sup>, yet distinguishing pathogenic from benign missense variants remains elusive<sup>3-5</sup>. Our investigation of AlphaMissense (AM)<sup>5</sup>, a deep learning tool for predicting the potential functional impact of missense variants and assessing gene essentiality, reveals limitations in identifying pathogenic missense variants over 45 rare diseases, including very early onset inflammatory bowel disease. For the expert-curated pathogenic variants identified in our cohort, AM's precision was 32.9%, and recall was 57.6%. Notably, AM struggles to evaluate pathogenicity in intrinsically disordered regions (IDRs), resulting in unreliable gene-level essentiality scores for genes containing IDRs. This observation underscores ongoing challenges in clinical genetics, highlighting the need for continued refinement of computational methods in variant pathogenicity prediction.</p>","PeriodicalId":19273,"journal":{"name":"NPJ Genomic Medicine","volume":"10 1","pages":"17"},"PeriodicalIF":4.7000,"publicationDate":"2025-02-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11871343/pdf/","citationCount":"0","resultStr":"{\"title\":\"Discordance between a deep learning model and clinical-grade variant pathogenicity classification in a rare disease cohort.\",\"authors\":\"Sek Won Kong, In-Hee Lee, Lauren V Collen, Michael Field, Arjun K Manrai, Scott B Snapper, Kenneth D Mandl\",\"doi\":\"10.1038/s41525-025-00480-w\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Genetic testing is essential for diagnosing and managing clinical conditions, particularly rare Mendelian diseases. Although efforts to identify rare phenotype-associated variants have focused on protein-truncating variants, interpreting missense variants remains challenging. Deep learning algorithms excel in various biomedical tasks<sup>1,2</sup>, yet distinguishing pathogenic from benign missense variants remains elusive<sup>3-5</sup>. Our investigation of AlphaMissense (AM)<sup>5</sup>, a deep learning tool for predicting the potential functional impact of missense variants and assessing gene essentiality, reveals limitations in identifying pathogenic missense variants over 45 rare diseases, including very early onset inflammatory bowel disease. For the expert-curated pathogenic variants identified in our cohort, AM's precision was 32.9%, and recall was 57.6%. Notably, AM struggles to evaluate pathogenicity in intrinsically disordered regions (IDRs), resulting in unreliable gene-level essentiality scores for genes containing IDRs. This observation underscores ongoing challenges in clinical genetics, highlighting the need for continued refinement of computational methods in variant pathogenicity prediction.</p>\",\"PeriodicalId\":19273,\"journal\":{\"name\":\"NPJ Genomic Medicine\",\"volume\":\"10 1\",\"pages\":\"17\"},\"PeriodicalIF\":4.7000,\"publicationDate\":\"2025-02-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11871343/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"NPJ Genomic Medicine\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1038/s41525-025-00480-w\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"GENETICS & HEREDITY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"NPJ Genomic Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1038/s41525-025-00480-w","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0

摘要

基因检测对于诊断和管理临床疾病,特别是罕见的孟德尔疾病至关重要。尽管鉴定罕见表型相关变异的努力主要集中在蛋白质截断变异上,但解释错义变异仍然具有挑战性。深度学习算法在各种生物医学任务中表现出色1,2,但区分致病性和良性错义变体仍然难以实现3-5。我们对AlphaMissense (AM)5(一种用于预测错义变异的潜在功能影响和评估基因必要性的深度学习工具)的研究揭示了在45种罕见疾病(包括非常早发性炎症性肠病)中识别致病性错义变异的局限性。对于我们的队列中鉴定的专家整理的致病变异,AM的准确率为32.9%,召回率为57.6%。值得注意的是,AM很难评估内在无序区(idr)的致病性,导致含有idr的基因在基因水平上的重要性评分不可靠。这一观察结果强调了临床遗传学正在面临的挑战,强调了在变异致病性预测中不断改进计算方法的必要性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Discordance between a deep learning model and clinical-grade variant pathogenicity classification in a rare disease cohort.

Genetic testing is essential for diagnosing and managing clinical conditions, particularly rare Mendelian diseases. Although efforts to identify rare phenotype-associated variants have focused on protein-truncating variants, interpreting missense variants remains challenging. Deep learning algorithms excel in various biomedical tasks1,2, yet distinguishing pathogenic from benign missense variants remains elusive3-5. Our investigation of AlphaMissense (AM)5, a deep learning tool for predicting the potential functional impact of missense variants and assessing gene essentiality, reveals limitations in identifying pathogenic missense variants over 45 rare diseases, including very early onset inflammatory bowel disease. For the expert-curated pathogenic variants identified in our cohort, AM's precision was 32.9%, and recall was 57.6%. Notably, AM struggles to evaluate pathogenicity in intrinsically disordered regions (IDRs), resulting in unreliable gene-level essentiality scores for genes containing IDRs. This observation underscores ongoing challenges in clinical genetics, highlighting the need for continued refinement of computational methods in variant pathogenicity prediction.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
NPJ Genomic Medicine
NPJ Genomic Medicine Biochemistry, Genetics and Molecular Biology-Molecular Biology
CiteScore
9.40
自引率
1.90%
发文量
67
审稿时长
17 weeks
期刊介绍: npj Genomic Medicine is an international, peer-reviewed journal dedicated to publishing the most important scientific advances in all aspects of genomics and its application in the practice of medicine. The journal defines genomic medicine as "diagnosis, prognosis, prevention and/or treatment of disease and disorders of the mind and body, using approaches informed or enabled by knowledge of the genome and the molecules it encodes." Relevant and high-impact papers that encompass studies of individuals, families, or populations are considered for publication. An emphasis will include coupling detailed phenotype and genome sequencing information, both enabled by new technologies and informatics, to delineate the underlying aetiology of disease. Clinical recommendations and/or guidelines of how that data should be used in the clinical management of those patients in the study, and others, are also encouraged.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信