Sek Won Kong, In-Hee Lee, Lauren V Collen, Michael Field, Arjun K Manrai, Scott B Snapper, Kenneth D Mandl
{"title":"罕见病队列中深度学习模型与临床级变体致病性分类之间的不一致。","authors":"Sek Won Kong, In-Hee Lee, Lauren V Collen, Michael Field, Arjun K Manrai, Scott B Snapper, Kenneth D Mandl","doi":"10.1038/s41525-025-00480-w","DOIUrl":null,"url":null,"abstract":"<p><p>Genetic testing is essential for diagnosing and managing clinical conditions, particularly rare Mendelian diseases. Although efforts to identify rare phenotype-associated variants have focused on protein-truncating variants, interpreting missense variants remains challenging. Deep learning algorithms excel in various biomedical tasks<sup>1,2</sup>, yet distinguishing pathogenic from benign missense variants remains elusive<sup>3-5</sup>. Our investigation of AlphaMissense (AM)<sup>5</sup>, a deep learning tool for predicting the potential functional impact of missense variants and assessing gene essentiality, reveals limitations in identifying pathogenic missense variants over 45 rare diseases, including very early onset inflammatory bowel disease. For the expert-curated pathogenic variants identified in our cohort, AM's precision was 32.9%, and recall was 57.6%. Notably, AM struggles to evaluate pathogenicity in intrinsically disordered regions (IDRs), resulting in unreliable gene-level essentiality scores for genes containing IDRs. This observation underscores ongoing challenges in clinical genetics, highlighting the need for continued refinement of computational methods in variant pathogenicity prediction.</p>","PeriodicalId":19273,"journal":{"name":"NPJ Genomic Medicine","volume":"10 1","pages":"17"},"PeriodicalIF":4.7000,"publicationDate":"2025-02-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11871343/pdf/","citationCount":"0","resultStr":"{\"title\":\"Discordance between a deep learning model and clinical-grade variant pathogenicity classification in a rare disease cohort.\",\"authors\":\"Sek Won Kong, In-Hee Lee, Lauren V Collen, Michael Field, Arjun K Manrai, Scott B Snapper, Kenneth D Mandl\",\"doi\":\"10.1038/s41525-025-00480-w\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Genetic testing is essential for diagnosing and managing clinical conditions, particularly rare Mendelian diseases. Although efforts to identify rare phenotype-associated variants have focused on protein-truncating variants, interpreting missense variants remains challenging. Deep learning algorithms excel in various biomedical tasks<sup>1,2</sup>, yet distinguishing pathogenic from benign missense variants remains elusive<sup>3-5</sup>. Our investigation of AlphaMissense (AM)<sup>5</sup>, a deep learning tool for predicting the potential functional impact of missense variants and assessing gene essentiality, reveals limitations in identifying pathogenic missense variants over 45 rare diseases, including very early onset inflammatory bowel disease. For the expert-curated pathogenic variants identified in our cohort, AM's precision was 32.9%, and recall was 57.6%. Notably, AM struggles to evaluate pathogenicity in intrinsically disordered regions (IDRs), resulting in unreliable gene-level essentiality scores for genes containing IDRs. This observation underscores ongoing challenges in clinical genetics, highlighting the need for continued refinement of computational methods in variant pathogenicity prediction.</p>\",\"PeriodicalId\":19273,\"journal\":{\"name\":\"NPJ Genomic Medicine\",\"volume\":\"10 1\",\"pages\":\"17\"},\"PeriodicalIF\":4.7000,\"publicationDate\":\"2025-02-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11871343/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"NPJ Genomic Medicine\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1038/s41525-025-00480-w\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"GENETICS & HEREDITY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"NPJ Genomic Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1038/s41525-025-00480-w","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
Discordance between a deep learning model and clinical-grade variant pathogenicity classification in a rare disease cohort.
Genetic testing is essential for diagnosing and managing clinical conditions, particularly rare Mendelian diseases. Although efforts to identify rare phenotype-associated variants have focused on protein-truncating variants, interpreting missense variants remains challenging. Deep learning algorithms excel in various biomedical tasks1,2, yet distinguishing pathogenic from benign missense variants remains elusive3-5. Our investigation of AlphaMissense (AM)5, a deep learning tool for predicting the potential functional impact of missense variants and assessing gene essentiality, reveals limitations in identifying pathogenic missense variants over 45 rare diseases, including very early onset inflammatory bowel disease. For the expert-curated pathogenic variants identified in our cohort, AM's precision was 32.9%, and recall was 57.6%. Notably, AM struggles to evaluate pathogenicity in intrinsically disordered regions (IDRs), resulting in unreliable gene-level essentiality scores for genes containing IDRs. This observation underscores ongoing challenges in clinical genetics, highlighting the need for continued refinement of computational methods in variant pathogenicity prediction.
NPJ Genomic MedicineBiochemistry, Genetics and Molecular Biology-Molecular Biology
CiteScore
9.40
自引率
1.90%
发文量
67
审稿时长
17 weeks
期刊介绍:
npj Genomic Medicine is an international, peer-reviewed journal dedicated to publishing the most important scientific advances in all aspects of genomics and its application in the practice of medicine.
The journal defines genomic medicine as "diagnosis, prognosis, prevention and/or treatment of disease and disorders of the mind and body, using approaches informed or enabled by knowledge of the genome and the molecules it encodes." Relevant and high-impact papers that encompass studies of individuals, families, or populations are considered for publication. An emphasis will include coupling detailed phenotype and genome sequencing information, both enabled by new technologies and informatics, to delineate the underlying aetiology of disease. Clinical recommendations and/or guidelines of how that data should be used in the clinical management of those patients in the study, and others, are also encouraged.