CAVaLRi: An Algorithm for Rapid Identification of Diagnostic Germline Variation

IF 3.3 2区 医学 Q2 GENETICS & HEREDITY
Robert J. Schuetz, Austin A. Antoniou, Grant E. Lammi, David M. Gordon, Harkness C. Kuck, Bimal P. Chaudhari, Peter White
{"title":"CAVaLRi: An Algorithm for Rapid Identification of Diagnostic Germline Variation","authors":"Robert J. Schuetz,&nbsp;Austin A. Antoniou,&nbsp;Grant E. Lammi,&nbsp;David M. Gordon,&nbsp;Harkness C. Kuck,&nbsp;Bimal P. Chaudhari,&nbsp;Peter White","doi":"10.1155/2024/6411444","DOIUrl":null,"url":null,"abstract":"<p>Clinical exome and genome sequencing (ES/GS) have become indispensable diagnostic tools for rare genetic diseases (RGD). However, the interpretation of ES/GS presents a substantial operational challenge in clinical settings. Test interpretation requires the review of hundreds of genetic variants, a task that has become increasingly challenging given the rising use of ES/GS. In response, we present Clinical Assessment of Variants by Likelihood Ratios (CAVaLRi), which employ a modified likelihood ratio (LR) framework to assign diagnostic probabilities to candidate germline disease genes. CAVaLRi models aspects of the clinical variant assessment process, taking into consideration the predicted impact of the variant, the proband and parental genotypes, and the proband’s clinical characteristics. It also factors in computational phenotype noise and weighs the relative significance of genotype, phenotype, and variant segregation information. We trained and tested CAVaLRi on variant and phenotype data from an internal cohort of 655 clinical ES cases. For validation, CAVaLRi’s performance was benchmarked against four leading gene prioritization algorithms (Exomiser’s hiPHIVE and PhenIX prioritizers, LIRICAL, and XRare) using a distinct cohort of 12,832 ES cases. Our findings reveal that CAVaLRi significantly outperforms its counterparts when clinician-curated phenotype sets are used, as evidenced by its superior precision-recall curve (PR AUC: 0.701) and average diagnostic gene rank (1.59). Notably, even when substituting highly focused clinician-curated phenotype sets with large and potentially nonspecific computationally derived phenotypes, CAVaLRi retains its precision (PR AUC: 0.658; diagnostic gene average rank: 1.68) and markedly outperforms other tools. In a large, heterogeneous validation cohort, CAVaLRi stood out as the most precise prioritization algorithm (PR AUC: 0.335; average diagnostic rank: 1.91). In conclusion, CAVaLRi presents a robust solution for prioritizing diagnostic genes, surpassing current methods. It demonstrates resilience to noisy, computationally-derived phenotypes, providing a scalable strategy to help labs focus on the most diagnostically relevant variants, thus addressing the growing demand for ES/GS interpretation.</p>","PeriodicalId":13061,"journal":{"name":"Human Mutation","volume":"2024 1","pages":""},"PeriodicalIF":3.3000,"publicationDate":"2024-04-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Human Mutation","FirstCategoryId":"3","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1155/2024/6411444","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0

Abstract

Clinical exome and genome sequencing (ES/GS) have become indispensable diagnostic tools for rare genetic diseases (RGD). However, the interpretation of ES/GS presents a substantial operational challenge in clinical settings. Test interpretation requires the review of hundreds of genetic variants, a task that has become increasingly challenging given the rising use of ES/GS. In response, we present Clinical Assessment of Variants by Likelihood Ratios (CAVaLRi), which employ a modified likelihood ratio (LR) framework to assign diagnostic probabilities to candidate germline disease genes. CAVaLRi models aspects of the clinical variant assessment process, taking into consideration the predicted impact of the variant, the proband and parental genotypes, and the proband’s clinical characteristics. It also factors in computational phenotype noise and weighs the relative significance of genotype, phenotype, and variant segregation information. We trained and tested CAVaLRi on variant and phenotype data from an internal cohort of 655 clinical ES cases. For validation, CAVaLRi’s performance was benchmarked against four leading gene prioritization algorithms (Exomiser’s hiPHIVE and PhenIX prioritizers, LIRICAL, and XRare) using a distinct cohort of 12,832 ES cases. Our findings reveal that CAVaLRi significantly outperforms its counterparts when clinician-curated phenotype sets are used, as evidenced by its superior precision-recall curve (PR AUC: 0.701) and average diagnostic gene rank (1.59). Notably, even when substituting highly focused clinician-curated phenotype sets with large and potentially nonspecific computationally derived phenotypes, CAVaLRi retains its precision (PR AUC: 0.658; diagnostic gene average rank: 1.68) and markedly outperforms other tools. In a large, heterogeneous validation cohort, CAVaLRi stood out as the most precise prioritization algorithm (PR AUC: 0.335; average diagnostic rank: 1.91). In conclusion, CAVaLRi presents a robust solution for prioritizing diagnostic genes, surpassing current methods. It demonstrates resilience to noisy, computationally-derived phenotypes, providing a scalable strategy to help labs focus on the most diagnostically relevant variants, thus addressing the growing demand for ES/GS interpretation.

CAVaLRi:快速鉴定诊断性种系变异的算法
临床外显子组和基因组测序(ES/GS)已成为罕见遗传病(RGD)不可或缺的诊断工具。然而,ES/GS 的解读给临床操作带来了巨大挑战。为此,我们提出了 "通过似然比对变异进行临床评估"(CAVaLRi),该方法采用改进的似然比(LR)框架,为候选种系疾病基因分配诊断概率。CAVaLRi 对临床变异评估过程的各个方面进行建模,考虑到变异的预测影响、原告和父母的基因型以及原告的临床特征。它还考虑了计算表型噪声,并权衡了基因型、表型和变异分离信息的相对重要性。我们对来自 655 个临床 ES 病例的内部队列的变异和表型数据进行了 CAVaLRi 的训练和测试。为了验证 CAVaLRi 的性能,我们使用 12,832 例 ES 病例组成的不同队列,将其与四种领先的基因优先化算法(Exomiser 的 hiPHIVE 和 PhenIX 优先化算法、LIRICAL 和 XRare)进行了比较。我们的研究结果表明,在使用临床医生归纳的表型集时,CAVaLRi 的表现明显优于同类产品,其卓越的精确度-召回曲线(PR AUC:0.701)和平均诊断基因等级(1.59)证明了这一点。值得注意的是,即使用大型且可能是非特异性的计算得出的表型替代高度集中的临床医生校准表型集,CAVaLRi 仍能保持其精确性(PR AUC:0.658;诊断基因平均等级:1.68),并明显优于其他工具。在一个大型异质性验证队列中,CAVaLRi 是最精确的优先排序算法(PR AUC:0.335;平均诊断等级:1.91)。总之,CAVaLRi 为诊断基因的优先排序提供了一种稳健的解决方案,超越了现有的方法。它展示了对嘈杂的计算衍生表型的适应能力,提供了一种可扩展的策略,帮助实验室关注最具诊断相关性的变异,从而满足对 ES/GS 解释日益增长的需求。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Human Mutation
Human Mutation 医学-遗传学
CiteScore
8.40
自引率
5.10%
发文量
190
审稿时长
2 months
期刊介绍: Human Mutation is a peer-reviewed journal that offers publication of original Research Articles, Methods, Mutation Updates, Reviews, Database Articles, Rapid Communications, and Letters on broad aspects of mutation research in humans. Reports of novel DNA variations and their phenotypic consequences, reports of SNPs demonstrated as valuable for genomic analysis, descriptions of new molecular detection methods, and novel approaches to clinical diagnosis are welcomed. Novel reports of gene organization at the genomic level, reported in the context of mutation investigation, may be considered. The journal provides a unique forum for the exchange of ideas, methods, and applications of interest to molecular, human, and medical geneticists in academic, industrial, and clinical research settings worldwide.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信