CAVaLRi: An Algorithm for Rapid Identification of Diagnostic Germline Variation

IF 3.7 2区医学 Q2 GENETICS & HEREDITY

Human Mutation Pub Date : 2024-04-29 DOI:10.1155/2024/6411444

Robert J. Schuetz, Austin A. Antoniou, Grant E. Lammi, David M. Gordon, Harkness C. Kuck, Bimal P. Chaudhari, Peter White

{"title":"CAVaLRi: An Algorithm for Rapid Identification of Diagnostic Germline Variation","authors":"Robert J. Schuetz, Austin A. Antoniou, Grant E. Lammi, David M. Gordon, Harkness C. Kuck, Bimal P. Chaudhari, Peter White","doi":"10.1155/2024/6411444","DOIUrl":null,"url":null,"abstract":"<p>Clinical exome and genome sequencing (ES/GS) have become indispensable diagnostic tools for rare genetic diseases (RGD). However, the interpretation of ES/GS presents a substantial operational challenge in clinical settings. Test interpretation requires the review of hundreds of genetic variants, a task that has become increasingly challenging given the rising use of ES/GS. In response, we present Clinical Assessment of Variants by Likelihood Ratios (CAVaLRi), which employ a modified likelihood ratio (LR) framework to assign diagnostic probabilities to candidate germline disease genes. CAVaLRi models aspects of the clinical variant assessment process, taking into consideration the predicted impact of the variant, the proband and parental genotypes, and the proband’s clinical characteristics. It also factors in computational phenotype noise and weighs the relative significance of genotype, phenotype, and variant segregation information. We trained and tested CAVaLRi on variant and phenotype data from an internal cohort of 655 clinical ES cases. For validation, CAVaLRi’s performance was benchmarked against four leading gene prioritization algorithms (Exomiser’s hiPHIVE and PhenIX prioritizers, LIRICAL, and XRare) using a distinct cohort of 12,832 ES cases. Our findings reveal that CAVaLRi significantly outperforms its counterparts when clinician-curated phenotype sets are used, as evidenced by its superior precision-recall curve (PR AUC: 0.701) and average diagnostic gene rank (1.59). Notably, even when substituting highly focused clinician-curated phenotype sets with large and potentially nonspecific computationally derived phenotypes, CAVaLRi retains its precision (PR AUC: 0.658; diagnostic gene average rank: 1.68) and markedly outperforms other tools. In a large, heterogeneous validation cohort, CAVaLRi stood out as the most precise prioritization algorithm (PR AUC: 0.335; average diagnostic rank: 1.91). In conclusion, CAVaLRi presents a robust solution for prioritizing diagnostic genes, surpassing current methods. It demonstrates resilience to noisy, computationally-derived phenotypes, providing a scalable strategy to help labs focus on the most diagnostically relevant variants, thus addressing the growing demand for ES/GS interpretation.</p>","PeriodicalId":13061,"journal":{"name":"Human Mutation","volume":"2024 1","pages":""},"PeriodicalIF":3.7000,"publicationDate":"2024-04-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Human Mutation","FirstCategoryId":"3","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1155/2024/6411444","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}

引用次数: 0

Abstract

Clinical exome and genome sequencing (ES/GS) have become indispensable diagnostic tools for rare genetic diseases (RGD). However, the interpretation of ES/GS presents a substantial operational challenge in clinical settings. Test interpretation requires the review of hundreds of genetic variants, a task that has become increasingly challenging given the rising use of ES/GS. In response, we present Clinical Assessment of Variants by Likelihood Ratios (CAVaLRi), which employ a modified likelihood ratio (LR) framework to assign diagnostic probabilities to candidate germline disease genes. CAVaLRi models aspects of the clinical variant assessment process, taking into consideration the predicted impact of the variant, the proband and parental genotypes, and the proband’s clinical characteristics. It also factors in computational phenotype noise and weighs the relative significance of genotype, phenotype, and variant segregation information. We trained and tested CAVaLRi on variant and phenotype data from an internal cohort of 655 clinical ES cases. For validation, CAVaLRi’s performance was benchmarked against four leading gene prioritization algorithms (Exomiser’s hiPHIVE and PhenIX prioritizers, LIRICAL, and XRare) using a distinct cohort of 12,832 ES cases. Our findings reveal that CAVaLRi significantly outperforms its counterparts when clinician-curated phenotype sets are used, as evidenced by its superior precision-recall curve (PR AUC: 0.701) and average diagnostic gene rank (1.59). Notably, even when substituting highly focused clinician-curated phenotype sets with large and potentially nonspecific computationally derived phenotypes, CAVaLRi retains its precision (PR AUC: 0.658; diagnostic gene average rank: 1.68) and markedly outperforms other tools. In a large, heterogeneous validation cohort, CAVaLRi stood out as the most precise prioritization algorithm (PR AUC: 0.335; average diagnostic rank: 1.91). In conclusion, CAVaLRi presents a robust solution for prioritizing diagnostic genes, surpassing current methods. It demonstrates resilience to noisy, computationally-derived phenotypes, providing a scalable strategy to help labs focus on the most diagnostically relevant variants, thus addressing the growing demand for ES/GS interpretation.

查看原文本刊更多论文

CAVaLRi：快速鉴定诊断性种系变异的算法

临床外显子组和基因组测序（ES/GS）已成为罕见遗传病（RGD）不可或缺的诊断工具。然而，ES/GS 的解读给临床操作带来了巨大挑战。为此，我们提出了 "通过似然比对变异进行临床评估"（CAVaLRi），该方法采用改进的似然比（LR）框架，为候选种系疾病基因分配诊断概率。CAVaLRi 对临床变异评估过程的各个方面进行建模，考虑到变异的预测影响、原告和父母的基因型以及原告的临床特征。它还考虑了计算表型噪声，并权衡了基因型、表型和变异分离信息的相对重要性。我们对来自 655 个临床 ES 病例的内部队列的变异和表型数据进行了 CAVaLRi 的训练和测试。为了验证 CAVaLRi 的性能，我们使用 12,832 例 ES 病例组成的不同队列，将其与四种领先的基因优先化算法（Exomiser 的 hiPHIVE 和 PhenIX 优先化算法、LIRICAL 和 XRare）进行了比较。我们的研究结果表明，在使用临床医生归纳的表型集时，CAVaLRi 的表现明显优于同类产品，其卓越的精确度-召回曲线（PR AUC：0.701）和平均诊断基因等级（1.59）证明了这一点。值得注意的是，即使用大型且可能是非特异性的计算得出的表型替代高度集中的临床医生校准表型集，CAVaLRi 仍能保持其精确性（PR AUC：0.658；诊断基因平均等级：1.68），并明显优于其他工具。在一个大型异质性验证队列中，CAVaLRi 是最精确的优先排序算法（PR AUC：0.335；平均诊断等级：1.91）。总之，CAVaLRi 为诊断基因的优先排序提供了一种稳健的解决方案，超越了现有的方法。它展示了对嘈杂的计算衍生表型的适应能力，提供了一种可扩展的策略，帮助实验室关注最具诊断相关性的变异，从而满足对 ES/GS 解释日益增长的需求。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Human Mutation 医学-遗传学

CiteScore

8.40

自引率

5.10%

发文量

190

审稿时长

2 months

期刊介绍： Human Mutation is a peer-reviewed journal that offers publication of original Research Articles, Methods, Mutation Updates, Reviews, Database Articles, Rapid Communications, and Letters on broad aspects of mutation research in humans. Reports of novel DNA variations and their phenotypic consequences, reports of SNPs demonstrated as valuable for genomic analysis, descriptions of new molecular detection methods, and novel approaches to clinical diagnosis are welcomed. Novel reports of gene organization at the genomic level, reported in the context of mutation investigation, may be considered. The journal provides a unique forum for the exchange of ideas, methods, and applications of interest to molecular, human, and medical geneticists in academic, industrial, and clinical research settings worldwide.