在全基因组数据上校准变异效应预测因子会掩盖不同基因之间的差异。

IF 8.1 1区生物学 Q1 GENETICS & HEREDITY

American journal of human genetics Pub Date : 2024-09-05 Epub Date: 2024-08-21 DOI:10.1016/j.ajhg.2024.07.018

Malvika Tejura, Shawn Fayer, Abbye E McEwen, Jake Flynn, Lea M Starita, Douglas M Fowler

{"title":"在全基因组数据上校准变异效应预测因子会掩盖不同基因之间的差异。","authors":"Malvika Tejura, Shawn Fayer, Abbye E McEwen, Jake Flynn, Lea M Starita, Douglas M Fowler","doi":"10.1016/j.ajhg.2024.07.018","DOIUrl":null,"url":null,"abstract":"In silico variant effect predictions are available for nearly all missense variants but played a minimal role in clinical variant classification because they were deemed to provide only supporting evidence. Recently, the ClinGen Sequence Variant Interpretation (SVI) Working Group updated recommendations for variant effect prediction use. By analyzing control pathogenic and benign variants across all genes, they were able to compute evidence strength for predictor score intervals with some intervals generating moderate, strong, or even very strong evidence. However, this genome-wide approach could obscure heterogeneous predictor performance in different genes. We quantified the gene-by-gene performance of two top predictors, REVEL and BayesDel, by analyzing control variants in each predictor score interval in 3,668 disease-relevant genes. Approximately 10% of intervals had sufficient control variants for analysis, and ∼70% of these intervals exceeded the maximum number of incorrect predictions implied by the SVI recommendations. These trending discordant intervals arose owing to the divergence of the gene-specific distribution of predictions from the genome-wide distribution, suggesting that gene-specific calibration is needed in many cases. Approximately 22% of ClinVar missense variants of uncertain significance in genes we analyzed (REVEL = 100,629, BayesDel = 71,928) had predictions in trending discordant intervals. Thus, genome-wide calibrations could result in many variants receiving inappropriate evidence strength. To facilitate a review of the SVI's calibrations, we developed a web application enabling visualization of gene-specific predictions and trending concordant and discordant intervals.","PeriodicalId":7659,"journal":{"name":"American journal of human genetics","volume":" ","pages":"2031-2043"},"PeriodicalIF":8.1000,"publicationDate":"2024-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11393694/pdf/","citationCount":"0","resultStr":"{\"title\":\"Calibration of variant effect predictors on genome-wide data masks heterogeneous performance across genes.\",\"authors\":\"Malvika Tejura, Shawn Fayer, Abbye E McEwen, Jake Flynn, Lea M Starita, Douglas M Fowler\",\"doi\":\"10.1016/j.ajhg.2024.07.018\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In silico variant effect predictions are available for nearly all missense variants but played a minimal role in clinical variant classification because they were deemed to provide only supporting evidence. Recently, the ClinGen Sequence Variant Interpretation (SVI) Working Group updated recommendations for variant effect prediction use. By analyzing control pathogenic and benign variants across all genes, they were able to compute evidence strength for predictor score intervals with some intervals generating moderate, strong, or even very strong evidence. However, this genome-wide approach could obscure heterogeneous predictor performance in different genes. We quantified the gene-by-gene performance of two top predictors, REVEL and BayesDel, by analyzing control variants in each predictor score interval in 3,668 disease-relevant genes. Approximately 10% of intervals had sufficient control variants for analysis, and ∼70% of these intervals exceeded the maximum number of incorrect predictions implied by the SVI recommendations. These trending discordant intervals arose owing to the divergence of the gene-specific distribution of predictions from the genome-wide distribution, suggesting that gene-specific calibration is needed in many cases. Approximately 22% of ClinVar missense variants of uncertain significance in genes we analyzed (REVEL = 100,629, BayesDel = 71,928) had predictions in trending discordant intervals. Thus, genome-wide calibrations could result in many variants receiving inappropriate evidence strength. To facilitate a review of the SVI's calibrations, we developed a web application enabling visualization of gene-specific predictions and trending concordant and discordant intervals.\",\"PeriodicalId\":7659,\"journal\":{\"name\":\"American journal of human genetics\",\"volume\":\" \",\"pages\":\"2031-2043\"},\"PeriodicalIF\":8.1000,\"publicationDate\":\"2024-09-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11393694/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"American journal of human genetics\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1016/j.ajhg.2024.07.018\",\"RegionNum\":1,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/8/21 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"GENETICS & HEREDITY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"American journal of human genetics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1016/j.ajhg.2024.07.018","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/8/21 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}

引用次数: 0

摘要

几乎所有的错义变异都有硅学变异效应预测，但在临床变异分类中的作用微乎其微，因为它们被认为只能提供辅助证据。最近，ClinGen 序列变异解释（SVI）工作组更新了变异效应预测使用建议。通过分析所有基因中的对照致病变异和良性变异，他们能够计算出预测得分区间的证据强度，其中一些区间可产生中等、强甚至非常强的证据。然而，这种全基因组方法可能会掩盖不同基因中预测因子的异质性。我们通过分析 3,668 个疾病相关基因中每个预测因子得分区间的对照变异，量化了 REVEL 和 BayesDel 这两个顶级预测因子在不同基因中的表现。大约 10% 的区间有足够的对照变异可供分析，其中 70% 的区间超过了 SVI 建议所暗示的最大错误预测数。这些趋势性不一致区间的出现是由于特定基因的预测分布与全基因组分布出现分歧，这表明在许多情况下需要对特定基因进行校准。在我们分析的基因（REVEL = 100,629，BayesDel = 71,928）中，约有 22% 的 ClinVar 错义变异具有不确定的意义，其预测结果处于趋势不一致区间。因此，全基因组校准可能导致许多变异获得不适当的证据强度。为了便于对 SVI 的校准进行审查，我们开发了一个网络应用程序，使基因特异性预测和趋势性一致与不一致区间的可视化成为可能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Calibration of variant effect predictors on genome-wide data masks heterogeneous performance across genes.

In silico variant effect predictions are available for nearly all missense variants but played a minimal role in clinical variant classification because they were deemed to provide only supporting evidence. Recently, the ClinGen Sequence Variant Interpretation (SVI) Working Group updated recommendations for variant effect prediction use. By analyzing control pathogenic and benign variants across all genes, they were able to compute evidence strength for predictor score intervals with some intervals generating moderate, strong, or even very strong evidence. However, this genome-wide approach could obscure heterogeneous predictor performance in different genes. We quantified the gene-by-gene performance of two top predictors, REVEL and BayesDel, by analyzing control variants in each predictor score interval in 3,668 disease-relevant genes. Approximately 10% of intervals had sufficient control variants for analysis, and ∼70% of these intervals exceeded the maximum number of incorrect predictions implied by the SVI recommendations. These trending discordant intervals arose owing to the divergence of the gene-specific distribution of predictions from the genome-wide distribution, suggesting that gene-specific calibration is needed in many cases. Approximately 22% of ClinVar missense variants of uncertain significance in genes we analyzed (REVEL = 100,629, BayesDel = 71,928) had predictions in trending discordant intervals. Thus, genome-wide calibrations could result in many variants receiving inappropriate evidence strength. To facilitate a review of the SVI's calibrations, we developed a web application enabling visualization of gene-specific predictions and trending concordant and discordant intervals.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

American journal of human genetics 生物-遗传学

CiteScore

14.70

自引率

4.10%

发文量

185

审稿时长

1 months

期刊介绍： The American Journal of Human Genetics (AJHG) is a monthly journal published by Cell Press, chosen by The American Society of Human Genetics (ASHG) as its premier publication starting from January 2008. AJHG represents Cell Press's first society-owned journal, and both ASHG and Cell Press anticipate significant synergies between AJHG content and that of other Cell Press titles.