基于65个变异效应预测指标的变异临床解释的见解

IF 3 2区 生物学 Q2 BIOTECHNOLOGY & APPLIED MICROBIOLOGY
Ragousandirane Radjasandirane, Julien Diharce, Jean-Christophe Gelly , Alexandre G. de Brevern
{"title":"基于65个变异效应预测指标的变异临床解释的见解","authors":"Ragousandirane Radjasandirane,&nbsp;Julien Diharce,&nbsp;Jean-Christophe Gelly ,&nbsp;Alexandre G. de Brevern","doi":"10.1016/j.ygeno.2025.111036","DOIUrl":null,"url":null,"abstract":"<div><div>Single amino acid substitutions in protein sequences are generally harmless, but a certain number of these changes can lead to disease. Accurately predicting the effect of genetic variants is crucial for clinicians as it accelerates the diagnosis of patients with missense variants associated with health problems. Many computational tools have been developed to predict the pathogenicity of genetic variants with various approaches. Analysing the performance of these different computational tools is crucial to provide guidance to both future users and especially clinicians. In this study, a large-scale investigation of 65 tools was conducted. Variants from both clinical and functional contexts were used, incorporating data from the ClinVar database and bibliographic sources. The analysis showed that AlphaMissense often performed very well and was in fact one of the best options among the existing tools. In addition, as expected, meta-predictors perform well on average. Tools using evolutionary information showed the best performance for functional variants. These results also highlighted some heterogeneity in the difficulty of predicting some specific variants while others are always well categorized. Strikingly, the majority of variants from the ClinVar database appear to be easy to predict, while variants from other sources of data are more challenging. This raises questions about the use of ClinVar and the dataset used to validate tools accuracy. In addition, these results show that this variant predictability can be divided into three distinct classes: easy, moderate and hard to predict. We analyzed the parameters leading to these differences and showed that the classes are related to structural and functional information.</div></div>","PeriodicalId":12521,"journal":{"name":"Genomics","volume":"117 3","pages":"Article 111036"},"PeriodicalIF":3.0000,"publicationDate":"2025-03-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Insights for variant clinical interpretation based on a benchmark of 65 variant effect predictors\",\"authors\":\"Ragousandirane Radjasandirane,&nbsp;Julien Diharce,&nbsp;Jean-Christophe Gelly ,&nbsp;Alexandre G. de Brevern\",\"doi\":\"10.1016/j.ygeno.2025.111036\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Single amino acid substitutions in protein sequences are generally harmless, but a certain number of these changes can lead to disease. Accurately predicting the effect of genetic variants is crucial for clinicians as it accelerates the diagnosis of patients with missense variants associated with health problems. Many computational tools have been developed to predict the pathogenicity of genetic variants with various approaches. Analysing the performance of these different computational tools is crucial to provide guidance to both future users and especially clinicians. In this study, a large-scale investigation of 65 tools was conducted. Variants from both clinical and functional contexts were used, incorporating data from the ClinVar database and bibliographic sources. The analysis showed that AlphaMissense often performed very well and was in fact one of the best options among the existing tools. In addition, as expected, meta-predictors perform well on average. Tools using evolutionary information showed the best performance for functional variants. These results also highlighted some heterogeneity in the difficulty of predicting some specific variants while others are always well categorized. Strikingly, the majority of variants from the ClinVar database appear to be easy to predict, while variants from other sources of data are more challenging. This raises questions about the use of ClinVar and the dataset used to validate tools accuracy. In addition, these results show that this variant predictability can be divided into three distinct classes: easy, moderate and hard to predict. We analyzed the parameters leading to these differences and showed that the classes are related to structural and functional information.</div></div>\",\"PeriodicalId\":12521,\"journal\":{\"name\":\"Genomics\",\"volume\":\"117 3\",\"pages\":\"Article 111036\"},\"PeriodicalIF\":3.0000,\"publicationDate\":\"2025-03-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Genomics\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0888754325000527\",\"RegionNum\":2,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"BIOTECHNOLOGY & APPLIED MICROBIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Genomics","FirstCategoryId":"99","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0888754325000527","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOTECHNOLOGY & APPLIED MICROBIOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

蛋白质序列中单个氨基酸的替换通常是无害的,但一定数量的这些变化可能导致疾病。准确预测基因变异的影响对临床医生来说至关重要,因为它可以加速诊断与健康问题相关的错义变异患者。许多计算工具已经被开发出来,用各种方法来预测遗传变异的致病性。分析这些不同计算工具的性能对于为未来的用户尤其是临床医生提供指导至关重要。在本研究中,对65种工具进行了大规模调查。使用临床和功能背景的变体,结合ClinVar数据库和书目来源的数据。分析表明,AlphaMissense通常表现非常好,实际上是现有工具中最好的选择之一。此外,正如预期的那样,元预测器平均表现良好。使用进化信息的工具在功能变异方面表现最佳。这些结果也强调了在预测某些特定变异的困难方面的一些异质性,而其他变异总是被很好地分类。引人注目的是,ClinVar数据库中的大多数变异似乎很容易预测,而来自其他数据来源的变异则更具挑战性。这就提出了关于ClinVar的使用和用于验证工具准确性的数据集的问题。此外,这些结果表明,这种变异可预测性可分为容易预测、中等预测和难以预测三个不同的类别。我们分析了导致这些差异的参数,并表明这些类与结构和功能信息有关。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Insights for variant clinical interpretation based on a benchmark of 65 variant effect predictors
Single amino acid substitutions in protein sequences are generally harmless, but a certain number of these changes can lead to disease. Accurately predicting the effect of genetic variants is crucial for clinicians as it accelerates the diagnosis of patients with missense variants associated with health problems. Many computational tools have been developed to predict the pathogenicity of genetic variants with various approaches. Analysing the performance of these different computational tools is crucial to provide guidance to both future users and especially clinicians. In this study, a large-scale investigation of 65 tools was conducted. Variants from both clinical and functional contexts were used, incorporating data from the ClinVar database and bibliographic sources. The analysis showed that AlphaMissense often performed very well and was in fact one of the best options among the existing tools. In addition, as expected, meta-predictors perform well on average. Tools using evolutionary information showed the best performance for functional variants. These results also highlighted some heterogeneity in the difficulty of predicting some specific variants while others are always well categorized. Strikingly, the majority of variants from the ClinVar database appear to be easy to predict, while variants from other sources of data are more challenging. This raises questions about the use of ClinVar and the dataset used to validate tools accuracy. In addition, these results show that this variant predictability can be divided into three distinct classes: easy, moderate and hard to predict. We analyzed the parameters leading to these differences and showed that the classes are related to structural and functional information.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Genomics
Genomics 生物-生物工程与应用微生物
CiteScore
9.60
自引率
2.30%
发文量
260
审稿时长
60 days
期刊介绍: Genomics is a forum for describing the development of genome-scale technologies and their application to all areas of biological investigation. As a journal that has evolved with the field that carries its name, Genomics focuses on the development and application of cutting-edge methods, addressing fundamental questions with potential interest to a wide audience. Our aim is to publish the highest quality research and to provide authors with rapid, fair and accurate review and publication of manuscripts falling within our scope.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信