Toward streamline variant classification: discrepancies in variant nomenclature and syntax for ClinVar pathogenic variants across annotation tools.

IF 4.3 3区医学 Q2 GENETICS & HEREDITY

Human Genomics Pub Date : 2025-06-21 DOI:10.1186/s40246-025-00778-x

Yu-An Chen, Tzu-Hang Yuan, Jia-Hsin Huang, Yu-Bin Wang, Tzu-Mao Hung, Chien-Yu Chen, Pei-Lung Chen, Jacob Shujui Hsu

{"title":"Toward streamline variant classification: discrepancies in variant nomenclature and syntax for ClinVar pathogenic variants across annotation tools.","authors":"Yu-An Chen, Tzu-Hang Yuan, Jia-Hsin Huang, Yu-Bin Wang, Tzu-Mao Hung, Chien-Yu Chen, Pei-Lung Chen, Jacob Shujui Hsu","doi":"10.1186/s40246-025-00778-x","DOIUrl":null,"url":null,"abstract":"Background: High-throughput sequencing has revolutionized genetic disorder diagnosis, but variant pathogenicity interpretation is still challenging. Even though the human genome variation society (HGVS) provides recommendations for variant nomenclature, discrepancies in annotation remain a significant hurdle.Results: In this study, we evaluated the annotation concordance between three tools-ANNOVAR, SnpEff, and variant effect predictor (VEP)-using 164,549 two-star variants from ClinVar. The analysis used HGVS nomenclature string-match comparisons to assess annotation consistency from each tool, corresponding coding impacts, and associated ACMG criteria inferred from the annotations. The analysis revealed variable concordance rates, with 58.52% agreement for HGVSc, 84.04% for HGVSp, and 85.58% for the coding impact. SnpEff showed the highest match for HGVSc (0.988), while VEP bettered for HGVSp (0.977). The substantial discrepancies were noted in the loss-of-function (LoF) category. Incorrect PVS1 interpretations affected the final pathogenicity and downgraded PLP variants (ANNOVAR 55.9%, SnpEff 66.5%, VEP 67.3%), risking false negatives of clinically relevant variants in reports.Conclusions: These findings highlight the critical challenges in accurately interpreting variant pathogenicity due to discrepancies in annotations. To enhance the reliability of genetic variant interpretation in clinical practice, standardizing transcript sets and systematically cross-validating results across multiple annotation tools is essential.","PeriodicalId":13183,"journal":{"name":"Human Genomics","volume":"19 1","pages":"70"},"PeriodicalIF":4.3000,"publicationDate":"2025-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12181866/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Human Genomics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s40246-025-00778-x","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}

引用次数: 0

Abstract

Background: High-throughput sequencing has revolutionized genetic disorder diagnosis, but variant pathogenicity interpretation is still challenging. Even though the human genome variation society (HGVS) provides recommendations for variant nomenclature, discrepancies in annotation remain a significant hurdle.

Results: In this study, we evaluated the annotation concordance between three tools-ANNOVAR, SnpEff, and variant effect predictor (VEP)-using 164,549 two-star variants from ClinVar. The analysis used HGVS nomenclature string-match comparisons to assess annotation consistency from each tool, corresponding coding impacts, and associated ACMG criteria inferred from the annotations. The analysis revealed variable concordance rates, with 58.52% agreement for HGVSc, 84.04% for HGVSp, and 85.58% for the coding impact. SnpEff showed the highest match for HGVSc (0.988), while VEP bettered for HGVSp (0.977). The substantial discrepancies were noted in the loss-of-function (LoF) category. Incorrect PVS1 interpretations affected the final pathogenicity and downgraded PLP variants (ANNOVAR 55.9%, SnpEff 66.5%, VEP 67.3%), risking false negatives of clinically relevant variants in reports.

Conclusions: These findings highlight the critical challenges in accurately interpreting variant pathogenicity due to discrepancies in annotations. To enhance the reliability of genetic variant interpretation in clinical practice, standardizing transcript sets and systematically cross-validating results across multiple annotation tools is essential.

查看原文本刊更多论文

简化变体分类：跨注释工具的ClinVar致病变体的变体命名和语法差异。

背景：高通量测序已经彻底改变了遗传疾病的诊断，但变异致病性的解释仍然具有挑战性。尽管人类基因组变异协会（HGVS）为变异命名提供了建议，但注释上的差异仍然是一个重大障碍。结果：在本研究中，我们使用来自ClinVar的164,549个二星变异，评估了annovar、SnpEff和变异效应预测器（VEP）这三个工具之间的注释一致性。该分析使用HGVS命名字符串匹配比较来评估每个工具的注释一致性、相应的编码影响以及从注释推断的相关ACMG标准。HGVSc的一致性为58.52%，HGVSp的一致性为84.04%，编码影响的一致性为85.58%。SnpEff对HGVSc的匹配度最高（0.988），而VEP对HGVSp的匹配度较好（0.977）。在功能丧失（LoF）类别中注意到实质性差异。不正确的PVS1解释影响了最终的致病性和PLP变异的降级（ANNOVAR为55.9%，SnpEff为66.5%，VEP为67.3%），有可能在报告中出现临床相关变异的假阴性。结论：这些发现突出了由于注释差异而准确解释变异致病性的关键挑战。为了提高临床实践中遗传变异解释的可靠性，标准化转录集和跨多个注释工具系统地交叉验证结果至关重要。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Human Genomics GENETICS & HEREDITY-

CiteScore

6.00

自引率

2.20%

发文量

审稿时长

11 weeks

期刊介绍： Human Genomics is a peer-reviewed, open access, online journal that focuses on the application of genomic analysis in all aspects of human health and disease, as well as genomic analysis of drug efficacy and safety, and comparative genomics. Topics covered by the journal include, but are not limited to: pharmacogenomics, genome-wide association studies, genome-wide sequencing, exome sequencing, next-generation deep-sequencing, functional genomics, epigenomics, translational genomics, expression profiling, proteomics, bioinformatics, animal models, statistical genetics, genetic epidemiology, human population genetics and comparative genomics.