Anastasia Bratulin , Wesley A. Goar , Kori Kuzma , Lawrence Babb , Kyle Ferriter , Terry O'Neill , Austin A. Antoniou , Jeremy A. Arbesfeld , Daniel Puthawala , James S. Stevenson , Jiachen Liu , Xuelu Liu , Brian Walsh , William C. Ray , Savanna Funk , Bimal P. Chaudhari , Heidi L. Rehm
{"title":"34. Identifying challenges in variant normalization","authors":"Anastasia Bratulin , Wesley A. Goar , Kori Kuzma , Lawrence Babb , Kyle Ferriter , Terry O'Neill , Austin A. Antoniou , Jeremy A. Arbesfeld , Daniel Puthawala , James S. Stevenson , Jiachen Liu , Xuelu Liu , Brian Walsh , William C. Ray , Savanna Funk , Bimal P. Chaudhari , Heidi L. Rehm","doi":"10.1016/j.cancergen.2024.08.036","DOIUrl":null,"url":null,"abstract":"<div><div>Genomic medicine relies on collecting information from multiple sources to make optimal therapeutic and diagnostic decisions for the patient. However, integration of this information, at the time of variant interpretation, is a major bottleneck. Challenges including inconsistent formats (e.g. HGVS and SPDI), and variant contexts (genome and proteome), increase the time and effort required to formulate and communicate a complete variant interpretation. To more clearly understand inter-resource differences in variant representation, variants from the Clinical Interpretations of Variants in Cancer (CIViC), Molecular Oncology Almanac (MOAlmanac), and ClinVar were evaluated using a normalization protocol. Of the variants in the knowledgebases, ∼53% of the CIViC, ∼42% of the MOAlmanac, and ∼99% of ClinVar variants were successfully normalized. We categorically assessed remaining variants for which normalization methods are still needed, and analyzed these for clinical impact. Gene fusion (e.g. 'ALK G1202R') and region-defined variant categories (e.g. '3′ UTR Mutations') respectively constitute 16% and 10% of all the variants that were not normalized, and were found to hold the greatest potential clinical impact. Additionally, fusion variants are responsible for ∼25% of all of the evidence items associated with not normalized variants from CIViC and MOAlmanac, illustrating the weight that fusion variants carry in the overall group. The Variation Normalizer is an open-source toolkit and is available for use with independent data sets to facilitate precise matching of evidence from knowledgebases to genomic variant data.</div></div>","PeriodicalId":49225,"journal":{"name":"Cancer Genetics","volume":"286 ","pages":"Page S11"},"PeriodicalIF":1.4000,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cancer Genetics","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2210776224000747","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0
Abstract
Genomic medicine relies on collecting information from multiple sources to make optimal therapeutic and diagnostic decisions for the patient. However, integration of this information, at the time of variant interpretation, is a major bottleneck. Challenges including inconsistent formats (e.g. HGVS and SPDI), and variant contexts (genome and proteome), increase the time and effort required to formulate and communicate a complete variant interpretation. To more clearly understand inter-resource differences in variant representation, variants from the Clinical Interpretations of Variants in Cancer (CIViC), Molecular Oncology Almanac (MOAlmanac), and ClinVar were evaluated using a normalization protocol. Of the variants in the knowledgebases, ∼53% of the CIViC, ∼42% of the MOAlmanac, and ∼99% of ClinVar variants were successfully normalized. We categorically assessed remaining variants for which normalization methods are still needed, and analyzed these for clinical impact. Gene fusion (e.g. 'ALK G1202R') and region-defined variant categories (e.g. '3′ UTR Mutations') respectively constitute 16% and 10% of all the variants that were not normalized, and were found to hold the greatest potential clinical impact. Additionally, fusion variants are responsible for ∼25% of all of the evidence items associated with not normalized variants from CIViC and MOAlmanac, illustrating the weight that fusion variants carry in the overall group. The Variation Normalizer is an open-source toolkit and is available for use with independent data sets to facilitate precise matching of evidence from knowledgebases to genomic variant data.
期刊介绍:
The aim of Cancer Genetics is to publish high quality scientific papers on the cellular, genetic and molecular aspects of cancer, including cancer predisposition and clinical diagnostic applications. Specific areas of interest include descriptions of new chromosomal, molecular or epigenetic alterations in benign and malignant diseases; novel laboratory approaches for identification and characterization of chromosomal rearrangements or genomic alterations in cancer cells; correlation of genetic changes with pathology and clinical presentation; and the molecular genetics of cancer predisposition. To reach a basic science and clinical multidisciplinary audience, we welcome original full-length articles, reviews, meeting summaries, brief reports, and letters to the editor.