Kate Sears, Caylin Hickey, Ryan Vincent, Jennifer Stocks-Candelaria, Jason Tate, Cody Bumgardner, Shulin Zhang, Justin B Miller
{"title":"Establishing a Variant Allele Frequency Cutoff for Manual Curation of Medical Exome Sequencing Data.","authors":"Kate Sears, Caylin Hickey, Ryan Vincent, Jennifer Stocks-Candelaria, Jason Tate, Cody Bumgardner, Shulin Zhang, Justin B Miller","doi":"10.1016/j.jmoldx.2024.09.006","DOIUrl":null,"url":null,"abstract":"<p><p>Medical exome sequencing pipelines consist of various preprocessing steps to prioritize credible causal variants before a pathologist or variant curation scientist manually interprets potential findings that are then reported to patients. The variant allele frequency (VAF), reported as the fraction of sequencing reads supporting a variant call, can be used to screen for technical artifacts, yet a specific filtering threshold has yet to be established. A total of 13,122 manually curated variants, sequenced from 289 patients using the Agilent SureSelect Focused Exome enrichment kit at the University of Kentucky Clinical Genomics laboratory from October 2019 to May 2023, were evaluated. Totals of 278 single-nucleotide polymorphisms (SNPs) and 3340 SNPs as technical artifacts are clinically reported. All reported variants had a VAF between 0.33 and 0.63, and 82% (2725/3340) of sequencing artifacts had a VAF of <0.33. It is proposed that removing SNPs in which the VAF is less than approximately 0.30 reduces manual curation time by approximately 20% while capturing all medically relevant variants in medical exome sequencing data sets.</p>","PeriodicalId":50128,"journal":{"name":"Journal of Molecular Diagnostics","volume":null,"pages":null},"PeriodicalIF":3.4000,"publicationDate":"2024-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Molecular Diagnostics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1016/j.jmoldx.2024.09.006","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PATHOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Medical exome sequencing pipelines consist of various preprocessing steps to prioritize credible causal variants before a pathologist or variant curation scientist manually interprets potential findings that are then reported to patients. The variant allele frequency (VAF), reported as the fraction of sequencing reads supporting a variant call, can be used to screen for technical artifacts, yet a specific filtering threshold has yet to be established. A total of 13,122 manually curated variants, sequenced from 289 patients using the Agilent SureSelect Focused Exome enrichment kit at the University of Kentucky Clinical Genomics laboratory from October 2019 to May 2023, were evaluated. Totals of 278 single-nucleotide polymorphisms (SNPs) and 3340 SNPs as technical artifacts are clinically reported. All reported variants had a VAF between 0.33 and 0.63, and 82% (2725/3340) of sequencing artifacts had a VAF of <0.33. It is proposed that removing SNPs in which the VAF is less than approximately 0.30 reduces manual curation time by approximately 20% while capturing all medically relevant variants in medical exome sequencing data sets.
期刊介绍:
The Journal of Molecular Diagnostics, the official publication of the Association for Molecular Pathology (AMP), co-owned by the American Society for Investigative Pathology (ASIP), seeks to publish high quality original papers on scientific advances in the translation and validation of molecular discoveries in medicine into the clinical diagnostic setting, and the description and application of technological advances in the field of molecular diagnostic medicine. The editors welcome for review articles that contain: novel discoveries or clinicopathologic correlations including studies in oncology, infectious diseases, inherited diseases, predisposition to disease, clinical informatics, or the description of polymorphisms linked to disease states or normal variations; the application of diagnostic methodologies in clinical trials; or the development of new or improved molecular methods which may be applied to diagnosis or monitoring of disease or disease predisposition.