{"title":"Chasing Sequencing Perfection: Marching Toward Higher Accuracy and Lower Costs","authors":"Hangxing Jia, Shengjun Tan, Yong E. Zhang","doi":"10.1093/gpbjnl/qzae024","DOIUrl":null,"url":null,"abstract":"\n Next-generation sequencing (NGS), represented by Illumina platforms, has been an essential cornerstone of basic and applied research. However, the sequencing error rate of 1 per 1000 base pairs (10−3) represents a serious hurdle for research areas focusing on rare mutations, such as somatic mosaicism or microbe heterogeneity. By examining the high-fidelity sequencing methods developed in the past decade, we summarized three major factors underlying errors and the corresponding 12 strategies mitigating these errors. We then proposed a novel framework to classify 11 preexisting representative methods according to the corresponding combinatory strategies and identified 3 trends that emerged during methodological developments. We further extended this analysis to 8 long-read sequencing methods, emphasizing error reduction strategies. Finally, we suggest 2 promising future directions that could achieve comparable or even higher accuracy with lower costs in both NGS and long-read sequencing.","PeriodicalId":170516,"journal":{"name":"Genomics, Proteomics & Bioinformatics","volume":"1 3","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Genomics, Proteomics & Bioinformatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/gpbjnl/qzae024","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Next-generation sequencing (NGS), represented by Illumina platforms, has been an essential cornerstone of basic and applied research. However, the sequencing error rate of 1 per 1000 base pairs (10−3) represents a serious hurdle for research areas focusing on rare mutations, such as somatic mosaicism or microbe heterogeneity. By examining the high-fidelity sequencing methods developed in the past decade, we summarized three major factors underlying errors and the corresponding 12 strategies mitigating these errors. We then proposed a novel framework to classify 11 preexisting representative methods according to the corresponding combinatory strategies and identified 3 trends that emerged during methodological developments. We further extended this analysis to 8 long-read sequencing methods, emphasizing error reduction strategies. Finally, we suggest 2 promising future directions that could achieve comparable or even higher accuracy with lower costs in both NGS and long-read sequencing.