Mostafa Y Abdel-Glil, Christian Brandt, Mathias W Pletz, Heinrich Neubauer, Lisa D Sprague
{"title":"High intra-laboratory reproducibility of nanopore sequencing in bacterial species underscores advances in its accuracy.","authors":"Mostafa Y Abdel-Glil, Christian Brandt, Mathias W Pletz, Heinrich Neubauer, Lisa D Sprague","doi":"10.1099/mgen.0.001372","DOIUrl":null,"url":null,"abstract":"<p><p>Nanopore sequencing is a third-generation technology known for its portability, real-time analysis and ability to generate long reads. It has great potential for use in clinical diagnostics, but thorough validation is required to address accuracy concerns and ensure reliable and reproducible results. In this study, we automated an open-source workflow (freely available at https://gitlab.com/FLI_Bioinfo/nanobacta) for the assembly of Oxford Nanopore sequencing data and used it to investigate the reproducibility of assembly results under consistent conditions. We used a benchmark dataset of five bacterial reference strains and generated eight technical sequencing replicates of the same DNA using the Ligation and Rapid Barcoding kits together with the Flongle and MinION flow cells. We assessed reproducibility by measuring discrepancies such as substitution and insertion/deletion errors, analysing plasmid recovery results and examining genetic markers and clustering information. We compared the results of genome assemblies with and without short-read polishing. Our results show an average reproducibility accuracy of 99.999955% for nanopore-only assemblies and 99.999996% when the short reads were used for polishing. The genomic analysis results were highly reproducible for the nanopore-only assemblies without short read in the following areas: identification of genetic markers for antimicrobial resistance and virulence, classical MLST, taxonomic classification, genome completeness and contamination analysis. Interestingly, the clustering information results from the core genome SNP and core genome MLST analyses were also highly reproducible for the nanopore-only assemblies, with pairwise differences of up to two allele differences in core genome MLST and two SNPs in core genome SNP across replicates. After polishing the assemblies with short reads, the pairwise differences for cgMLST were 0 and for cgSNP were 0-1 SNP across replicates. These results highlight the advances in sequencing accuracy of nanopore data without the use of short reads.</p>","PeriodicalId":18487,"journal":{"name":"Microbial Genomics","volume":"11 3","pages":""},"PeriodicalIF":4.0000,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11927881/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Microbial Genomics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1099/mgen.0.001372","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0
摘要
纳米孔测序是第三代技术,以其便携性、实时分析和生成长读数的能力而著称。它在临床诊断中的应用潜力巨大,但需要进行全面的验证,以解决准确性问题,确保结果可靠且可重复。在本研究中,我们将牛津纳米孔测序数据组装的开源工作流程(可在 https://gitlab.com/FLI_Bioinfo/nanobacta 免费获取)自动化,并用它来研究在一致条件下组装结果的可重复性。我们使用了由五个细菌参考菌株组成的基准数据集,并使用连接和快速条形码试剂盒以及 Flongle 和 MinION 流式细胞对相同的 DNA 进行了八次技术测序重复。我们通过测量替换和插入/删除错误等差异、分析质粒回收结果以及检查遗传标记和聚类信息来评估重现性。我们比较了使用和不使用短读数抛光的基因组组装结果。结果表明,纯纳米孔组装的平均重现准确率为 99.999955%,而使用短读数抛光后的重现准确率为 99.999996%。在以下方面,不使用短读的纯纳米孔组装的基因组分析结果具有很高的可重复性:抗菌性和毒力遗传标记的鉴定、经典 MLST、分类学分类、基因组完整性和污染分析。有趣的是,核心基因组 SNP 和核心基因组 MLST 分析的聚类信息结果在纯纳米孔组装中也具有很高的重现性,核心基因组 MLST 的成对差异可达两个等位基因,核心基因组 SNP 的成对差异可达两个 SNP。用短读码对组装结果进行打磨后,cgMLST 和 cgSNP 在不同重复序列中的配对差异分别为 0 和 0-1 SNP。这些结果突显了在不使用短读数的情况下纳米孔数据测序准确性的进步。
High intra-laboratory reproducibility of nanopore sequencing in bacterial species underscores advances in its accuracy.
Nanopore sequencing is a third-generation technology known for its portability, real-time analysis and ability to generate long reads. It has great potential for use in clinical diagnostics, but thorough validation is required to address accuracy concerns and ensure reliable and reproducible results. In this study, we automated an open-source workflow (freely available at https://gitlab.com/FLI_Bioinfo/nanobacta) for the assembly of Oxford Nanopore sequencing data and used it to investigate the reproducibility of assembly results under consistent conditions. We used a benchmark dataset of five bacterial reference strains and generated eight technical sequencing replicates of the same DNA using the Ligation and Rapid Barcoding kits together with the Flongle and MinION flow cells. We assessed reproducibility by measuring discrepancies such as substitution and insertion/deletion errors, analysing plasmid recovery results and examining genetic markers and clustering information. We compared the results of genome assemblies with and without short-read polishing. Our results show an average reproducibility accuracy of 99.999955% for nanopore-only assemblies and 99.999996% when the short reads were used for polishing. The genomic analysis results were highly reproducible for the nanopore-only assemblies without short read in the following areas: identification of genetic markers for antimicrobial resistance and virulence, classical MLST, taxonomic classification, genome completeness and contamination analysis. Interestingly, the clustering information results from the core genome SNP and core genome MLST analyses were also highly reproducible for the nanopore-only assemblies, with pairwise differences of up to two allele differences in core genome MLST and two SNPs in core genome SNP across replicates. After polishing the assemblies with short reads, the pairwise differences for cgMLST were 0 and for cgSNP were 0-1 SNP across replicates. These results highlight the advances in sequencing accuracy of nanopore data without the use of short reads.
期刊介绍:
Microbial Genomics (MGen) is a fully open access, mandatory open data and peer-reviewed journal publishing high-profile original research on archaea, bacteria, microbial eukaryotes and viruses.