IF 4 2区 生物学 Q1 GENETICS & HEREDITY
Mostafa Y Abdel-Glil, Christian Brandt, Mathias W Pletz, Heinrich Neubauer, Lisa D Sprague
{"title":"High intra-laboratory reproducibility of nanopore sequencing in bacterial species underscores advances in its accuracy.","authors":"Mostafa Y Abdel-Glil, Christian Brandt, Mathias W Pletz, Heinrich Neubauer, Lisa D Sprague","doi":"10.1099/mgen.0.001372","DOIUrl":null,"url":null,"abstract":"<p><p>Nanopore sequencing is a third-generation technology known for its portability, real-time analysis and ability to generate long reads. It has great potential for use in clinical diagnostics, but thorough validation is required to address accuracy concerns and ensure reliable and reproducible results. In this study, we automated an open-source workflow (freely available at https://gitlab.com/FLI_Bioinfo/nanobacta) for the assembly of Oxford Nanopore sequencing data and used it to investigate the reproducibility of assembly results under consistent conditions. We used a benchmark dataset of five bacterial reference strains and generated eight technical sequencing replicates of the same DNA using the Ligation and Rapid Barcoding kits together with the Flongle and MinION flow cells. We assessed reproducibility by measuring discrepancies such as substitution and insertion/deletion errors, analysing plasmid recovery results and examining genetic markers and clustering information. We compared the results of genome assemblies with and without short-read polishing. Our results show an average reproducibility accuracy of 99.999955% for nanopore-only assemblies and 99.999996% when the short reads were used for polishing. The genomic analysis results were highly reproducible for the nanopore-only assemblies without short read in the following areas: identification of genetic markers for antimicrobial resistance and virulence, classical MLST, taxonomic classification, genome completeness and contamination analysis. Interestingly, the clustering information results from the core genome SNP and core genome MLST analyses were also highly reproducible for the nanopore-only assemblies, with pairwise differences of up to two allele differences in core genome MLST and two SNPs in core genome SNP across replicates. After polishing the assemblies with short reads, the pairwise differences for cgMLST were 0 and for cgSNP were 0-1 SNP across replicates. These results highlight the advances in sequencing accuracy of nanopore data without the use of short reads.</p>","PeriodicalId":18487,"journal":{"name":"Microbial Genomics","volume":"11 3","pages":""},"PeriodicalIF":4.0000,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11927881/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Microbial Genomics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1099/mgen.0.001372","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0

摘要

纳米孔测序是第三代技术,以其便携性、实时分析和生成长读数的能力而著称。它在临床诊断中的应用潜力巨大,但需要进行全面的验证,以解决准确性问题,确保结果可靠且可重复。在本研究中,我们将牛津纳米孔测序数据组装的开源工作流程(可在 https://gitlab.com/FLI_Bioinfo/nanobacta 免费获取)自动化,并用它来研究在一致条件下组装结果的可重复性。我们使用了由五个细菌参考菌株组成的基准数据集,并使用连接和快速条形码试剂盒以及 Flongle 和 MinION 流式细胞对相同的 DNA 进行了八次技术测序重复。我们通过测量替换和插入/删除错误等差异、分析质粒回收结果以及检查遗传标记和聚类信息来评估重现性。我们比较了使用和不使用短读数抛光的基因组组装结果。结果表明,纯纳米孔组装的平均重现准确率为 99.999955%,而使用短读数抛光后的重现准确率为 99.999996%。在以下方面,不使用短读的纯纳米孔组装的基因组分析结果具有很高的可重复性:抗菌性和毒力遗传标记的鉴定、经典 MLST、分类学分类、基因组完整性和污染分析。有趣的是,核心基因组 SNP 和核心基因组 MLST 分析的聚类信息结果在纯纳米孔组装中也具有很高的重现性,核心基因组 MLST 的成对差异可达两个等位基因,核心基因组 SNP 的成对差异可达两个 SNP。用短读码对组装结果进行打磨后,cgMLST 和 cgSNP 在不同重复序列中的配对差异分别为 0 和 0-1 SNP。这些结果突显了在不使用短读数的情况下纳米孔数据测序准确性的进步。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
High intra-laboratory reproducibility of nanopore sequencing in bacterial species underscores advances in its accuracy.

Nanopore sequencing is a third-generation technology known for its portability, real-time analysis and ability to generate long reads. It has great potential for use in clinical diagnostics, but thorough validation is required to address accuracy concerns and ensure reliable and reproducible results. In this study, we automated an open-source workflow (freely available at https://gitlab.com/FLI_Bioinfo/nanobacta) for the assembly of Oxford Nanopore sequencing data and used it to investigate the reproducibility of assembly results under consistent conditions. We used a benchmark dataset of five bacterial reference strains and generated eight technical sequencing replicates of the same DNA using the Ligation and Rapid Barcoding kits together with the Flongle and MinION flow cells. We assessed reproducibility by measuring discrepancies such as substitution and insertion/deletion errors, analysing plasmid recovery results and examining genetic markers and clustering information. We compared the results of genome assemblies with and without short-read polishing. Our results show an average reproducibility accuracy of 99.999955% for nanopore-only assemblies and 99.999996% when the short reads were used for polishing. The genomic analysis results were highly reproducible for the nanopore-only assemblies without short read in the following areas: identification of genetic markers for antimicrobial resistance and virulence, classical MLST, taxonomic classification, genome completeness and contamination analysis. Interestingly, the clustering information results from the core genome SNP and core genome MLST analyses were also highly reproducible for the nanopore-only assemblies, with pairwise differences of up to two allele differences in core genome MLST and two SNPs in core genome SNP across replicates. After polishing the assemblies with short reads, the pairwise differences for cgMLST were 0 and for cgSNP were 0-1 SNP across replicates. These results highlight the advances in sequencing accuracy of nanopore data without the use of short reads.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Microbial Genomics
Microbial Genomics Medicine-Epidemiology
CiteScore
6.60
自引率
2.60%
发文量
153
审稿时长
12 weeks
期刊介绍: Microbial Genomics (MGen) is a fully open access, mandatory open data and peer-reviewed journal publishing high-profile original research on archaea, bacteria, microbial eukaryotes and viruses.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信