{"title":"ONT R10.4.1长读测序对三种主要呼吸道细菌病原体的准确基因分型","authors":"Nora Zidane, Carla Rodrigues, Valerie Bouchez, Martin Rethoret-Pasty, Virginie Passet, Sylvain Brisse, Chiara Crestani","doi":"10.1101/gr.279829.124","DOIUrl":null,"url":null,"abstract":"High-throughput massive parallel sequencing has significantly improved bacterial pathogen genomics, diagnostics, and epidemiology. Despite its high accuracy, short-read sequencing struggles with complete genome reconstruction and assembly of extrachromosomal elements such as plasmids. Long-read sequencing with Oxford Nanopore Technologies (ONT) presents an alternative that offers benefits including real-time sequencing and cost-efficiency, particularly useful in resource-limited settings. However, the historically higher error rates of ONT data have so far limited its application in high-precision genomic typing. The recent release of ONT's R10.4.1 chemistry, with significantly improved raw read accuracy (Q20+), offers a potential solution to this problem. The aim of this study was to evaluate the performance of ONT's latest chemistry for bacterial genomic typing against the gold standard Illumina technology, focusing on three respiratory pathogens of public health importance, <em>Klebsiella pneumoniae</em>, <em>Bordetella pertussis</em>, and <em>Corynebacterium diphtheriae</em>, and their related species. Using the Rapid Barcoding Kit V14, we generated and analyzed genome assemblies with different basecalling models, at different simulated depths of coverage. ONT assemblies were compared to the Illumina reference for completeness and core genome multilocus sequence typing (cgMLST) accuracy (number of allelic mismatches). Our results show that genomes obtained from raw ONT data basecalled with Dorado SUP v0.9.0, assembled with Flye, and with a minimum coverage depth of 35×, optimized accuracy for all bacterial species tested. Error rates were consistently below 0.5% for each cgMLST scheme, indicating that ONT R10.4.1 data is suitable for high-resolution genomic typing applied to outbreak investigations and public health surveillance.","PeriodicalId":12678,"journal":{"name":"Genome research","volume":"136 1","pages":"gr.279829.124"},"PeriodicalIF":5.5000,"publicationDate":"2025-06-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Accurate genotyping of three major respiratory bacterial pathogens with ONT R10.4.1 long-read sequencing\",\"authors\":\"Nora Zidane, Carla Rodrigues, Valerie Bouchez, Martin Rethoret-Pasty, Virginie Passet, Sylvain Brisse, Chiara Crestani\",\"doi\":\"10.1101/gr.279829.124\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"High-throughput massive parallel sequencing has significantly improved bacterial pathogen genomics, diagnostics, and epidemiology. Despite its high accuracy, short-read sequencing struggles with complete genome reconstruction and assembly of extrachromosomal elements such as plasmids. Long-read sequencing with Oxford Nanopore Technologies (ONT) presents an alternative that offers benefits including real-time sequencing and cost-efficiency, particularly useful in resource-limited settings. However, the historically higher error rates of ONT data have so far limited its application in high-precision genomic typing. The recent release of ONT's R10.4.1 chemistry, with significantly improved raw read accuracy (Q20+), offers a potential solution to this problem. The aim of this study was to evaluate the performance of ONT's latest chemistry for bacterial genomic typing against the gold standard Illumina technology, focusing on three respiratory pathogens of public health importance, <em>Klebsiella pneumoniae</em>, <em>Bordetella pertussis</em>, and <em>Corynebacterium diphtheriae</em>, and their related species. Using the Rapid Barcoding Kit V14, we generated and analyzed genome assemblies with different basecalling models, at different simulated depths of coverage. ONT assemblies were compared to the Illumina reference for completeness and core genome multilocus sequence typing (cgMLST) accuracy (number of allelic mismatches). Our results show that genomes obtained from raw ONT data basecalled with Dorado SUP v0.9.0, assembled with Flye, and with a minimum coverage depth of 35×, optimized accuracy for all bacterial species tested. Error rates were consistently below 0.5% for each cgMLST scheme, indicating that ONT R10.4.1 data is suitable for high-resolution genomic typing applied to outbreak investigations and public health surveillance.\",\"PeriodicalId\":12678,\"journal\":{\"name\":\"Genome research\",\"volume\":\"136 1\",\"pages\":\"gr.279829.124\"},\"PeriodicalIF\":5.5000,\"publicationDate\":\"2025-06-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Genome research\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1101/gr.279829.124\",\"RegionNum\":2,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"BIOCHEMISTRY & MOLECULAR BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Genome research","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1101/gr.279829.124","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
Accurate genotyping of three major respiratory bacterial pathogens with ONT R10.4.1 long-read sequencing
High-throughput massive parallel sequencing has significantly improved bacterial pathogen genomics, diagnostics, and epidemiology. Despite its high accuracy, short-read sequencing struggles with complete genome reconstruction and assembly of extrachromosomal elements such as plasmids. Long-read sequencing with Oxford Nanopore Technologies (ONT) presents an alternative that offers benefits including real-time sequencing and cost-efficiency, particularly useful in resource-limited settings. However, the historically higher error rates of ONT data have so far limited its application in high-precision genomic typing. The recent release of ONT's R10.4.1 chemistry, with significantly improved raw read accuracy (Q20+), offers a potential solution to this problem. The aim of this study was to evaluate the performance of ONT's latest chemistry for bacterial genomic typing against the gold standard Illumina technology, focusing on three respiratory pathogens of public health importance, Klebsiella pneumoniae, Bordetella pertussis, and Corynebacterium diphtheriae, and their related species. Using the Rapid Barcoding Kit V14, we generated and analyzed genome assemblies with different basecalling models, at different simulated depths of coverage. ONT assemblies were compared to the Illumina reference for completeness and core genome multilocus sequence typing (cgMLST) accuracy (number of allelic mismatches). Our results show that genomes obtained from raw ONT data basecalled with Dorado SUP v0.9.0, assembled with Flye, and with a minimum coverage depth of 35×, optimized accuracy for all bacterial species tested. Error rates were consistently below 0.5% for each cgMLST scheme, indicating that ONT R10.4.1 data is suitable for high-resolution genomic typing applied to outbreak investigations and public health surveillance.
期刊介绍:
Launched in 1995, Genome Research is an international, continuously published, peer-reviewed journal that focuses on research that provides novel insights into the genome biology of all organisms, including advances in genomic medicine.
Among the topics considered by the journal are genome structure and function, comparative genomics, molecular evolution, genome-scale quantitative and population genetics, proteomics, epigenomics, and systems biology. The journal also features exciting gene discoveries and reports of cutting-edge computational biology and high-throughput methodologies.
New data in these areas are published as research papers, or methods and resource reports that provide novel information on technologies or tools that will be of interest to a broad readership. Complete data sets are presented electronically on the journal''s web site where appropriate. The journal also provides Reviews, Perspectives, and Insight/Outlook articles, which present commentary on the latest advances published both here and elsewhere, placing such progress in its broader biological context.