Samuel T. Horsfield, Basil C.T. Fok, Yuhan Fu, Paul Turner, John A. Lees, Nicholas J. Croucher
{"title":"Optimizing nanopore adaptive sampling for pneumococcal serotype surveillance in complex samples using the graph-based GNASTy algorithm","authors":"Samuel T. Horsfield, Basil C.T. Fok, Yuhan Fu, Paul Turner, John A. Lees, Nicholas J. Croucher","doi":"10.1101/gr.279435.124","DOIUrl":null,"url":null,"abstract":"Serotype surveillance of <em>Streptococcus pneumoniae</em> (the pneumococcus) is critical for understanding the effectiveness of current vaccination strategies. However, existing methods for serotyping are limited in their ability to identify the co-carriage of multiple pneumococci and detect novel serotypes. To develop a scalable and portable serotyping method that overcomes these challenges, we employed Nanopore Adaptive Sampling (NAS), an on-sequencer enrichment method that selects for target DNA in real-time, for direct detection of <em>S. pneumoniae</em> in complex samples. Whereas NAS targeting the whole <em>S. pneumoniae</em> genome was ineffective in the presence of nonpathogenic streptococci, the method was both specific and sensitive when targeting the capsular biosynthetic locus (CBL), the operon that determines <em>S. pneumoniae</em> serotype. NAS significantly improved coverage and yield of the CBL relative to sequencing without NAS, and accurately quantified the relative prevalence of serotypes in samples representing co-carriage. To maximize the sensitivity of NAS to detect novel serotypes, we developed and benchmarked a new pangenome-graph algorithm, named GNASTy. We show that GNASTy outperforms the current NAS implementation, which is based on linear genome alignment, when a sample contains a serotype absent from the database of targeted sequences. The methods developed in this work provide an improved approach for novel serotype discovery and routine <em>S. pneumoniae</em> surveillance that is fast, accurate and feasible in low-resource settings. Although NAS facilitates whole-genome enrichment under ideal circumstances, GNASTy enables targeted enrichment to optimize serotype surveillance in complex samples.","PeriodicalId":12678,"journal":{"name":"Genome research","volume":"28 1","pages":""},"PeriodicalIF":6.2000,"publicationDate":"2025-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Genome research","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1101/gr.279435.124","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0
摘要
肺炎链球菌血清型监测对于了解当前疫苗接种策略的有效性至关重要。然而,现有的血清分型方法在识别多种肺炎球菌共同携带和检测新型血清型方面能力有限。为了开发一种可扩展的便携式血清分型方法来克服这些挑战,我们采用了纳米孔自适应采样(NAS)技术,这是一种实时选择目标DNA的序列富集方法,可直接检测复杂样本中的肺炎链球菌。针对肺炎双球菌全基因组的 NAS 在非致病性链球菌存在的情况下无效,而针对决定肺炎双球菌血清型的操作子囊生物合成位点(CBL)的方法既特异又灵敏。与不使用 NAS 的测序方法相比,NAS 大大提高了 CBL 的覆盖率和检出率,并能准确量化共同携带样本中血清型的相对流行率。为了最大限度地提高NAS检测新型血清型的灵敏度,我们开发了一种新的泛基因组图算法,命名为GNASTy,并进行了基准测试。我们的研究表明,当样本中含有目标序列数据库中没有的血清型时,GNASTy 优于目前基于线性基因组比对的 NAS 实现。这项工作中开发的方法为新型血清型的发现和肺炎双球菌的常规监测提供了一种改进的方法,这种方法快速、准确,在资源匮乏的环境中也是可行的。虽然在理想情况下 NAS 可促进全基因组富集,但 GNASTy 可实现有针对性的富集,以优化复杂样本中的血清型监测。
Optimizing nanopore adaptive sampling for pneumococcal serotype surveillance in complex samples using the graph-based GNASTy algorithm
Serotype surveillance of Streptococcus pneumoniae (the pneumococcus) is critical for understanding the effectiveness of current vaccination strategies. However, existing methods for serotyping are limited in their ability to identify the co-carriage of multiple pneumococci and detect novel serotypes. To develop a scalable and portable serotyping method that overcomes these challenges, we employed Nanopore Adaptive Sampling (NAS), an on-sequencer enrichment method that selects for target DNA in real-time, for direct detection of S. pneumoniae in complex samples. Whereas NAS targeting the whole S. pneumoniae genome was ineffective in the presence of nonpathogenic streptococci, the method was both specific and sensitive when targeting the capsular biosynthetic locus (CBL), the operon that determines S. pneumoniae serotype. NAS significantly improved coverage and yield of the CBL relative to sequencing without NAS, and accurately quantified the relative prevalence of serotypes in samples representing co-carriage. To maximize the sensitivity of NAS to detect novel serotypes, we developed and benchmarked a new pangenome-graph algorithm, named GNASTy. We show that GNASTy outperforms the current NAS implementation, which is based on linear genome alignment, when a sample contains a serotype absent from the database of targeted sequences. The methods developed in this work provide an improved approach for novel serotype discovery and routine S. pneumoniae surveillance that is fast, accurate and feasible in low-resource settings. Although NAS facilitates whole-genome enrichment under ideal circumstances, GNASTy enables targeted enrichment to optimize serotype surveillance in complex samples.
期刊介绍:
Launched in 1995, Genome Research is an international, continuously published, peer-reviewed journal that focuses on research that provides novel insights into the genome biology of all organisms, including advances in genomic medicine.
Among the topics considered by the journal are genome structure and function, comparative genomics, molecular evolution, genome-scale quantitative and population genetics, proteomics, epigenomics, and systems biology. The journal also features exciting gene discoveries and reports of cutting-edge computational biology and high-throughput methodologies.
New data in these areas are published as research papers, or methods and resource reports that provide novel information on technologies or tools that will be of interest to a broad readership. Complete data sets are presented electronically on the journal''s web site where appropriate. The journal also provides Reviews, Perspectives, and Insight/Outlook articles, which present commentary on the latest advances published both here and elsewhere, placing such progress in its broader biological context.