Tracking SARS-CoV-2 variants of concern in wastewater: an assessment of nine computational tools using simulated genomic data.

IF 4 2区 生物学 Q1 GENETICS & HEREDITY
Steven G Sutcliffe, Susanne A Kraemer, Isaac Ellmen, Jennifer J Knapp, Alyssa K Overton, Delaney Nash, Jozef I Nissimov, Trevor C Charles, David Dreifuss, Ivan Topolsky, Pelin I Baykal, Lara Fuhrmann, Kim P Jablonski, Niko Beerenwinkel, Joshua I Levy, Abayomi S Olabode, Devan G Becker, Gopi Gugan, Erin Brintnell, Art F Y Poon, Renan Valieris, Rodrigo D Drummond, Alexandre Defelicibus, Emmanuel Dias-Neto, Rafael A Rosales, Israel Tojal da Silva, Aspasia Orfanou, Fotis Psomopoulos, Nikolaos Pechlivanis, Lenore Pipes, Zihao Chen, Jasmijn A Baaijens, Michael Baym, B Jesse Shapiro
{"title":"Tracking SARS-CoV-2 variants of concern in wastewater: an assessment of nine computational tools using simulated genomic data.","authors":"Steven G Sutcliffe, Susanne A Kraemer, Isaac Ellmen, Jennifer J Knapp, Alyssa K Overton, Delaney Nash, Jozef I Nissimov, Trevor C Charles, David Dreifuss, Ivan Topolsky, Pelin I Baykal, Lara Fuhrmann, Kim P Jablonski, Niko Beerenwinkel, Joshua I Levy, Abayomi S Olabode, Devan G Becker, Gopi Gugan, Erin Brintnell, Art F Y Poon, Renan Valieris, Rodrigo D Drummond, Alexandre Defelicibus, Emmanuel Dias-Neto, Rafael A Rosales, Israel Tojal da Silva, Aspasia Orfanou, Fotis Psomopoulos, Nikolaos Pechlivanis, Lenore Pipes, Zihao Chen, Jasmijn A Baaijens, Michael Baym, B Jesse Shapiro","doi":"10.1099/mgen.0.001249","DOIUrl":null,"url":null,"abstract":"<p><p>Wastewater-based surveillance (WBS) is an important epidemiological and public health tool for tracking pathogens across the scale of a building, neighbourhood, city, or region. WBS gained widespread adoption globally during the SARS-CoV-2 pandemic for estimating community infection levels by qPCR. Sequencing pathogen genes or genomes from wastewater adds information about pathogen genetic diversity, which can be used to identify viral lineages (including variants of concern) that are circulating in a local population. Capturing the genetic diversity by WBS sequencing is not trivial, as wastewater samples often contain a diverse mixture of viral lineages with real mutations and sequencing errors, which must be deconvoluted computationally from short sequencing reads. In this study we assess nine different computational tools that have recently been developed to address this challenge. We simulated 100 wastewater sequence samples consisting of SARS-CoV-2 BA.1, BA.2, and Delta lineages, in various mixtures, as well as a Delta-Omicron recombinant and a synthetic 'novel' lineage. Most tools performed well in identifying the true lineages present and estimating their relative abundances and were generally robust to variation in sequencing depth and read length. While many tools identified lineages present down to 1 % frequency, results were more reliable above a 5 % threshold. The presence of an unknown synthetic lineage, which represents an unclassified SARS-CoV-2 lineage, increases the error in relative abundance estimates of other lineages, but the magnitude of this effect was small for most tools. The tools also varied in how they labelled novel synthetic lineages and recombinants. While our simulated dataset represents just one of many possible use cases for these methods, we hope it helps users understand potential sources of error or bias in wastewater sequencing analysis and to appreciate the commonalities and differences across methods.</p>","PeriodicalId":18487,"journal":{"name":"Microbial Genomics","volume":"10 5","pages":""},"PeriodicalIF":4.0000,"publicationDate":"2024-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11165662/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Microbial Genomics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1099/mgen.0.001249","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0

Abstract

Wastewater-based surveillance (WBS) is an important epidemiological and public health tool for tracking pathogens across the scale of a building, neighbourhood, city, or region. WBS gained widespread adoption globally during the SARS-CoV-2 pandemic for estimating community infection levels by qPCR. Sequencing pathogen genes or genomes from wastewater adds information about pathogen genetic diversity, which can be used to identify viral lineages (including variants of concern) that are circulating in a local population. Capturing the genetic diversity by WBS sequencing is not trivial, as wastewater samples often contain a diverse mixture of viral lineages with real mutations and sequencing errors, which must be deconvoluted computationally from short sequencing reads. In this study we assess nine different computational tools that have recently been developed to address this challenge. We simulated 100 wastewater sequence samples consisting of SARS-CoV-2 BA.1, BA.2, and Delta lineages, in various mixtures, as well as a Delta-Omicron recombinant and a synthetic 'novel' lineage. Most tools performed well in identifying the true lineages present and estimating their relative abundances and were generally robust to variation in sequencing depth and read length. While many tools identified lineages present down to 1 % frequency, results were more reliable above a 5 % threshold. The presence of an unknown synthetic lineage, which represents an unclassified SARS-CoV-2 lineage, increases the error in relative abundance estimates of other lineages, but the magnitude of this effect was small for most tools. The tools also varied in how they labelled novel synthetic lineages and recombinants. While our simulated dataset represents just one of many possible use cases for these methods, we hope it helps users understand potential sources of error or bias in wastewater sequencing analysis and to appreciate the commonalities and differences across methods.

追踪废水中令人担忧的 SARS-CoV-2 变异体:利用模拟基因组数据对九种计算工具进行评估。
废水监测(WBS)是一种重要的流行病学和公共卫生工具,用于在建筑物、社区、城市或地区范围内追踪病原体。在 SARS-CoV-2 大流行期间,全球广泛采用 WBS 通过 qPCR 估算社区感染水平。对废水中的病原体基因或基因组进行测序可增加有关病原体遗传多样性的信息,可用于识别在当地人群中流行的病毒系(包括令人担忧的变种)。通过 WBS 测序捕捉遗传多样性并非易事,因为废水样本通常包含多种病毒系的混合物,其中既有真正的突变,也有测序错误,必须通过计算从短测序读数中去卷积。在本研究中,我们评估了最近为应对这一挑战而开发的九种不同的计算工具。我们模拟了 100 份废水序列样本,其中包括 SARS-CoV-2 BA.1、BA.2 和 Delta 系的各种混合物,以及 Delta-Omicron 重组和合成的 "新型 "系。大多数工具都能很好地识别出真正存在的品系,并估算出它们的相对丰度,而且对测序深度和读数长度的变化一般都很稳健。虽然许多工具都能识别出频率低至 1% 的品系,但 5% 以上的结果更为可靠。未知合成品系(代表未分类的 SARS-CoV-2 品系)的存在会增加其他品系相对丰度估计值的误差,但对大多数工具来说,这种影响的程度很小。这些工具在标记新的合成系和重组体方面也各不相同。虽然我们的模拟数据集仅代表了这些方法的众多可能用例之一,但我们希望它能帮助用户了解废水测序分析中潜在的误差或偏差来源,并理解不同方法之间的共性和差异。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Microbial Genomics
Microbial Genomics Medicine-Epidemiology
CiteScore
6.60
自引率
2.60%
发文量
153
审稿时长
12 weeks
期刊介绍: Microbial Genomics (MGen) is a fully open access, mandatory open data and peer-reviewed journal publishing high-profile original research on archaea, bacteria, microbial eukaryotes and viruses.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信