金标准数据集和评估从废水中估算血统丰度的方法。

IF 8 1区 环境科学与生态学 Q1 ENVIRONMENTAL SCIENCES
Science of the Total Environment Pub Date : 2024-10-20 Epub Date: 2024-07-05 DOI:10.1016/j.scitotenv.2024.174515
Jannatul Ferdous, Samuel Kunkleman, William Taylor, April Harris, Cynthia J Gibas, Jessica A Schlueter
{"title":"金标准数据集和评估从废水中估算血统丰度的方法。","authors":"Jannatul Ferdous, Samuel Kunkleman, William Taylor, April Harris, Cynthia J Gibas, Jessica A Schlueter","doi":"10.1016/j.scitotenv.2024.174515","DOIUrl":null,"url":null,"abstract":"<p><p>During the SARS-CoV-2 pandemic, genome-based wastewater surveillance sequencing has been a powerful tool for public health to monitor circulating and emerging viral variants. As a medium, wastewater is very complex because of its mixed matrix nature, which makes the deconvolution of wastewater samples more difficult. Here we introduce a gold standard dataset constructed from synthetic viral control mixtures of known composition, spiked into a wastewater RNA matrix and sequenced on the Oxford Nanopore Technologies platform. We compare the performance of eight of the most commonly used deconvolution tools in identifying SARS-CoV-2 variants present in these mixtures. The software evaluated was primarily chosen for its relevance to the CDC wastewater surveillance reporting protocol, which until recently employed a pipeline that incorporates results from four deconvolution methods: Freyja, kallisto, Kraken 2/Bracken, and LCS. We also tested Lollipop, a deconvolution method used by the Swiss SARS-CoV-2 Sequencing Consortium, and three additional methods not used in the C-WAP pipeline: lineagespot, Alcov, and VaQuERo. We found that the commonly used software Freyja outperformed the other CDC pipeline tools in correct identification of lineages present in the control mixtures, and that the VaQuERo method was similarly accurate, with minor differences in the ability of the two methods to avoid false negatives and suppress false positives. Our results also provide insight into the effect of the tiling primer scheme and wastewater RNA extract matrix on viral sequencing and data deconvolution outcomes.</p>","PeriodicalId":422,"journal":{"name":"Science of the Total Environment","volume":" ","pages":"174515"},"PeriodicalIF":8.0000,"publicationDate":"2024-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A gold standard dataset and evaluation of methods for lineage abundance estimation from wastewater.\",\"authors\":\"Jannatul Ferdous, Samuel Kunkleman, William Taylor, April Harris, Cynthia J Gibas, Jessica A Schlueter\",\"doi\":\"10.1016/j.scitotenv.2024.174515\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>During the SARS-CoV-2 pandemic, genome-based wastewater surveillance sequencing has been a powerful tool for public health to monitor circulating and emerging viral variants. As a medium, wastewater is very complex because of its mixed matrix nature, which makes the deconvolution of wastewater samples more difficult. Here we introduce a gold standard dataset constructed from synthetic viral control mixtures of known composition, spiked into a wastewater RNA matrix and sequenced on the Oxford Nanopore Technologies platform. We compare the performance of eight of the most commonly used deconvolution tools in identifying SARS-CoV-2 variants present in these mixtures. The software evaluated was primarily chosen for its relevance to the CDC wastewater surveillance reporting protocol, which until recently employed a pipeline that incorporates results from four deconvolution methods: Freyja, kallisto, Kraken 2/Bracken, and LCS. We also tested Lollipop, a deconvolution method used by the Swiss SARS-CoV-2 Sequencing Consortium, and three additional methods not used in the C-WAP pipeline: lineagespot, Alcov, and VaQuERo. We found that the commonly used software Freyja outperformed the other CDC pipeline tools in correct identification of lineages present in the control mixtures, and that the VaQuERo method was similarly accurate, with minor differences in the ability of the two methods to avoid false negatives and suppress false positives. Our results also provide insight into the effect of the tiling primer scheme and wastewater RNA extract matrix on viral sequencing and data deconvolution outcomes.</p>\",\"PeriodicalId\":422,\"journal\":{\"name\":\"Science of the Total Environment\",\"volume\":\" \",\"pages\":\"174515\"},\"PeriodicalIF\":8.0000,\"publicationDate\":\"2024-10-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Science of the Total Environment\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://doi.org/10.1016/j.scitotenv.2024.174515\",\"RegionNum\":1,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/7/5 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"ENVIRONMENTAL SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Science of the Total Environment","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.1016/j.scitotenv.2024.174515","RegionNum":1,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/7/5 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0

摘要

在 SARS-CoV-2 大流行期间,基于基因组的废水监测测序已成为公共卫生监测循环和新出现的病毒变种的有力工具。作为一种介质,废水因其混合基质的性质而非常复杂,这使得废水样本的解卷积变得更加困难。在此,我们介绍一种黄金标准数据集,该数据集由已知成分的合成病毒对照混合物构建而成,将其添加到废水 RNA 基质中,并在牛津纳米孔技术平台上进行测序。我们比较了八种最常用的解卷积工具在识别这些混合物中的 SARS-CoV-2 变异体方面的性能。所评估的软件之所以被选中,主要是因为它与美国疾病预防控制中心的废水监测报告协议相关,直到最近,该报告协议才采用了一个包含四种解卷积方法结果的管道:Freyja、kallisto、Kraken 2/Bracken 和 LCS。我们还测试了瑞士 SARS-CoV-2 测序联盟(Swiss SARS-CoV-2 Sequencing Consortium)使用的解卷积方法 Lollipop,以及 C-WAP 管道未使用的另外三种方法:lineagespot、Alcov 和 VaQuERo。我们发现,常用软件 Freyja 在正确识别对照混合物中存在的系谱方面优于其他 CDC 管道工具,而 VaQuERo 方法也同样准确,只是这两种方法在避免假阴性和抑制假阳性的能力上略有不同。我们的研究结果还让我们深入了解了平铺引物方案和废水 RNA 提取矩阵对病毒测序和数据解旋结果的影响。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A gold standard dataset and evaluation of methods for lineage abundance estimation from wastewater.

During the SARS-CoV-2 pandemic, genome-based wastewater surveillance sequencing has been a powerful tool for public health to monitor circulating and emerging viral variants. As a medium, wastewater is very complex because of its mixed matrix nature, which makes the deconvolution of wastewater samples more difficult. Here we introduce a gold standard dataset constructed from synthetic viral control mixtures of known composition, spiked into a wastewater RNA matrix and sequenced on the Oxford Nanopore Technologies platform. We compare the performance of eight of the most commonly used deconvolution tools in identifying SARS-CoV-2 variants present in these mixtures. The software evaluated was primarily chosen for its relevance to the CDC wastewater surveillance reporting protocol, which until recently employed a pipeline that incorporates results from four deconvolution methods: Freyja, kallisto, Kraken 2/Bracken, and LCS. We also tested Lollipop, a deconvolution method used by the Swiss SARS-CoV-2 Sequencing Consortium, and three additional methods not used in the C-WAP pipeline: lineagespot, Alcov, and VaQuERo. We found that the commonly used software Freyja outperformed the other CDC pipeline tools in correct identification of lineages present in the control mixtures, and that the VaQuERo method was similarly accurate, with minor differences in the ability of the two methods to avoid false negatives and suppress false positives. Our results also provide insight into the effect of the tiling primer scheme and wastewater RNA extract matrix on viral sequencing and data deconvolution outcomes.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Science of the Total Environment
Science of the Total Environment 环境科学-环境科学
CiteScore
17.60
自引率
10.20%
发文量
8726
审稿时长
2.4 months
期刊介绍: The Science of the Total Environment is an international journal dedicated to scientific research on the environment and its interaction with humanity. It covers a wide range of disciplines and seeks to publish innovative, hypothesis-driven, and impactful research that explores the entire environment, including the atmosphere, lithosphere, hydrosphere, biosphere, and anthroposphere. The journal's updated Aims & Scope emphasizes the importance of interdisciplinary environmental research with broad impact. Priority is given to studies that advance fundamental understanding and explore the interconnectedness of multiple environmental spheres. Field studies are preferred, while laboratory experiments must demonstrate significant methodological advancements or mechanistic insights with direct relevance to the environment.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信