Evaluation of sample pooling for gene sequencing of SARS-CoV-2: a simulation study.

IF 1.4 4区 医学 Q4 INFECTIOUS DISEASES
Heng Chen, Yue Cheng, Xun He, Yuzhen Zhou, Wenjun Xie, Danyun Shen, Zhiqun He, Ruidan Li, Weixuan Liu, Liang Wang, Xuejun Zhang
{"title":"Evaluation of sample pooling for gene sequencing of SARS-CoV-2: a simulation study.","authors":"Heng Chen, Yue Cheng, Xun He, Yuzhen Zhou, Wenjun Xie, Danyun Shen, Zhiqun He, Ruidan Li, Weixuan Liu, Liang Wang, Xuejun Zhang","doi":"10.3855/jidc.20348","DOIUrl":null,"url":null,"abstract":"<p><strong>Introduction: </strong>Coronavirus disease 2019 (COVID-19) continues to pose a significant public health threat, requiring epidemiological and genomic surveillance. Next generation sequencing (NGS) is commonly utilized for monitoring viral evolution at a high cost. This study evaluated pooled sequencing as a cost-effective tool for monitoring virus variants.</p><p><strong>Methodology: </strong>A simulation study was conducted to evaluate the efficacy of sample pooling for severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) sequencing. In total, 72 original sets of raw data of gene sequencing with different genotypes were collected and combined to create 70 simulated samples based on five pooling strategies. A bioinformatics tool based on Freyja was utilized to analyze the variant composition of these 70 simulated pooled samples. The efficiency of recovering the correct genotypes of the original samples among different pooling strategies, result reports, and genotypes was evaluated with R software.</p><p><strong>Results: </strong>The genetic composition of the pooled samples mostly recovered the genotype compositions of the original samples, with discrepancies between the top X results (where X is the number of original samples in the pool) and the complete results (p < 0.05). Variability in identification efficiency of genotypes were observed in the reports for the top X results (p < 0.05) across the five pooling strategies, but not in the reports of complete results (p > 0.05). Some original samples of low quality were not accurately identified.</p><p><strong>Conclusions: </strong>Sample pooling coupled with streamlined genotyping offers a promising approach for cost-effective gene sequencing of SARS-CoV-2, which will aid in COVID-19 genomic surveillance.</p>","PeriodicalId":49160,"journal":{"name":"Journal of Infection in Developing Countries","volume":"19 1","pages":"1-8"},"PeriodicalIF":1.4000,"publicationDate":"2025-01-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Infection in Developing Countries","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.3855/jidc.20348","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"INFECTIOUS DISEASES","Score":null,"Total":0}
引用次数: 0

Abstract

Introduction: Coronavirus disease 2019 (COVID-19) continues to pose a significant public health threat, requiring epidemiological and genomic surveillance. Next generation sequencing (NGS) is commonly utilized for monitoring viral evolution at a high cost. This study evaluated pooled sequencing as a cost-effective tool for monitoring virus variants.

Methodology: A simulation study was conducted to evaluate the efficacy of sample pooling for severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) sequencing. In total, 72 original sets of raw data of gene sequencing with different genotypes were collected and combined to create 70 simulated samples based on five pooling strategies. A bioinformatics tool based on Freyja was utilized to analyze the variant composition of these 70 simulated pooled samples. The efficiency of recovering the correct genotypes of the original samples among different pooling strategies, result reports, and genotypes was evaluated with R software.

Results: The genetic composition of the pooled samples mostly recovered the genotype compositions of the original samples, with discrepancies between the top X results (where X is the number of original samples in the pool) and the complete results (p < 0.05). Variability in identification efficiency of genotypes were observed in the reports for the top X results (p < 0.05) across the five pooling strategies, but not in the reports of complete results (p > 0.05). Some original samples of low quality were not accurately identified.

Conclusions: Sample pooling coupled with streamlined genotyping offers a promising approach for cost-effective gene sequencing of SARS-CoV-2, which will aid in COVID-19 genomic surveillance.

评估SARS-CoV-2基因测序的样本池:模拟研究
2019冠状病毒病(COVID-19)继续构成重大公共卫生威胁,需要进行流行病学和基因组监测。下一代测序(NGS)通常用于监测病毒进化,但成本较高。本研究评估了集合测序作为监测病毒变异的一种经济有效的工具。方法:通过模拟研究,评估样本池法对SARS-CoV-2测序的效果。总共收集了72组不同基因型的原始基因测序原始数据,并基于5种池化策略组合生成70个模拟样本。利用基于Freyja的生物信息学工具分析了这70个模拟池样本的变异组成。利用R软件评估不同池化策略、结果报告和基因型中恢复原始样本正确基因型的效率。结果:合并样本的遗传组成基本恢复了原始样本的基因型组成,其中前X个结果(其中X为原始样本在池中的数量)与完整结果存在差异(p < 0.05)。在5种池化策略中,前X个结果的基因型鉴定效率存在差异(p < 0.05),但在完整结果的报告中没有差异(p < 0.05)。一些低质量的原始样品没有被准确识别。结论:样本池结合精简型基因分型为SARS-CoV-2基因测序提供了一种具有成本效益的方法,这将有助于COVID-19基因组监测。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
3.70
自引率
5.30%
发文量
239
审稿时长
4-8 weeks
期刊介绍: The Journal of Infection in Developing Countries (JIDC) is an international journal, intended for the publication of scientific articles from Developing Countries by scientists from Developing Countries. JIDC is an independent, on-line publication with an international editorial board. JIDC is open access with no cost to view or download articles and reasonable cost for publication of research artcles, making JIDC easily availiable to scientists from resource restricted regions.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信