Evaluation of sample pooling for gene sequencing of SARS-CoV-2: a simulation study.

IF 1.4 4区 医学 Q4 INFECTIOUS DISEASES
Heng Chen, Yue Cheng, Xun He, Yuzhen Zhou, Wenjun Xie, Danyun Shen, Zhiqun He, Ruidan Li, Weixuan Liu, Liang Wang, Xuejun Zhang
{"title":"Evaluation of sample pooling for gene sequencing of SARS-CoV-2: a simulation study.","authors":"Heng Chen, Yue Cheng, Xun He, Yuzhen Zhou, Wenjun Xie, Danyun Shen, Zhiqun He, Ruidan Li, Weixuan Liu, Liang Wang, Xuejun Zhang","doi":"10.3855/jidc.20348","DOIUrl":null,"url":null,"abstract":"<p><strong>Introduction: </strong>Coronavirus disease 2019 (COVID-19) continues to pose a significant public health threat, requiring epidemiological and genomic surveillance. Next generation sequencing (NGS) is commonly utilized for monitoring viral evolution at a high cost. This study evaluated pooled sequencing as a cost-effective tool for monitoring virus variants.</p><p><strong>Methodology: </strong>A simulation study was conducted to evaluate the efficacy of sample pooling for severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) sequencing. In total, 72 original sets of raw data of gene sequencing with different genotypes were collected and combined to create 70 simulated samples based on five pooling strategies. A bioinformatics tool based on Freyja was utilized to analyze the variant composition of these 70 simulated pooled samples. The efficiency of recovering the correct genotypes of the original samples among different pooling strategies, result reports, and genotypes was evaluated with R software.</p><p><strong>Results: </strong>The genetic composition of the pooled samples mostly recovered the genotype compositions of the original samples, with discrepancies between the top X results (where X is the number of original samples in the pool) and the complete results (p < 0.05). Variability in identification efficiency of genotypes were observed in the reports for the top X results (p < 0.05) across the five pooling strategies, but not in the reports of complete results (p > 0.05). Some original samples of low quality were not accurately identified.</p><p><strong>Conclusions: </strong>Sample pooling coupled with streamlined genotyping offers a promising approach for cost-effective gene sequencing of SARS-CoV-2, which will aid in COVID-19 genomic surveillance.</p>","PeriodicalId":49160,"journal":{"name":"Journal of Infection in Developing Countries","volume":"19 1","pages":"1-8"},"PeriodicalIF":1.4000,"publicationDate":"2025-01-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Infection in Developing Countries","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.3855/jidc.20348","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"INFECTIOUS DISEASES","Score":null,"Total":0}
引用次数: 0

Abstract

Introduction: Coronavirus disease 2019 (COVID-19) continues to pose a significant public health threat, requiring epidemiological and genomic surveillance. Next generation sequencing (NGS) is commonly utilized for monitoring viral evolution at a high cost. This study evaluated pooled sequencing as a cost-effective tool for monitoring virus variants.

Methodology: A simulation study was conducted to evaluate the efficacy of sample pooling for severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) sequencing. In total, 72 original sets of raw data of gene sequencing with different genotypes were collected and combined to create 70 simulated samples based on five pooling strategies. A bioinformatics tool based on Freyja was utilized to analyze the variant composition of these 70 simulated pooled samples. The efficiency of recovering the correct genotypes of the original samples among different pooling strategies, result reports, and genotypes was evaluated with R software.

Results: The genetic composition of the pooled samples mostly recovered the genotype compositions of the original samples, with discrepancies between the top X results (where X is the number of original samples in the pool) and the complete results (p < 0.05). Variability in identification efficiency of genotypes were observed in the reports for the top X results (p < 0.05) across the five pooling strategies, but not in the reports of complete results (p > 0.05). Some original samples of low quality were not accurately identified.

Conclusions: Sample pooling coupled with streamlined genotyping offers a promising approach for cost-effective gene sequencing of SARS-CoV-2, which will aid in COVID-19 genomic surveillance.

求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
3.70
自引率
5.30%
发文量
239
审稿时长
4-8 weeks
期刊介绍: The Journal of Infection in Developing Countries (JIDC) is an international journal, intended for the publication of scientific articles from Developing Countries by scientists from Developing Countries. JIDC is an independent, on-line publication with an international editorial board. JIDC is open access with no cost to view or download articles and reasonable cost for publication of research artcles, making JIDC easily availiable to scientists from resource restricted regions.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信