{"title":"Evaluation of sample pooling for gene sequencing of SARS-CoV-2: a simulation study.","authors":"Heng Chen, Yue Cheng, Xun He, Yuzhen Zhou, Wenjun Xie, Danyun Shen, Zhiqun He, Ruidan Li, Weixuan Liu, Liang Wang, Xuejun Zhang","doi":"10.3855/jidc.20348","DOIUrl":null,"url":null,"abstract":"<p><strong>Introduction: </strong>Coronavirus disease 2019 (COVID-19) continues to pose a significant public health threat, requiring epidemiological and genomic surveillance. Next generation sequencing (NGS) is commonly utilized for monitoring viral evolution at a high cost. This study evaluated pooled sequencing as a cost-effective tool for monitoring virus variants.</p><p><strong>Methodology: </strong>A simulation study was conducted to evaluate the efficacy of sample pooling for severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) sequencing. In total, 72 original sets of raw data of gene sequencing with different genotypes were collected and combined to create 70 simulated samples based on five pooling strategies. A bioinformatics tool based on Freyja was utilized to analyze the variant composition of these 70 simulated pooled samples. The efficiency of recovering the correct genotypes of the original samples among different pooling strategies, result reports, and genotypes was evaluated with R software.</p><p><strong>Results: </strong>The genetic composition of the pooled samples mostly recovered the genotype compositions of the original samples, with discrepancies between the top X results (where X is the number of original samples in the pool) and the complete results (p < 0.05). Variability in identification efficiency of genotypes were observed in the reports for the top X results (p < 0.05) across the five pooling strategies, but not in the reports of complete results (p > 0.05). Some original samples of low quality were not accurately identified.</p><p><strong>Conclusions: </strong>Sample pooling coupled with streamlined genotyping offers a promising approach for cost-effective gene sequencing of SARS-CoV-2, which will aid in COVID-19 genomic surveillance.</p>","PeriodicalId":49160,"journal":{"name":"Journal of Infection in Developing Countries","volume":"19 1","pages":"1-8"},"PeriodicalIF":1.4000,"publicationDate":"2025-01-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Infection in Developing Countries","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.3855/jidc.20348","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"INFECTIOUS DISEASES","Score":null,"Total":0}
引用次数: 0
Abstract
Introduction: Coronavirus disease 2019 (COVID-19) continues to pose a significant public health threat, requiring epidemiological and genomic surveillance. Next generation sequencing (NGS) is commonly utilized for monitoring viral evolution at a high cost. This study evaluated pooled sequencing as a cost-effective tool for monitoring virus variants.
Methodology: A simulation study was conducted to evaluate the efficacy of sample pooling for severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) sequencing. In total, 72 original sets of raw data of gene sequencing with different genotypes were collected and combined to create 70 simulated samples based on five pooling strategies. A bioinformatics tool based on Freyja was utilized to analyze the variant composition of these 70 simulated pooled samples. The efficiency of recovering the correct genotypes of the original samples among different pooling strategies, result reports, and genotypes was evaluated with R software.
Results: The genetic composition of the pooled samples mostly recovered the genotype compositions of the original samples, with discrepancies between the top X results (where X is the number of original samples in the pool) and the complete results (p < 0.05). Variability in identification efficiency of genotypes were observed in the reports for the top X results (p < 0.05) across the five pooling strategies, but not in the reports of complete results (p > 0.05). Some original samples of low quality were not accurately identified.
Conclusions: Sample pooling coupled with streamlined genotyping offers a promising approach for cost-effective gene sequencing of SARS-CoV-2, which will aid in COVID-19 genomic surveillance.
期刊介绍:
The Journal of Infection in Developing Countries (JIDC) is an international journal, intended for the publication of scientific articles from Developing Countries by scientists from Developing Countries.
JIDC is an independent, on-line publication with an international editorial board. JIDC is open access with no cost to view or download articles and reasonable cost for publication of research artcles, making JIDC easily availiable to scientists from resource restricted regions.