选择最合适的NGS技术与标准化病毒富集方案相结合,从宏基因组样本中获得禽正呼肠孤病毒完整基因组。

IF 2.8 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY
Frontiers in bioinformatics Pub Date : 2025-02-04 eCollection Date: 2025-01-01 DOI:10.3389/fbinf.2025.1498921
Sonsiray Álvarez-Narváez, Telvin L Harrell, Islam Nour, Sujit K Mohanty, Steven J Conrad
{"title":"选择最合适的NGS技术与标准化病毒富集方案相结合,从宏基因组样本中获得禽正呼肠孤病毒完整基因组。","authors":"Sonsiray Álvarez-Narváez, Telvin L Harrell, Islam Nour, Sujit K Mohanty, Steven J Conrad","doi":"10.3389/fbinf.2025.1498921","DOIUrl":null,"url":null,"abstract":"<p><p>Since viruses are obligate intracellular pathogens, sequencing their genomes results in metagenomic data from both the virus and the host. Virology researchers are constantly seeking new, cost-effective strategies and bioinformatic pipelines for the retrieval of complete viral genomes from these metagenomic samples. Avian orthoreoviruses (ARVs) pose a significant and growing threat to the poultry industry and frequently cause economic losses associated with disease in production birds. Currently available commercial vaccines are ineffective against new ARV variants and ARV outbreaks are increasing worldwide, requiring whole genome sequencing (WGS) to characterize strains that evade vaccines. This study compares the effectiveness of long-read and short-read sequencing technologies for obtaining ARV complete genomes. We used eight clinical isolates of ARV, each previously processed using our published viral genome enrichment protocol. Additionally, we evaluate three assembly methods to determine which provided the most complete and reliable whole genomes: <i>De novo</i>, reference-guided or hybrid. The results suggest that our ARV genome enrichment protocol caused some fragmentation of the viral cDNA that impacted the length of the long reads (but not the short reads) and, as a result, caused a failure to produce complete genomes via <i>de novo</i> assembly. Overall, we observed that regardless of the sequencing technology, the best quality assemblies were generated by mapping quality-trimmed reads to a custom reference genome. The custom reference genomes were in turn constructed with the publicly available ARV genomic segments that shared the highest sequence similarity with the contigs from short-read <i>de novo</i> assemblies. Hence, we conclude that short-read sequencing is the most suitable technology to combine with our ARV genome enrichment protocol.</p>","PeriodicalId":73066,"journal":{"name":"Frontiers in bioinformatics","volume":"5 ","pages":"1498921"},"PeriodicalIF":2.8000,"publicationDate":"2025-02-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11833334/pdf/","citationCount":"0","resultStr":"{\"title\":\"Choosing the most suitable NGS technology to combine with a standardized viral enrichment protocol for obtaining complete avian orthoreovirus genomes from metagenomic samples.\",\"authors\":\"Sonsiray Álvarez-Narváez, Telvin L Harrell, Islam Nour, Sujit K Mohanty, Steven J Conrad\",\"doi\":\"10.3389/fbinf.2025.1498921\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Since viruses are obligate intracellular pathogens, sequencing their genomes results in metagenomic data from both the virus and the host. Virology researchers are constantly seeking new, cost-effective strategies and bioinformatic pipelines for the retrieval of complete viral genomes from these metagenomic samples. Avian orthoreoviruses (ARVs) pose a significant and growing threat to the poultry industry and frequently cause economic losses associated with disease in production birds. Currently available commercial vaccines are ineffective against new ARV variants and ARV outbreaks are increasing worldwide, requiring whole genome sequencing (WGS) to characterize strains that evade vaccines. This study compares the effectiveness of long-read and short-read sequencing technologies for obtaining ARV complete genomes. We used eight clinical isolates of ARV, each previously processed using our published viral genome enrichment protocol. Additionally, we evaluate three assembly methods to determine which provided the most complete and reliable whole genomes: <i>De novo</i>, reference-guided or hybrid. The results suggest that our ARV genome enrichment protocol caused some fragmentation of the viral cDNA that impacted the length of the long reads (but not the short reads) and, as a result, caused a failure to produce complete genomes via <i>de novo</i> assembly. Overall, we observed that regardless of the sequencing technology, the best quality assemblies were generated by mapping quality-trimmed reads to a custom reference genome. The custom reference genomes were in turn constructed with the publicly available ARV genomic segments that shared the highest sequence similarity with the contigs from short-read <i>de novo</i> assemblies. Hence, we conclude that short-read sequencing is the most suitable technology to combine with our ARV genome enrichment protocol.</p>\",\"PeriodicalId\":73066,\"journal\":{\"name\":\"Frontiers in bioinformatics\",\"volume\":\"5 \",\"pages\":\"1498921\"},\"PeriodicalIF\":2.8000,\"publicationDate\":\"2025-02-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11833334/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Frontiers in bioinformatics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.3389/fbinf.2025.1498921\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q2\",\"JCRName\":\"MATHEMATICAL & COMPUTATIONAL BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in bioinformatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3389/fbinf.2025.1498921","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"MATHEMATICAL & COMPUTATIONAL BIOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

由于病毒是专性细胞内病原体,对其基因组进行测序可获得病毒和宿主的宏基因组数据。病毒学研究人员一直在寻求新的、具有成本效益的策略和生物信息学管道,以便从这些宏基因组样本中检索完整的病毒基因组。禽正呼肠病毒(arv)对家禽业构成重大且日益严重的威胁,并经常造成与生产禽的疾病相关的经济损失。目前可用的商业疫苗对新的抗逆转录病毒变种无效,而且抗逆转录病毒疫情在世界范围内不断增加,因此需要全基因组测序(WGS)来确定逃避疫苗的菌株的特征。本研究比较了长读和短读测序技术获得ARV全基因组的有效性。我们使用了8个ARV临床分离株,每个分离株之前都使用我们发表的病毒基因组富集方案进行了处理。此外,我们评估了三种组装方法,以确定哪种方法提供了最完整和可靠的全基因组:De novo,参考引导或杂交。结果表明,我们的ARV基因组富集方案导致病毒cDNA的一些片段化,影响了长reads的长度(但不影响短reads的长度),从而导致无法通过从头组装产生完整的基因组。总的来说,我们观察到,无论采用何种测序技术,通过将高质量修剪的reads映射到定制参考基因组,都可以生成最优质的组装。定制参考基因组依次使用公开可用的ARV基因组片段构建,这些片段与来自短读从头组装的基因组片段具有最高的序列相似性。因此,我们得出结论,短读测序是最适合与我们的ARV基因组富集方案结合的技术。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Choosing the most suitable NGS technology to combine with a standardized viral enrichment protocol for obtaining complete avian orthoreovirus genomes from metagenomic samples.

Since viruses are obligate intracellular pathogens, sequencing their genomes results in metagenomic data from both the virus and the host. Virology researchers are constantly seeking new, cost-effective strategies and bioinformatic pipelines for the retrieval of complete viral genomes from these metagenomic samples. Avian orthoreoviruses (ARVs) pose a significant and growing threat to the poultry industry and frequently cause economic losses associated with disease in production birds. Currently available commercial vaccines are ineffective against new ARV variants and ARV outbreaks are increasing worldwide, requiring whole genome sequencing (WGS) to characterize strains that evade vaccines. This study compares the effectiveness of long-read and short-read sequencing technologies for obtaining ARV complete genomes. We used eight clinical isolates of ARV, each previously processed using our published viral genome enrichment protocol. Additionally, we evaluate three assembly methods to determine which provided the most complete and reliable whole genomes: De novo, reference-guided or hybrid. The results suggest that our ARV genome enrichment protocol caused some fragmentation of the viral cDNA that impacted the length of the long reads (but not the short reads) and, as a result, caused a failure to produce complete genomes via de novo assembly. Overall, we observed that regardless of the sequencing technology, the best quality assemblies were generated by mapping quality-trimmed reads to a custom reference genome. The custom reference genomes were in turn constructed with the publicly available ARV genomic segments that shared the highest sequence similarity with the contigs from short-read de novo assemblies. Hence, we conclude that short-read sequencing is the most suitable technology to combine with our ARV genome enrichment protocol.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
2.60
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信