Strain-resolved de-novo metagenomic assembly of viral genomes and microbial 16S rRNAs.

IF 13.8 1区 生物学 Q1 MICROBIOLOGY
Annika Jochheim, Florian A Jochheim, Alexandra Kolodyazhnaya, Étienne Morice, Martin Steinegger, Johannes Söding
{"title":"Strain-resolved de-novo metagenomic assembly of viral genomes and microbial 16S rRNAs.","authors":"Annika Jochheim, Florian A Jochheim, Alexandra Kolodyazhnaya, Étienne Morice, Martin Steinegger, Johannes Söding","doi":"10.1186/s40168-024-01904-y","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Metagenomics is a powerful approach to study environmental and human-associated microbial communities and, in particular, the role of viruses in shaping them. Viral genomes are challenging to assemble from metagenomic samples due to their genomic diversity caused by high mutation rates. In the standard de Bruijn graph assemblers, this genomic diversity leads to complex k-mer assembly graphs with a plethora of loops and bulges that are challenging to resolve into strains or haplotypes because variants more than the k-mer size apart cannot be phased. In contrast, overlap assemblers can phase variants as long as they are covered by a single read.</p><p><strong>Results: </strong>Here, we present PenguiN, a software for strain resolved assembly of viral DNA and RNA genomes and bacterial 16S rRNA from shotgun metagenomics. Its exhaustive detection of all read overlaps in linear time combined with a Bayesian model to select strain-resolved extensions allow it to assemble longer viral contigs, less fragmented genomes, and more strains than existing assembly tools, on both real and simulated datasets. We show a 3-40-fold increase in complete viral genomes and a 6-fold increase in bacterial 16S rRNA genes.</p><p><strong>Conclusion: </strong>PenguiN is the first overlap-based assembler for viral genome and 16S rRNA assembly from large and complex metagenomic datasets, which we hope will facilitate studying the key roles of viruses in microbial communities. Video Abstract.</p>","PeriodicalId":18447,"journal":{"name":"Microbiome","volume":"12 1","pages":"187"},"PeriodicalIF":13.8000,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11443906/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Microbiome","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1186/s40168-024-01904-y","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MICROBIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Metagenomics is a powerful approach to study environmental and human-associated microbial communities and, in particular, the role of viruses in shaping them. Viral genomes are challenging to assemble from metagenomic samples due to their genomic diversity caused by high mutation rates. In the standard de Bruijn graph assemblers, this genomic diversity leads to complex k-mer assembly graphs with a plethora of loops and bulges that are challenging to resolve into strains or haplotypes because variants more than the k-mer size apart cannot be phased. In contrast, overlap assemblers can phase variants as long as they are covered by a single read.

Results: Here, we present PenguiN, a software for strain resolved assembly of viral DNA and RNA genomes and bacterial 16S rRNA from shotgun metagenomics. Its exhaustive detection of all read overlaps in linear time combined with a Bayesian model to select strain-resolved extensions allow it to assemble longer viral contigs, less fragmented genomes, and more strains than existing assembly tools, on both real and simulated datasets. We show a 3-40-fold increase in complete viral genomes and a 6-fold increase in bacterial 16S rRNA genes.

Conclusion: PenguiN is the first overlap-based assembler for viral genome and 16S rRNA assembly from large and complex metagenomic datasets, which we hope will facilitate studying the key roles of viruses in microbial communities. Video Abstract.

病毒基因组和微生物 16S rRNA 的菌株分辨元组组装。
背景:元基因组学是研究环境和人类相关微生物群落,特别是研究病毒在形成这些群落中的作用的一种强有力的方法。病毒基因组因突变率高而具有多样性,因此从元基因组样本中组装病毒基因组具有挑战性。在标准的 de Bruijn 图形组装器中,这种基因组多样性导致了复杂的 k-mer 组装图,其中有大量的环和凸起,由于相差超过 k-mer 大小的变异无法分阶段,因此很难将其解析为菌株或单倍型。相比之下,重叠组装器可以对变异进行分期,只要它们被单个读数覆盖即可:在此,我们介绍了一款用于对病毒 DNA 和 RNA 基因组以及细菌 16S rRNA 进行菌株解析组装的软件 PenguiN。该软件能在线性时间内详尽检测所有读数重叠,并结合贝叶斯模型选择菌株分辨扩展,因此与现有的组装工具相比,该软件能在真实和模拟数据集上组装出更长的病毒等位组、更少片段的基因组和更多的菌株。我们发现,完整的病毒基因组增加了 3-40 倍,细菌 16S rRNA 基因增加了 6 倍:结论:PenguiN 是首个基于重叠的组装器,可从大型复杂的元基因组数据集中组装病毒基因组和 16S rRNA,我们希望这将有助于研究病毒在微生物群落中的关键作用。视频摘要
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Microbiome
Microbiome MICROBIOLOGY-
CiteScore
21.90
自引率
2.60%
发文量
198
审稿时长
4 weeks
期刊介绍: Microbiome is a journal that focuses on studies of microbiomes in humans, animals, plants, and the environment. It covers both natural and manipulated microbiomes, such as those in agriculture. The journal is interested in research that uses meta-omics approaches or novel bioinformatics tools and emphasizes the community/host interaction and structure-function relationship within the microbiome. Studies that go beyond descriptive omics surveys and include experimental or theoretical approaches will be considered for publication. The journal also encourages research that establishes cause and effect relationships and supports proposed microbiome functions. However, studies of individual microbial isolates/species without exploring their impact on the host or the complex microbiome structures and functions will not be considered for publication. Microbiome is indexed in BIOSIS, Current Contents, DOAJ, Embase, MEDLINE, PubMed, PubMed Central, and Science Citations Index Expanded.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信