Prophage-DB: A comprehensive database to explore diversity, distribution, and ecology of prophages

bioRxiv Pub Date : 2024-07-16 DOI:10.1101/2024.07.11.603044
Etan Dieppa-Colón, Cody Martin, Karthik Anantharaman
{"title":"Prophage-DB: A comprehensive database to explore diversity, distribution, and ecology of prophages","authors":"Etan Dieppa-Colón, Cody Martin, Karthik Anantharaman","doi":"10.1101/2024.07.11.603044","DOIUrl":null,"url":null,"abstract":"Background Viruses that infect prokaryotes (phages) constitute the most abundant group of biological agents, playing pivotal roles in microbial systems. They are known to impact microbial community dynamics, microbial ecology, and evolution. Efforts to document the diversity, host range, infection dynamics, and effects of bacteriophage infection on host cell metabolism are extremely underexplored. Phages are classified as virulent or temperate based on their life cycles. Temperate phages adopt the lysogenic mode of infection, where the genome integrates into the host cell genome forming a prophage. Prophages enable viral genome replication without host cell lysis, and often contribute novel and beneficial traits to the host genome. Current phage research predominantly focuses on lytic phages, leaving a significant gap in knowledge regarding prophages, including their biology, diversity, and ecological roles. Results Here we develop and describe Prophage-DB, a database of prophages, their proteins, and associated metadata that will serve as a resource for viral genomics and microbial ecology. To create the database, we identified and characterized prophages from genomes in three of the largest publicly available databases. We applied several state-of-the-art tools in our pipeline to annotate these viruses, cluster and taxonomically classify them, and detect their respective auxiliary metabolic genes. In total, we identify and characterize over 350,000 prophages and 35,000 auxiliary metabolic genes. Our prophage database is highly representative based on statistical results and contains prophages from a diverse set of archaeal and bacterial hosts which show a wide environmental distribution. Conclusion Prophages are particularly overlooked in viral ecology and merit increased attention due to their vital implications for microbiomes and their hosts. Here, we created Prophage-DB to advance our comprehension of prophages in microbiomes through a comprehensive characterization of prophages in publicly available genomes. We propose that Prophage-DB will serve as a valuable resource for advancing phage research, offering insights into viral taxonomy, host relationships, auxiliary metabolic genes, and environmental distribution.","PeriodicalId":9124,"journal":{"name":"bioRxiv","volume":"37 11","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"bioRxiv","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1101/2024.07.11.603044","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Background Viruses that infect prokaryotes (phages) constitute the most abundant group of biological agents, playing pivotal roles in microbial systems. They are known to impact microbial community dynamics, microbial ecology, and evolution. Efforts to document the diversity, host range, infection dynamics, and effects of bacteriophage infection on host cell metabolism are extremely underexplored. Phages are classified as virulent or temperate based on their life cycles. Temperate phages adopt the lysogenic mode of infection, where the genome integrates into the host cell genome forming a prophage. Prophages enable viral genome replication without host cell lysis, and often contribute novel and beneficial traits to the host genome. Current phage research predominantly focuses on lytic phages, leaving a significant gap in knowledge regarding prophages, including their biology, diversity, and ecological roles. Results Here we develop and describe Prophage-DB, a database of prophages, their proteins, and associated metadata that will serve as a resource for viral genomics and microbial ecology. To create the database, we identified and characterized prophages from genomes in three of the largest publicly available databases. We applied several state-of-the-art tools in our pipeline to annotate these viruses, cluster and taxonomically classify them, and detect their respective auxiliary metabolic genes. In total, we identify and characterize over 350,000 prophages and 35,000 auxiliary metabolic genes. Our prophage database is highly representative based on statistical results and contains prophages from a diverse set of archaeal and bacterial hosts which show a wide environmental distribution. Conclusion Prophages are particularly overlooked in viral ecology and merit increased attention due to their vital implications for microbiomes and their hosts. Here, we created Prophage-DB to advance our comprehension of prophages in microbiomes through a comprehensive characterization of prophages in publicly available genomes. We propose that Prophage-DB will serve as a valuable resource for advancing phage research, offering insights into viral taxonomy, host relationships, auxiliary metabolic genes, and environmental distribution.
噬菌体数据库:探索噬菌体多样性、分布和生态的综合数据库
背景 感染原核生物的病毒(噬菌体)是生物制剂中数量最多的一类,在微生物系统中发挥着举足轻重的作用。众所周知,它们会影响微生物群落动力学、微生物生态学和进化。目前对噬菌体的多样性、宿主范围、感染动态以及感染对宿主细胞新陈代谢的影响等方面的研究还极为欠缺。噬菌体根据其生命周期可分为毒性噬菌体和温性噬菌体。温性噬菌体采用溶原感染模式,其基因组整合到宿主细胞基因组中,形成噬菌体。噬菌体能在不溶解宿主细胞的情况下复制病毒基因组,而且往往能为宿主基因组带来新的有益特性。目前的噬菌体研究主要集中在裂解噬菌体上,对噬菌体的生物学、多样性和生态作用等方面的了解还存在很大差距。结果 在这里,我们开发并描述了噬菌体数据库(Prophage-DB),这是一个包含噬菌体、其蛋白质和相关元数据的数据库,将成为病毒基因组学和微生物生态学的资源库。为了创建该数据库,我们从三个最大的公开数据库中的基因组中鉴定并描述了噬菌体。我们在程序中应用了几种最先进的工具来注释这些病毒,对它们进行聚类和分类,并检测它们各自的辅助代谢基因。我们总共识别并描述了超过 350,000 个噬菌体和 35,000 个辅助代谢基因。根据统计结果,我们的噬菌体数据库具有很高的代表性,其中包含了来自不同古细菌和细菌宿主的噬菌体,这些噬菌体显示出广泛的环境分布。结论 在病毒生态学中,噬菌体尤其容易被忽视,由于它们对微生物组及其宿主具有重要影响,因此值得更多关注。在此,我们创建了 Prophage-DB,通过对公开基因组中的噬菌体进行全面描述,加深我们对微生物组中噬菌体的理解。我们认为,Prophage-DB 将成为推动噬菌体研究的宝贵资源,为病毒分类、宿主关系、辅助代谢基因和环境分布提供见解。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信