ssRNA bacteriophage metagenomes reveal a diverse set of novel protein families.

IF 5.2 3区 生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY
Protein Science Pub Date : 2026-05-01 DOI:10.1002/pro.70582
Jānis Rūmnieks, Ieva Baltā, Mihails Šišovs, Kaspars Tārs
{"title":"ssRNA bacteriophage metagenomes reveal a diverse set of novel protein families.","authors":"Jānis Rūmnieks, Ieva Baltā, Mihails Šišovs, Kaspars Tārs","doi":"10.1002/pro.70582","DOIUrl":null,"url":null,"abstract":"<p><p>The bacteriophages with single-stranded RNA (ssRNA) genomes (class Leviviricetes) are among the simplest known viruses that encode only three core proteins: a receptor-binding protein, a capsid protein, and an RNA-dependent RNA polymerase. The number of isolated ssRNA phages has remained very low, but the accumulating RNA metagenome data have uncovered a large variety of these viruses in many environments. Besides the core proteins, many of these genomes putatively encode additional proteins, which up to now have remained uncharacterized. We looked for non-conserved open reading frames (ORFs) in Leviviricetes sequences from the IMG/VR virus metagenome database and used sequence- and structure-based clustering to organize them into similarity groups. Potential ORFs were found throughout the ssRNA phage genomes but almost exclusively on the positive-sense RNA strand, suggestive of their protein-coding potential. The prevalence of the non-conserved ORFs varied in various phage lineages, and their distribution among different genome positions was markedly uneven. Most of the identified ORFs encode all-α proteins, a portion of which contain transmembrane segments that resemble a group of known ssRNA phage lysis proteins, while many others represent previously uncharacterized families of globular or semi-globular α-helical proteins. We additionally uncovered a major class of globular α/β proteins and experimentally determined the structure of a representative protein of this group. These results pave the way for further functional studies of novel ssRNA phage proteins for a better understanding of this diverse virus group.</p>","PeriodicalId":20761,"journal":{"name":"Protein Science","volume":"35 5","pages":"e70582"},"PeriodicalIF":5.2000,"publicationDate":"2026-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC13114773/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Protein Science","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1002/pro.70582","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

The bacteriophages with single-stranded RNA (ssRNA) genomes (class Leviviricetes) are among the simplest known viruses that encode only three core proteins: a receptor-binding protein, a capsid protein, and an RNA-dependent RNA polymerase. The number of isolated ssRNA phages has remained very low, but the accumulating RNA metagenome data have uncovered a large variety of these viruses in many environments. Besides the core proteins, many of these genomes putatively encode additional proteins, which up to now have remained uncharacterized. We looked for non-conserved open reading frames (ORFs) in Leviviricetes sequences from the IMG/VR virus metagenome database and used sequence- and structure-based clustering to organize them into similarity groups. Potential ORFs were found throughout the ssRNA phage genomes but almost exclusively on the positive-sense RNA strand, suggestive of their protein-coding potential. The prevalence of the non-conserved ORFs varied in various phage lineages, and their distribution among different genome positions was markedly uneven. Most of the identified ORFs encode all-α proteins, a portion of which contain transmembrane segments that resemble a group of known ssRNA phage lysis proteins, while many others represent previously uncharacterized families of globular or semi-globular α-helical proteins. We additionally uncovered a major class of globular α/β proteins and experimentally determined the structure of a representative protein of this group. These results pave the way for further functional studies of novel ssRNA phage proteins for a better understanding of this diverse virus group.

ssRNA噬菌体宏基因组揭示了一系列新的蛋白质家族。
具有单链RNA (ssRNA)基因组的噬菌体是已知最简单的病毒之一,它只编码三种核心蛋白:受体结合蛋白、衣壳蛋白和依赖RNA的RNA聚合酶。分离的ssRNA噬菌体的数量仍然非常低,但积累的RNA宏基因组数据已经揭示了许多环境中这些病毒的种类繁多。除了核心蛋白外,这些基因组中有许多被认为编码了额外的蛋白质,这些蛋白质到目前为止仍未被表征。我们从IMG/VR病毒宏基因组数据库中寻找非保守的开放阅读框(orf),并使用基于序列和结构的聚类方法将它们组织成相似组。在整个ssRNA噬菌体基因组中都发现了潜在的orf,但几乎只在正义RNA链上,这表明它们具有蛋白质编码潜力。非保守orf在不同噬菌体谱系中的流行率不同,在不同基因组位置的分布明显不均匀。大多数已鉴定的orf编码所有α蛋白,其中一部分包含类似于一组已知的ssRNA噬菌体裂解蛋白的跨膜片段,而许多其他orf代表以前未表征的球形或半球形α-螺旋蛋白家族。我们还发现了一类主要的球状α/β蛋白,并通过实验确定了该类代表性蛋白的结构。这些结果为进一步研究新型ssRNA噬菌体蛋白的功能铺平了道路,从而更好地了解这种不同的病毒群。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Protein Science
Protein Science 生物-生化与分子生物学
CiteScore
12.40
自引率
1.20%
发文量
246
审稿时长
1 months
期刊介绍: Protein Science, the flagship journal of The Protein Society, is a publication that focuses on advancing fundamental knowledge in the field of protein molecules. The journal welcomes original reports and review articles that contribute to our understanding of protein function, structure, folding, design, and evolution. Additionally, Protein Science encourages papers that explore the applications of protein science in various areas such as therapeutics, protein-based biomaterials, bionanotechnology, synthetic biology, and bioelectronics. The journal accepts manuscript submissions in any suitable format for review, with the requirement of converting the manuscript to journal-style format only upon acceptance for publication. Protein Science is indexed and abstracted in numerous databases, including the Agricultural & Environmental Science Database (ProQuest), Biological Science Database (ProQuest), CAS: Chemical Abstracts Service (ACS), Embase (Elsevier), Health & Medical Collection (ProQuest), Health Research Premium Collection (ProQuest), Materials Science & Engineering Database (ProQuest), MEDLINE/PubMed (NLM), Natural Science Collection (ProQuest), and SciTech Premium Collection (ProQuest).
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信
小红书