Jānis Rūmnieks, Ieva Baltā, Mihails Šišovs, Kaspars Tārs
{"title":"ssRNA bacteriophage metagenomes reveal a diverse set of novel protein families.","authors":"Jānis Rūmnieks, Ieva Baltā, Mihails Šišovs, Kaspars Tārs","doi":"10.1002/pro.70582","DOIUrl":null,"url":null,"abstract":"<p><p>The bacteriophages with single-stranded RNA (ssRNA) genomes (class Leviviricetes) are among the simplest known viruses that encode only three core proteins: a receptor-binding protein, a capsid protein, and an RNA-dependent RNA polymerase. The number of isolated ssRNA phages has remained very low, but the accumulating RNA metagenome data have uncovered a large variety of these viruses in many environments. Besides the core proteins, many of these genomes putatively encode additional proteins, which up to now have remained uncharacterized. We looked for non-conserved open reading frames (ORFs) in Leviviricetes sequences from the IMG/VR virus metagenome database and used sequence- and structure-based clustering to organize them into similarity groups. Potential ORFs were found throughout the ssRNA phage genomes but almost exclusively on the positive-sense RNA strand, suggestive of their protein-coding potential. The prevalence of the non-conserved ORFs varied in various phage lineages, and their distribution among different genome positions was markedly uneven. Most of the identified ORFs encode all-α proteins, a portion of which contain transmembrane segments that resemble a group of known ssRNA phage lysis proteins, while many others represent previously uncharacterized families of globular or semi-globular α-helical proteins. We additionally uncovered a major class of globular α/β proteins and experimentally determined the structure of a representative protein of this group. These results pave the way for further functional studies of novel ssRNA phage proteins for a better understanding of this diverse virus group.</p>","PeriodicalId":20761,"journal":{"name":"Protein Science","volume":"35 5","pages":"e70582"},"PeriodicalIF":5.2000,"publicationDate":"2026-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC13114773/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Protein Science","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1002/pro.70582","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
The bacteriophages with single-stranded RNA (ssRNA) genomes (class Leviviricetes) are among the simplest known viruses that encode only three core proteins: a receptor-binding protein, a capsid protein, and an RNA-dependent RNA polymerase. The number of isolated ssRNA phages has remained very low, but the accumulating RNA metagenome data have uncovered a large variety of these viruses in many environments. Besides the core proteins, many of these genomes putatively encode additional proteins, which up to now have remained uncharacterized. We looked for non-conserved open reading frames (ORFs) in Leviviricetes sequences from the IMG/VR virus metagenome database and used sequence- and structure-based clustering to organize them into similarity groups. Potential ORFs were found throughout the ssRNA phage genomes but almost exclusively on the positive-sense RNA strand, suggestive of their protein-coding potential. The prevalence of the non-conserved ORFs varied in various phage lineages, and their distribution among different genome positions was markedly uneven. Most of the identified ORFs encode all-α proteins, a portion of which contain transmembrane segments that resemble a group of known ssRNA phage lysis proteins, while many others represent previously uncharacterized families of globular or semi-globular α-helical proteins. We additionally uncovered a major class of globular α/β proteins and experimentally determined the structure of a representative protein of this group. These results pave the way for further functional studies of novel ssRNA phage proteins for a better understanding of this diverse virus group.
期刊介绍:
Protein Science, the flagship journal of The Protein Society, is a publication that focuses on advancing fundamental knowledge in the field of protein molecules. The journal welcomes original reports and review articles that contribute to our understanding of protein function, structure, folding, design, and evolution.
Additionally, Protein Science encourages papers that explore the applications of protein science in various areas such as therapeutics, protein-based biomaterials, bionanotechnology, synthetic biology, and bioelectronics.
The journal accepts manuscript submissions in any suitable format for review, with the requirement of converting the manuscript to journal-style format only upon acceptance for publication.
Protein Science is indexed and abstracted in numerous databases, including the Agricultural & Environmental Science Database (ProQuest), Biological Science Database (ProQuest), CAS: Chemical Abstracts Service (ACS), Embase (Elsevier), Health & Medical Collection (ProQuest), Health Research Premium Collection (ProQuest), Materials Science & Engineering Database (ProQuest), MEDLINE/PubMed (NLM), Natural Science Collection (ProQuest), and SciTech Premium Collection (ProQuest).