The protein structurome of Orthornavirae and its dark matter.

IF 5.1 1区 生物学 Q1 MICROBIOLOGY
mBio Pub Date : 2025-02-05 Epub Date: 2024-12-23 DOI:10.1128/mbio.03200-24
Pascal Mutz, Antonio Pedro Camargo, Harutyun Sahakyan, Uri Neri, Anamarija Butkovic, Yuri I Wolf, Mart Krupovic, Valerian V Dolja, Eugene V Koonin
{"title":"The protein structurome of Orthornavirae and its dark matter.","authors":"Pascal Mutz, Antonio Pedro Camargo, Harutyun Sahakyan, Uri Neri, Anamarija Butkovic, Yuri I Wolf, Mart Krupovic, Valerian V Dolja, Eugene V Koonin","doi":"10.1128/mbio.03200-24","DOIUrl":null,"url":null,"abstract":"<p><p>Metatranscriptomics is uncovering more and more diverse families of viruses with RNA genomes comprising the viral kingdom Orthornavirae in the realm Riboviria. Thorough protein annotation and comparison are essential to get insights into the functions of viral proteins and virus evolution. In addition to sequence- and hmm profile‑based methods, protein structure comparison adds a powerful tool to uncover protein functions and relationships. We constructed an Orthornavirae \"structurome\" consisting of already annotated as well as unannotated (\"dark matter\") proteins and domains encoded in viral genomes. We used protein structure modeling and similarity searches to illuminate the remaining dark matter in hundreds of thousands of orthornavirus genomes. The vast majority of the dark matter domains showed either \"generic\" folds, such as single α-helices, or no high confidence structure predictions. Nevertheless, a variety of lineage-specific globular domains that were new either to orthornaviruses in general or to particular virus families were identified within the proteomic dark matter of orthornaviruses, including several predicted nucleic acid-binding domains and nucleases. In addition, we identified a case of exaptation of a cellular nucleoside monophosphate kinase as an RNA-binding protein in several virus families. Notwithstanding the continuing discovery of numerous orthornaviruses, it appears that all the protein domains conserved in large groups of viruses have already been identified. The rest of the viral proteome seems to be dominated by poorly structured domains including intrinsically disordered ones that likely mediate specific virus-host interactions.</p><p><strong>Importance: </strong>Advanced methods for protein structure prediction, such as AlphaFold2, greatly expand our capability to identify protein domains and infer their likely functions and evolutionary relationships. This is particularly pertinent for proteins encoded by viruses that are known to evolve rapidly and as a result often cannot be adequately characterized by analysis of the protein sequences. We performed an exhaustive structure prediction and comparative analysis for uncharacterized proteins and domains (\"dark matter\") encoded by viruses with RNA genomes. The results show the dark matter of RNA virus proteome consists mostly of disordered and all-α-helical domains that cannot be readily assigned a specific function and that likely mediate various interactions between viral proteins and between viral and host proteins. The great majority of globular proteins and domains of RNA viruses are already known although we identified several unexpected domains represented in individual viral families.</p>","PeriodicalId":18315,"journal":{"name":"mBio","volume":" ","pages":"e0320024"},"PeriodicalIF":5.1000,"publicationDate":"2025-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11796362/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"mBio","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1128/mbio.03200-24","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/12/23 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"MICROBIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Metatranscriptomics is uncovering more and more diverse families of viruses with RNA genomes comprising the viral kingdom Orthornavirae in the realm Riboviria. Thorough protein annotation and comparison are essential to get insights into the functions of viral proteins and virus evolution. In addition to sequence- and hmm profile‑based methods, protein structure comparison adds a powerful tool to uncover protein functions and relationships. We constructed an Orthornavirae "structurome" consisting of already annotated as well as unannotated ("dark matter") proteins and domains encoded in viral genomes. We used protein structure modeling and similarity searches to illuminate the remaining dark matter in hundreds of thousands of orthornavirus genomes. The vast majority of the dark matter domains showed either "generic" folds, such as single α-helices, or no high confidence structure predictions. Nevertheless, a variety of lineage-specific globular domains that were new either to orthornaviruses in general or to particular virus families were identified within the proteomic dark matter of orthornaviruses, including several predicted nucleic acid-binding domains and nucleases. In addition, we identified a case of exaptation of a cellular nucleoside monophosphate kinase as an RNA-binding protein in several virus families. Notwithstanding the continuing discovery of numerous orthornaviruses, it appears that all the protein domains conserved in large groups of viruses have already been identified. The rest of the viral proteome seems to be dominated by poorly structured domains including intrinsically disordered ones that likely mediate specific virus-host interactions.

Importance: Advanced methods for protein structure prediction, such as AlphaFold2, greatly expand our capability to identify protein domains and infer their likely functions and evolutionary relationships. This is particularly pertinent for proteins encoded by viruses that are known to evolve rapidly and as a result often cannot be adequately characterized by analysis of the protein sequences. We performed an exhaustive structure prediction and comparative analysis for uncharacterized proteins and domains ("dark matter") encoded by viruses with RNA genomes. The results show the dark matter of RNA virus proteome consists mostly of disordered and all-α-helical domains that cannot be readily assigned a specific function and that likely mediate various interactions between viral proteins and between viral and host proteins. The great majority of globular proteins and domains of RNA viruses are already known although we identified several unexpected domains represented in individual viral families.

Orthornavirae的蛋白质结构及其暗物质。
亚转录组学正在发现越来越多的不同的病毒家族,其RNA基因组包括病毒王国Orthornavirae在Riboviria领域。彻底的蛋白质注释和比较是深入了解病毒蛋白质功能和病毒进化的必要条件。除了基于序列和hmm谱的方法,蛋白质结构比较增加了一个强大的工具来揭示蛋白质的功能和关系。我们构建了一个Orthornavirae“结构体”,由已经注释的和未注释的(“暗物质”)蛋白质和病毒基因组编码的结构域组成。我们使用蛋白质结构建模和相似性搜索来阐明数十万个orththornavvirus基因组中剩余的暗物质。绝大多数暗物质区域要么显示出“一般的”褶皱,比如单个α-螺旋,要么没有高可信度的结构预测。然而,在直胸腺病毒的蛋白质组暗物质中发现了各种谱系特异性的球状结构域,这些结构域对于一般的直胸腺病毒或特定的病毒科来说都是新的,包括一些预测的核酸结合结构域和核酸酶。此外,我们在几个病毒家族中发现了细胞核苷单磷酸激酶作为rna结合蛋白的缺失。尽管不断发现大量的直鼻病毒,但似乎大部分病毒中保守的所有蛋白质结构域都已被确定。病毒蛋白质组的其余部分似乎由结构不良的结构域主导,包括可能介导特异性病毒-宿主相互作用的内在无序结构域。重要性:先进的蛋白质结构预测方法,如AlphaFold2,极大地扩展了我们识别蛋白质结构域并推断其可能的功能和进化关系的能力。这尤其与已知进化迅速的病毒编码的蛋白质有关,因此通常不能通过分析蛋白质序列来充分表征。我们对带有RNA基因组的病毒编码的未表征蛋白质和结构域(“暗物质”)进行了详尽的结构预测和比较分析。结果表明,RNA病毒蛋白质组的暗物质主要由无序和全α-螺旋结构域组成,这些结构域不容易被指定特定的功能,可能介导病毒蛋白之间以及病毒与宿主蛋白之间的各种相互作用。绝大多数的球形蛋白和RNA病毒的结构域是已知的,尽管我们在单个病毒家族中发现了一些意想不到的结构域。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
mBio
mBio MICROBIOLOGY-
CiteScore
10.50
自引率
3.10%
发文量
762
审稿时长
1 months
期刊介绍: mBio® is ASM''s first broad-scope, online-only, open access journal. mBio offers streamlined review and publication of the best research in microbiology and allied fields.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信