蛋白质结构域插入结构的家族级专业化。

IF 5.2 3区 生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY
Protein Science Pub Date : 2026-05-01 DOI:10.1002/pro.70586
R Dustin Schaeffer, Rui Guo, Jing Zhang, Qian Cong, Nick V Grishin
{"title":"蛋白质结构域插入结构的家族级专业化。","authors":"R Dustin Schaeffer, Rui Guo, Jing Zhang, Qian Cong, Nick V Grishin","doi":"10.1002/pro.70586","DOIUrl":null,"url":null,"abstract":"<p><p>Domain insertion creates architectures where one domain interrupts another's sequence. Analysis across 2.7 million classified domains reveals that insertions occur in 20% of multidomain proteins, with 331 families exhibiting consistent architectural roles: 162 function exclusively as hosts, while 169 exclusively serve as inserted modules, such as zinc-binding dehydrogenases appearing as insertions across 450 events. The remaining 1116 families with sufficient insertion activity demonstrate versatile behavior, adopting different roles depending on partnership context. Size analysis shows inserted domains are consistently smaller than their hosts (median 115 vs. 199 residues), with role-consistent families exhibiting 1.7-fold size differences. Insertions frequently involve domains from different structural superfamilies: 31,925 events (65.8% of total) occur between families from different H-groups, such as P-loop hydrolases with tRNA modification domains. While most insertions are simple single-level architectures, insertion mechanisms can create complex organizations, including six-level nested structures in cyanobacterial RNA polymerase. This work provides a comprehensive dataset of 48,551 insertion events across 5701 families, with quantitative characterization of size relationships and partnership patterns that can inform structure prediction and protein design efforts.</p>","PeriodicalId":20761,"journal":{"name":"Protein Science","volume":"35 5","pages":"e70586"},"PeriodicalIF":5.2000,"publicationDate":"2026-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC13114784/pdf/","citationCount":"0","resultStr":"{\"title\":\"Family-level specialization in protein domain insertion architectures.\",\"authors\":\"R Dustin Schaeffer, Rui Guo, Jing Zhang, Qian Cong, Nick V Grishin\",\"doi\":\"10.1002/pro.70586\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Domain insertion creates architectures where one domain interrupts another's sequence. Analysis across 2.7 million classified domains reveals that insertions occur in 20% of multidomain proteins, with 331 families exhibiting consistent architectural roles: 162 function exclusively as hosts, while 169 exclusively serve as inserted modules, such as zinc-binding dehydrogenases appearing as insertions across 450 events. The remaining 1116 families with sufficient insertion activity demonstrate versatile behavior, adopting different roles depending on partnership context. Size analysis shows inserted domains are consistently smaller than their hosts (median 115 vs. 199 residues), with role-consistent families exhibiting 1.7-fold size differences. Insertions frequently involve domains from different structural superfamilies: 31,925 events (65.8% of total) occur between families from different H-groups, such as P-loop hydrolases with tRNA modification domains. While most insertions are simple single-level architectures, insertion mechanisms can create complex organizations, including six-level nested structures in cyanobacterial RNA polymerase. This work provides a comprehensive dataset of 48,551 insertion events across 5701 families, with quantitative characterization of size relationships and partnership patterns that can inform structure prediction and protein design efforts.</p>\",\"PeriodicalId\":20761,\"journal\":{\"name\":\"Protein Science\",\"volume\":\"35 5\",\"pages\":\"e70586\"},\"PeriodicalIF\":5.2000,\"publicationDate\":\"2026-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC13114784/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Protein Science\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1002/pro.70586\",\"RegionNum\":3,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"BIOCHEMISTRY & MOLECULAR BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Protein Science","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1002/pro.70586","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

域插入创建了一个域中断另一个域序列的体系结构。对270万个分类结构域的分析表明,插入发生在20%的多结构域蛋白中,其中331个家族表现出一致的结构角色:162个家族完全作为宿主,而169个家族完全作为插入模块,例如锌结合脱氢酶在450个事件中作为插入出现。其余1116个具有充分插入活动的家庭表现出多种行为,根据伙伴关系背景采取不同的角色。大小分析显示,插入的结构域始终小于其宿主(中位数为115对199个残基),角色一致的家族显示出1.7倍的大小差异。插入经常涉及来自不同结构超家族的结构域:31,925个事件(占总数的65.8%)发生在来自不同h基团的家族之间,例如具有tRNA修饰结构域的p环水解酶。虽然大多数插入都是简单的单级结构,但插入机制可以创建复杂的组织,包括蓝藻RNA聚合酶中的六层嵌套结构。这项工作提供了5701个家族中48551个插入事件的综合数据集,并对大小关系和伙伴关系模式进行了定量表征,可以为结构预测和蛋白质设计工作提供信息。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Family-level specialization in protein domain insertion architectures.

Domain insertion creates architectures where one domain interrupts another's sequence. Analysis across 2.7 million classified domains reveals that insertions occur in 20% of multidomain proteins, with 331 families exhibiting consistent architectural roles: 162 function exclusively as hosts, while 169 exclusively serve as inserted modules, such as zinc-binding dehydrogenases appearing as insertions across 450 events. The remaining 1116 families with sufficient insertion activity demonstrate versatile behavior, adopting different roles depending on partnership context. Size analysis shows inserted domains are consistently smaller than their hosts (median 115 vs. 199 residues), with role-consistent families exhibiting 1.7-fold size differences. Insertions frequently involve domains from different structural superfamilies: 31,925 events (65.8% of total) occur between families from different H-groups, such as P-loop hydrolases with tRNA modification domains. While most insertions are simple single-level architectures, insertion mechanisms can create complex organizations, including six-level nested structures in cyanobacterial RNA polymerase. This work provides a comprehensive dataset of 48,551 insertion events across 5701 families, with quantitative characterization of size relationships and partnership patterns that can inform structure prediction and protein design efforts.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Protein Science
Protein Science 生物-生化与分子生物学
CiteScore
12.40
自引率
1.20%
发文量
246
审稿时长
1 months
期刊介绍: Protein Science, the flagship journal of The Protein Society, is a publication that focuses on advancing fundamental knowledge in the field of protein molecules. The journal welcomes original reports and review articles that contribute to our understanding of protein function, structure, folding, design, and evolution. Additionally, Protein Science encourages papers that explore the applications of protein science in various areas such as therapeutics, protein-based biomaterials, bionanotechnology, synthetic biology, and bioelectronics. The journal accepts manuscript submissions in any suitable format for review, with the requirement of converting the manuscript to journal-style format only upon acceptance for publication. Protein Science is indexed and abstracted in numerous databases, including the Agricultural & Environmental Science Database (ProQuest), Biological Science Database (ProQuest), CAS: Chemical Abstracts Service (ACS), Embase (Elsevier), Health & Medical Collection (ProQuest), Health Research Premium Collection (ProQuest), Materials Science & Engineering Database (ProQuest), MEDLINE/PubMed (NLM), Natural Science Collection (ProQuest), and SciTech Premium Collection (ProQuest).
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信
小红书