产生帧移的蛋白质串联重复序列可以产生新的结构状态和功能。

IF 4.2
Zarifa Osmanli, Gudrun Aldrian, Jeremy Leclercq, Theo Falgarone, Santiago M Gómez Bergna, Denis N Prada Gori, Andrew V Oleinikov, Ilham Shahmuradov, Andrey V Kajava
{"title":"产生帧移的蛋白质串联重复序列可以产生新的结构状态和功能。","authors":"Zarifa Osmanli, Gudrun Aldrian, Jeremy Leclercq, Theo Falgarone, Santiago M Gómez Bergna, Denis N Prada Gori, Andrew V Oleinikov, Ilham Shahmuradov, Andrey V Kajava","doi":"10.1111/febs.70273","DOIUrl":null,"url":null,"abstract":"<p><p>The genetic code uses three-nucleotide units to encode each amino acid in proteins. Insertions or deletions of nucleotides not divisible by three shift the reading frames, resulting in significantly different protein sequences. These events are disruptive but can also create variability important for evolution. Previous studies suggested that the genetic code and gene sequences evolve to minimize frameshift effects, maintaining similar physicochemical properties to their reference proteins. Here, we focused on tandem repeat sequences, known as frameshift hotspots. Using cutting-edge bioinformatics tools, we compared reference and frameshifted protein sequences within tandem repeats across 50 prokaryotic and eukaryotic proteomes. We showed that, in contrast to the general tendency, frameshifts within these regions, especially with short repeats, lead to a significant increase in hydrophobicity and arginine content. Additionally, the frameshifts, particularly in short tandem repeats, rearrange transmembrane regions, potentially converting soluble proteins into membrane proteins and vice versa. Given their occurrence in rapidly evolving, essential proteins, such changes may promote rapid adaptability. Our large-scale alphafold modeling suggested that frameshift events can generate novel structures and functions, enabling the synthesis of multiple protein variants within the same coding region. Overall, frameshifts cause more drastic changes in tandem repeat sequences compared to non-repetitive sequences and therefore can be a primary cause of altered functions, cellular localization, and the development of various pathologies.</p>","PeriodicalId":94226,"journal":{"name":"The FEBS journal","volume":" ","pages":""},"PeriodicalIF":4.2000,"publicationDate":"2025-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Protein tandem repeats that produce frameshifts can generate new structural states and functions.\",\"authors\":\"Zarifa Osmanli, Gudrun Aldrian, Jeremy Leclercq, Theo Falgarone, Santiago M Gómez Bergna, Denis N Prada Gori, Andrew V Oleinikov, Ilham Shahmuradov, Andrey V Kajava\",\"doi\":\"10.1111/febs.70273\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>The genetic code uses three-nucleotide units to encode each amino acid in proteins. Insertions or deletions of nucleotides not divisible by three shift the reading frames, resulting in significantly different protein sequences. These events are disruptive but can also create variability important for evolution. Previous studies suggested that the genetic code and gene sequences evolve to minimize frameshift effects, maintaining similar physicochemical properties to their reference proteins. Here, we focused on tandem repeat sequences, known as frameshift hotspots. Using cutting-edge bioinformatics tools, we compared reference and frameshifted protein sequences within tandem repeats across 50 prokaryotic and eukaryotic proteomes. We showed that, in contrast to the general tendency, frameshifts within these regions, especially with short repeats, lead to a significant increase in hydrophobicity and arginine content. Additionally, the frameshifts, particularly in short tandem repeats, rearrange transmembrane regions, potentially converting soluble proteins into membrane proteins and vice versa. Given their occurrence in rapidly evolving, essential proteins, such changes may promote rapid adaptability. Our large-scale alphafold modeling suggested that frameshift events can generate novel structures and functions, enabling the synthesis of multiple protein variants within the same coding region. Overall, frameshifts cause more drastic changes in tandem repeat sequences compared to non-repetitive sequences and therefore can be a primary cause of altered functions, cellular localization, and the development of various pathologies.</p>\",\"PeriodicalId\":94226,\"journal\":{\"name\":\"The FEBS journal\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":4.2000,\"publicationDate\":\"2025-09-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"The FEBS journal\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1111/febs.70273\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"The FEBS journal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1111/febs.70273","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

遗传密码使用三个核苷酸单位来编码蛋白质中的每个氨基酸。不能被三分的核苷酸的插入或缺失使阅读框移位,导致蛋白质序列显著不同。这些事件是破坏性的,但也会产生对进化很重要的可变性。先前的研究表明,遗传密码和基因序列的进化使移码效应最小化,保持与其参考蛋白相似的物理化学性质。在这里,我们专注于串联重复序列,称为移码热点。利用尖端的生物信息学工具,我们比较了50个原核和真核蛋白质组串联重复序列中的参考和移框蛋白序列。我们发现,与一般趋势相反,这些区域内的帧移,特别是短重复,导致疏水性和精氨酸含量显著增加。此外,帧移,特别是在短串联重复序列中,重新排列跨膜区域,可能将可溶性蛋白转化为膜蛋白,反之亦然。鉴于它们发生在快速进化的必需蛋白质中,这种变化可能促进快速适应性。我们的大规模阿尔法折叠模型表明,移码事件可以产生新的结构和功能,从而在同一编码区域内合成多种蛋白质变体。总的来说,与非重复序列相比,帧移位导致串联重复序列发生更剧烈的变化,因此可能是功能改变、细胞定位和各种病理发展的主要原因。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Protein tandem repeats that produce frameshifts can generate new structural states and functions.

The genetic code uses three-nucleotide units to encode each amino acid in proteins. Insertions or deletions of nucleotides not divisible by three shift the reading frames, resulting in significantly different protein sequences. These events are disruptive but can also create variability important for evolution. Previous studies suggested that the genetic code and gene sequences evolve to minimize frameshift effects, maintaining similar physicochemical properties to their reference proteins. Here, we focused on tandem repeat sequences, known as frameshift hotspots. Using cutting-edge bioinformatics tools, we compared reference and frameshifted protein sequences within tandem repeats across 50 prokaryotic and eukaryotic proteomes. We showed that, in contrast to the general tendency, frameshifts within these regions, especially with short repeats, lead to a significant increase in hydrophobicity and arginine content. Additionally, the frameshifts, particularly in short tandem repeats, rearrange transmembrane regions, potentially converting soluble proteins into membrane proteins and vice versa. Given their occurrence in rapidly evolving, essential proteins, such changes may promote rapid adaptability. Our large-scale alphafold modeling suggested that frameshift events can generate novel structures and functions, enabling the synthesis of multiple protein variants within the same coding region. Overall, frameshifts cause more drastic changes in tandem repeat sequences compared to non-repetitive sequences and therefore can be a primary cause of altered functions, cellular localization, and the development of various pathologies.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信