人类蛋白质中的同源重复密码子频率是否存在偏差?

IF 2 4区 生物学 Q2 BIOLOGY
Nikita V. Dovidchenko , Mikhail Yu. Lobanov , Oxana V. Galzitskaya
{"title":"人类蛋白质中的同源重复密码子频率是否存在偏差?","authors":"Nikita V. Dovidchenko ,&nbsp;Mikhail Yu. Lobanov ,&nbsp;Oxana V. Galzitskaya","doi":"10.1016/j.biosystems.2024.105357","DOIUrl":null,"url":null,"abstract":"<div><div>It is well known that there is a codon usage bias in genomes, that is, some codons are observed more often than others. Codons implicated in the homo-repeats regions in human proteins are no exception. In this work, we analyzed the codon usage bias for all amino acid residues in homo-repeats larger than 4 in 3753 human proteins from 20447 protein sequences from the canonically reviewed human proteome. We have discovered that almost all homo-repeats in the human proteome, most of which encode Ala, Glu, Gly, Leu, Pro, and Ser (∼80% of all homo-repeats), have a codon usage bias, i.e. are mainly encoded by one codon. Moreover, there is a strong shift in homo-repeats in favor of the content of GC rich codons. Homo-repeats with Ala, Glu, Gly, Leu, Pro, and Ser predominate in the PDB, which has both ordered and disordered status. Examining the distribution of splicing sites, we found that about 15% of homo-repeats either contain or are located within 10 nucleotides of the splicing site, and Glu and Leu predominate in these homo-repeats. Our data is important for future study of the functions of homo-repeats, protein-protein interactions, and evolutionary fitness.</div></div>","PeriodicalId":50730,"journal":{"name":"Biosystems","volume":"246 ","pages":"Article 105357"},"PeriodicalIF":2.0000,"publicationDate":"2024-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Is there a bias in the codon frequency corresponding to homo-repeats found in human proteins?\",\"authors\":\"Nikita V. Dovidchenko ,&nbsp;Mikhail Yu. Lobanov ,&nbsp;Oxana V. Galzitskaya\",\"doi\":\"10.1016/j.biosystems.2024.105357\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>It is well known that there is a codon usage bias in genomes, that is, some codons are observed more often than others. Codons implicated in the homo-repeats regions in human proteins are no exception. In this work, we analyzed the codon usage bias for all amino acid residues in homo-repeats larger than 4 in 3753 human proteins from 20447 protein sequences from the canonically reviewed human proteome. We have discovered that almost all homo-repeats in the human proteome, most of which encode Ala, Glu, Gly, Leu, Pro, and Ser (∼80% of all homo-repeats), have a codon usage bias, i.e. are mainly encoded by one codon. Moreover, there is a strong shift in homo-repeats in favor of the content of GC rich codons. Homo-repeats with Ala, Glu, Gly, Leu, Pro, and Ser predominate in the PDB, which has both ordered and disordered status. Examining the distribution of splicing sites, we found that about 15% of homo-repeats either contain or are located within 10 nucleotides of the splicing site, and Glu and Leu predominate in these homo-repeats. Our data is important for future study of the functions of homo-repeats, protein-protein interactions, and evolutionary fitness.</div></div>\",\"PeriodicalId\":50730,\"journal\":{\"name\":\"Biosystems\",\"volume\":\"246 \",\"pages\":\"Article 105357\"},\"PeriodicalIF\":2.0000,\"publicationDate\":\"2024-10-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Biosystems\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0303264724002429\",\"RegionNum\":4,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biosystems","FirstCategoryId":"99","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0303264724002429","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

众所周知,基因组中存在密码子使用偏差,即某些密码子比其他密码子更常见。涉及人类蛋白质中同源重复区域的密码子也不例外。在这项工作中,我们分析了来自规范审查的人类蛋白质组 20447 个蛋白质序列中 3753 个人类蛋白质中大于 4 的同源重复序列中所有氨基酸残基的密码子使用偏倚。我们发现,人类蛋白质组中几乎所有的同源重复序列(其中大部分编码 Ala、Glu、Gly、Leu、Pro 和 Ser,占所有同源重复序列的 80%)都存在密码子使用偏差,即主要由一个密码子编码。此外,在同源重复序列中,富含 GC 的密码子的含量有很大的变化。在具有有序和无序状态的 PDB 中,Ala、Glu、Gly、Leu、Pro 和 Ser 的同源重复占主导地位。通过研究剪接位点的分布,我们发现约有 15%的同源重复序列包含剪接位点或位于剪接位点 10 个核苷酸以内,而在这些同源重复序列中,Glu 和 Leu 占主导地位。我们的数据对今后研究同源重复序列的功能、蛋白质与蛋白质之间的相互作用以及进化适应性具有重要意义。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Is there a bias in the codon frequency corresponding to homo-repeats found in human proteins?
It is well known that there is a codon usage bias in genomes, that is, some codons are observed more often than others. Codons implicated in the homo-repeats regions in human proteins are no exception. In this work, we analyzed the codon usage bias for all amino acid residues in homo-repeats larger than 4 in 3753 human proteins from 20447 protein sequences from the canonically reviewed human proteome. We have discovered that almost all homo-repeats in the human proteome, most of which encode Ala, Glu, Gly, Leu, Pro, and Ser (∼80% of all homo-repeats), have a codon usage bias, i.e. are mainly encoded by one codon. Moreover, there is a strong shift in homo-repeats in favor of the content of GC rich codons. Homo-repeats with Ala, Glu, Gly, Leu, Pro, and Ser predominate in the PDB, which has both ordered and disordered status. Examining the distribution of splicing sites, we found that about 15% of homo-repeats either contain or are located within 10 nucleotides of the splicing site, and Glu and Leu predominate in these homo-repeats. Our data is important for future study of the functions of homo-repeats, protein-protein interactions, and evolutionary fitness.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Biosystems
Biosystems 生物-生物学
CiteScore
3.70
自引率
18.80%
发文量
129
审稿时长
34 days
期刊介绍: BioSystems encourages experimental, computational, and theoretical articles that link biology, evolutionary thinking, and the information processing sciences. The link areas form a circle that encompasses the fundamental nature of biological information processing, computational modeling of complex biological systems, evolutionary models of computation, the application of biological principles to the design of novel computing systems, and the use of biomolecular materials to synthesize artificial systems that capture essential principles of natural biological information processing.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信