Is there a bias in the codon frequency corresponding to homo-repeats found in human proteins?

IF 4.6 Q2 MATERIALS SCIENCE, BIOMATERIALS
Nikita V. Dovidchenko , Mikhail Yu. Lobanov , Oxana V. Galzitskaya
{"title":"Is there a bias in the codon frequency corresponding to homo-repeats found in human proteins?","authors":"Nikita V. Dovidchenko ,&nbsp;Mikhail Yu. Lobanov ,&nbsp;Oxana V. Galzitskaya","doi":"10.1016/j.biosystems.2024.105357","DOIUrl":null,"url":null,"abstract":"<div><div>It is well known that there is a codon usage bias in genomes, that is, some codons are observed more often than others. Codons implicated in the homo-repeats regions in human proteins are no exception. In this work, we analyzed the codon usage bias for all amino acid residues in homo-repeats larger than 4 in 3753 human proteins from 20447 protein sequences from the canonically reviewed human proteome. We have discovered that almost all homo-repeats in the human proteome, most of which encode Ala, Glu, Gly, Leu, Pro, and Ser (∼80% of all homo-repeats), have a codon usage bias, i.e. are mainly encoded by one codon. Moreover, there is a strong shift in homo-repeats in favor of the content of GC rich codons. Homo-repeats with Ala, Glu, Gly, Leu, Pro, and Ser predominate in the PDB, which has both ordered and disordered status. Examining the distribution of splicing sites, we found that about 15% of homo-repeats either contain or are located within 10 nucleotides of the splicing site, and Glu and Leu predominate in these homo-repeats. Our data is important for future study of the functions of homo-repeats, protein-protein interactions, and evolutionary fitness.</div></div>","PeriodicalId":2,"journal":{"name":"ACS Applied Bio Materials","volume":null,"pages":null},"PeriodicalIF":4.6000,"publicationDate":"2024-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Applied Bio Materials","FirstCategoryId":"99","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0303264724002429","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MATERIALS SCIENCE, BIOMATERIALS","Score":null,"Total":0}
引用次数: 0

Abstract

It is well known that there is a codon usage bias in genomes, that is, some codons are observed more often than others. Codons implicated in the homo-repeats regions in human proteins are no exception. In this work, we analyzed the codon usage bias for all amino acid residues in homo-repeats larger than 4 in 3753 human proteins from 20447 protein sequences from the canonically reviewed human proteome. We have discovered that almost all homo-repeats in the human proteome, most of which encode Ala, Glu, Gly, Leu, Pro, and Ser (∼80% of all homo-repeats), have a codon usage bias, i.e. are mainly encoded by one codon. Moreover, there is a strong shift in homo-repeats in favor of the content of GC rich codons. Homo-repeats with Ala, Glu, Gly, Leu, Pro, and Ser predominate in the PDB, which has both ordered and disordered status. Examining the distribution of splicing sites, we found that about 15% of homo-repeats either contain or are located within 10 nucleotides of the splicing site, and Glu and Leu predominate in these homo-repeats. Our data is important for future study of the functions of homo-repeats, protein-protein interactions, and evolutionary fitness.
人类蛋白质中的同源重复密码子频率是否存在偏差?
众所周知,基因组中存在密码子使用偏差,即某些密码子比其他密码子更常见。涉及人类蛋白质中同源重复区域的密码子也不例外。在这项工作中,我们分析了来自规范审查的人类蛋白质组 20447 个蛋白质序列中 3753 个人类蛋白质中大于 4 的同源重复序列中所有氨基酸残基的密码子使用偏倚。我们发现,人类蛋白质组中几乎所有的同源重复序列(其中大部分编码 Ala、Glu、Gly、Leu、Pro 和 Ser,占所有同源重复序列的 80%)都存在密码子使用偏差,即主要由一个密码子编码。此外,在同源重复序列中,富含 GC 的密码子的含量有很大的变化。在具有有序和无序状态的 PDB 中,Ala、Glu、Gly、Leu、Pro 和 Ser 的同源重复占主导地位。通过研究剪接位点的分布,我们发现约有 15%的同源重复序列包含剪接位点或位于剪接位点 10 个核苷酸以内,而在这些同源重复序列中,Glu 和 Leu 占主导地位。我们的数据对今后研究同源重复序列的功能、蛋白质与蛋白质之间的相互作用以及进化适应性具有重要意义。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
ACS Applied Bio Materials
ACS Applied Bio Materials Chemistry-Chemistry (all)
CiteScore
9.40
自引率
2.10%
发文量
464
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信