Maahil Arshad, Matthew Uchmanowicz, Vanshika Rana, Brett Trost, Stephen W Scherer, Muhammad Arshad Rafiq
{"title":"Mapping the inter- and intra-genic codon-usage landscape in <i>Homo sapiens</i>.","authors":"Maahil Arshad, Matthew Uchmanowicz, Vanshika Rana, Brett Trost, Stephen W Scherer, Muhammad Arshad Rafiq","doi":"10.1093/nargab/lqag024","DOIUrl":null,"url":null,"abstract":"<p><p>Although the genetic code is degenerate, codon selection is nonrandom and reflects significant functional constraints. Codon-usage bias (CUB) acts as a layer of post-transcriptional regulation, influencing messenger RNA (mRNA) stability, translation kinetics, and co-translational protein folding. While CUB is well-characterized in unicellular organisms, its regulatory scope and functional consequences in humans remain complex and less defined. Our study offers a comprehensive evaluation of human codon usage. We report that genes exhibiting the strongest codon bias are enriched in high-stoichiometry biological processes, such as skin development and oxygen/carbon dioxide transport, and harbor significantly fewer synonymous variants than expected (ρ = -0.24, <i>P </i>< 2.2 × 10<sup>-16</sup>). Furthermore, we find that codon optimization is structurally distinct: it is significantly more pronounced in structured protein domains compared to intrinsically disordered regions (IDRs) (Cliff's Δ= 0.26, <i>P </i>< 2.2 × 10<sup>-16</sup>). Consistent with translational selection, the most frequently used codons are supported by higher transfer RNA (tRNA) gene copy numbers (ρ = 0.49, <i>P</i> < 6.4 × 10<sup>-4</sup>). Finally, by correcting for GC3 content, we reveal that the apparent correlation between effective number of codon and adaptation indices (CAI/tAI) vanishes, allowing us to disentangle mutational pressure from translational selection. Collectively, our findings position CUB as a central, evolutionarily conserved regulator of translation and protein folding in humans. Our results provide a comprehensive and integrated view of intergenic and intragenic CUB in humans, reinforcing the biological relevance of synonymous codon choice in shaping translational dynamics and protein biogenesis. This provides a refined framework for interpreting synonymous variation and guiding functional genomics.</p>","PeriodicalId":33994,"journal":{"name":"NAR Genomics and Bioinformatics","volume":"8 1","pages":"lqag024"},"PeriodicalIF":2.8000,"publicationDate":"2026-03-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12954173/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"NAR Genomics and Bioinformatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/nargab/lqag024","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2026/3/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0
Abstract
Although the genetic code is degenerate, codon selection is nonrandom and reflects significant functional constraints. Codon-usage bias (CUB) acts as a layer of post-transcriptional regulation, influencing messenger RNA (mRNA) stability, translation kinetics, and co-translational protein folding. While CUB is well-characterized in unicellular organisms, its regulatory scope and functional consequences in humans remain complex and less defined. Our study offers a comprehensive evaluation of human codon usage. We report that genes exhibiting the strongest codon bias are enriched in high-stoichiometry biological processes, such as skin development and oxygen/carbon dioxide transport, and harbor significantly fewer synonymous variants than expected (ρ = -0.24, P < 2.2 × 10-16). Furthermore, we find that codon optimization is structurally distinct: it is significantly more pronounced in structured protein domains compared to intrinsically disordered regions (IDRs) (Cliff's Δ= 0.26, P < 2.2 × 10-16). Consistent with translational selection, the most frequently used codons are supported by higher transfer RNA (tRNA) gene copy numbers (ρ = 0.49, P < 6.4 × 10-4). Finally, by correcting for GC3 content, we reveal that the apparent correlation between effective number of codon and adaptation indices (CAI/tAI) vanishes, allowing us to disentangle mutational pressure from translational selection. Collectively, our findings position CUB as a central, evolutionarily conserved regulator of translation and protein folding in humans. Our results provide a comprehensive and integrated view of intergenic and intragenic CUB in humans, reinforcing the biological relevance of synonymous codon choice in shaping translational dynamics and protein biogenesis. This provides a refined framework for interpreting synonymous variation and guiding functional genomics.
虽然遗传密码是退化的,但密码子选择是非随机的,反映了显著的功能约束。密码子使用偏差(CUB)作为转录后调控层,影响信使RNA (mRNA)的稳定性、翻译动力学和共翻译蛋白折叠。虽然CUB在单细胞生物中有很好的特征,但其在人类中的调节范围和功能后果仍然复杂且不太明确。我们的研究提供了一个全面的评估人类密码子的使用。我们报告说,表现出最强密码子偏好的基因在高化学测量生物学过程中丰富,如皮肤发育和氧气/二氧化碳运输,并且具有比预期更少的同义变体(ρ = -0.24, P -16)。此外,我们发现密码子优化在结构上是不同的:与内在无序区域(idr)相比,它在结构蛋白区域中更为明显(Cliff's Δ= 0.26, P -16)。与翻译选择一致,最常用的密码子受到更高的转移RNA (tRNA)基因拷贝数的支持(ρ = 0.49, P -4)。最后,通过校正GC3含量,我们发现有效密码子数与适应指数(CAI/tAI)之间的明显相关性消失,使我们能够解开突变压力与翻译选择的关系。总的来说,我们的研究结果表明,CUB是人类翻译和蛋白质折叠的核心、进化保守的调节因子。我们的研究结果为人类基因间和基因内的CUB提供了一个全面和综合的观点,加强了同义密码子选择在塑造翻译动力学和蛋白质生物发生中的生物学相关性。这为解释同义变异和指导功能基因组学提供了一个完善的框架。