Machine learning approach identifies prominent codons from different degenerate groups influencing gene expression in bacteria.

IF 1.3
Piyali Sen, Annushree Kurmi, Suvendra Kumar Ray, Siddhartha Sankar Satapathy
Genes to Cells: Devoted to Molecular & Cellular Mechanisms, pp. 591-601. Journal Article. Published 2022-10-01 (Epub 2022-08-22). DOI: 10.1111/gtc.12977 (https://doi.org/10.1111/gtc.12977)

Abstract

Unequal usage of synonymous codons is known as codon usage bias (CUB). Although CUB generally differs between high-expression genes (HEG) and low-expression genes (LEG) in organisms, this difference has not yet been adequately reported across different bacteria. In this study, a machine learning-based approach was first implemented to find codons whose usage differs significantly between the HEG and LEG in Escherichia coli. It identified the Cys codons UGU and UGC and the Lys codons AAA and AAG as the least influenced by gene expression. Codons such as UCU (Ser), CUG (Leu), GGG (Gly), and CGG (Arg) were identified as the most strongly influenced by gene expression. The study was then extended to analyze codon usage in 683 other bacterial species. Across these species, the Cys (UGU/UGC) and Ser (AGU/AGC) codons were identified as the least different between the two groups of genes, whereas codons such as CGA, CUG, GGG, GCC, ACC, AUA, and AUC were identified as influenced by gene expression in the majority of species. This study supports the role of CUB in gene expression across bacteria and demonstrates a commonality among bacteria in the behavior of certain codons with regard to gene expression.
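
The abstract does not name the machine learning model or the features it uses, so the following is only a minimal sketch of the underlying comparison, not the authors' pipeline. It assumes that a codon's usage is measured as its share within its own synonymous (degenerate) group, pooled over a set of genes, and that the HEG and LEG groups are compared by the absolute difference of those shares. The function names, the two-family table, and the toy sequences are all hypothetical and chosen only for illustration.

```python
from collections import Counter

# Two synonymous families only, to keep the sketch short (RNA alphabet,
# as in the abstract). A real analysis would cover every degenerate group.
SYNONYMOUS_FAMILIES = {
    "Cys": ["UGU", "UGC"],
    "Gly": ["GGU", "GGC", "GGA", "GGG"],
}


def codon_counts(seq):
    """Count codons in a coding sequence read in frame from position 0."""
    seq = seq.upper().replace("T", "U")
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return Counter(seq[i:i + 3] for i in range(0, usable, 3))


def family_fractions(seqs, families=SYNONYMOUS_FAMILIES):
    """Pool counts over genes; return each codon's share within its family."""
    total = Counter()
    for s in seqs:
        total.update(codon_counts(s))
    fractions = {}
    for codons in families.values():
        n = sum(total[c] for c in codons)
        for c in codons:
            fractions[c] = total[c] / n if n else 0.0
    return fractions


def usage_difference(heg_seqs, leg_seqs):
    """Absolute difference in within-family share between two gene groups."""
    f_heg = family_fractions(heg_seqs)
    f_leg = family_fractions(leg_seqs)
    return {c: abs(f_heg[c] - f_leg[c]) for c in f_heg}


# Made-up toy sequences (assumption): not real genes and not real HEG/LEG data.
toy_heg = ["UGUUGCGGUGGC", "UGCUGUGGUGGC"]
toy_leg = ["UGUUGCGGGGGA", "UGCUGUGGUGGG"]

diff = usage_difference(toy_heg, toy_leg)
for codon, d in sorted(diff.items(), key=lambda kv: kv[1]):
    print(codon, round(d, 2))
```

On this toy input the Cys codons have the same share in both groups while the Gly codons differ, which mirrors the kind of contrast the abstract describes. A classifier trained on such per-gene shares, with some measure of feature importance, would be one plausible way to rank codons, but that choice is an assumption here.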
