Vincent G Osnaya, Laura Gómez-Romero, Gabriel Moreno-Hagelsieb, Greco Hernández
RNA Biology, pp. 1-5. Epub 2025-02-13. DOI: 10.1080/15476286.2025.2465196 (https://doi.org/10.1080/15476286.2025.2465196). PDF (PMC): https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11834415/pdf/
AUGcontext DB: a comprehensive catalog of the mRNA AUG initiator codon context across eukaryotes.
Abstract
Introduction: mRNA translation defines the composition of the cell proteome in all forms of life and in disease. In this process, precise selection of the mRNA translation initiation site (TIS) is crucial, because it establishes the correct open reading frame for triplet decoding.
Methods: We gathered and curated all published TIS consensus context sequences. We also included the TIS consensus context from 538 novel fungal genomes available from NCBI's RefSeq database. To do so, we wrote ad hoc programs in Perl to find and extract the TIS of each annotated gene, plus ten bases upstream and three downstream. For each genome, the sequences around the TIS of each gene were obtained, and the consensus was then calculated according to the Cavener rules and by the LOGOS algorithm.
Results: We created AUGcontext DB, a portal with a comprehensive collection of TIS context sequences across eukaryotes, spanning positions -10 to +6. The compilation covers 30 vertebrate, 17 invertebrate, 25 plant, 14 fungal, and 11 protist species studied in silico; 23 experimental studies; data on biotechnology; and 8 diseases associated with specific mutations. Additionally, TIS context sequences of cellular IRESs were included. AUGcontext DB belongs to the National Institute of Cancer (Instituto Nacional de Cancerología, INCan), Mexico, and is freely available at http://108.161.138.77:8096/.
Discussion: Our catalogue enables comparative studies between species, may help improve the diagnosis of certain diseases, and will be key to maximizing the production of recombinant proteins.
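The extraction and consensus steps described in the abstract can be illustrated with a minimal sketch. It is written in Python rather than the authors' Perl and is not their code. The function names, the zero-based `start_index` argument, and the `a/g` notation for co-consensus bases are assumptions made for illustration. The thresholds follow the commonly cited 50/75 form of the Cavener rules: a single base is the consensus if its frequency exceeds 50% and is more than twice that of the second most frequent base; two bases are co-consensus if together they exceed 75%; otherwise the position is `n`.

```python
from collections import Counter

UPSTREAM = 10    # bases kept before the AUG (positions -10..-1)
DOWNSTREAM = 3   # bases kept after the AUG (positions +4..+6)

# Position labels for the 16-nt window; there is no position 0 by convention.
POSITIONS = list(range(-UPSTREAM, 0)) + list(range(1, 3 + DOWNSTREAM + 1))


def tis_context(transcript, start_index):
    """Return the window around an annotated start codon as RNA.

    `start_index` is the zero-based index of the A of the start codon
    (hypothetical input format). Returns None if the window does not fit.
    """
    begin = start_index - UPSTREAM
    end = start_index + 3 + DOWNSTREAM
    if begin < 0 or end > len(transcript):
        return None
    return transcript[begin:end].upper().replace("T", "U")


def cavener_symbol(column):
    """Consensus symbol for one aligned column under the 50/75 rule."""
    ranked = Counter(column).most_common()
    total = len(column)
    top_base, top_count = ranked[0]
    second_base, second_count = ranked[1] if len(ranked) > 1 else ("", 0)
    # Single consensus base: >50% and more than twice the runner-up.
    if top_count / total > 0.5 and top_count > 2 * second_count:
        return top_base
    # Co-consensus pair: the two most frequent bases together exceed 75%.
    if second_base and (top_count + second_count) / total > 0.75:
        return "/".join(sorted([top_base.lower(), second_base.lower()]))
    return "n"


def consensus(contexts):
    """Apply the rule column by column to equal-length context strings."""
    return [cavener_symbol(col) for col in zip(*contexts)]


if __name__ == "__main__":
    # Toy transcript, not real data: start codon begins at index 12.
    window = tis_context("TTAGCCGCCACCATGGCGTT", 12)
    for pos, base in zip(POSITIONS, window):
        print(pos, base)
```

In a genome-wide run, `tis_context` would be called once per annotated gene and the resulting windows passed together to `consensus`; genes whose start codon lies too close to a sequence end are skipped here, which is one possible design choice among several.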
About the journal:
RNA has played a central role in all cellular processes since the beginning of life: decoding the genome, regulating gene expression, mediating molecular interactions, catalyzing chemical reactions. RNA Biology, as a leading journal in the field, provides a platform for presenting and discussing cutting-edge RNA research.
RNA Biology brings together a multidisciplinary community of scientists working in the areas of:
Transcription and splicing
Post-transcriptional regulation of gene expression
Non-coding RNAs
RNA localization
Translation and catalysis by RNA
Structural biology
Bioinformatics
RNA in disease and therapy