解开斑马鱼基因注释:解决 TCOF1 和 NOLC1 同源物评估中的复杂性和命名问题

IF 2.1 3区 生物学 Q4 BIOCHEMISTRY & MOLECULAR BIOLOGY
Guillermina Hill-Terán, Julieta Petrich, Maria Lorena Falcone Ferreyra, Manuel J. Aybar, Gabriela Coux
{"title":"解开斑马鱼基因注释:解决 TCOF1 和 NOLC1 同源物评估中的复杂性和命名问题","authors":"Guillermina Hill-Terán, Julieta Petrich, Maria Lorena Falcone Ferreyra, Manuel J. Aybar, Gabriela Coux","doi":"10.1007/s00239-024-10200-0","DOIUrl":null,"url":null,"abstract":"<p>Treacher Collins syndrome (TCS) is a genetic disorder affecting facial development, primarily caused by mutations in the <i>TCOF1</i> gene. TCOF1, along with NOLC1, play important roles in ribosomal RNA transcription and processing. Previously, a zebrafish model of TCS successfully recapitulated the main characteristics of the syndrome by knocking down the expression of a gene on chromosome 13 (coding for Uniprot ID B8JIY2), which was identified as the <i>TCOF1</i> orthologue. However, database updates renamed this gene as <i>nolc1</i> and the zebrafish database (ZFIN) identified a different gene on chromosome 14 as the <i>TCOF1</i> orthologue (coding for Uniprot ID E7F9D9). NOLC1 and TCOF1 are large proteins with unstructured regions and repetitive sequences that complicate alignments and comparisons. Also, the additional whole genome duplication of teleosts sets further difficulty. In this study, we present evidence that endorses that <i>NOLC1</i> and <i>TCOF1</i> are paralogs, and that the zebrafish gene on chromosome 14 is a low-complexity LisH domain-containing factor that displays homology to NOLC1 but lacks essential sequence features to accomplish TCOF1 nucleolar functions. Our analysis also supports the idea that zebrafish, as has been suggested for other non-tetrapod vertebrates, lack the <i>TCOF1</i> gene that is associated with tripartite nucleolus. Using BLAST searches in a group of teleost genomes, we identified fish-specific sequences similar to E7F9D9 zebrafish protein. We propose naming them “LisH-containing Low Complexity Proteins” (LLCP). Interestingly, the gene on chromosome 13 <i>(nolc1</i>) displays the sequence features, developmental expression patterns, and phenotypic impact of depletion that are characteristic of <i>TCOF1</i> functions. These findings suggest that in teleost fish, the nucleolar functions described for both <i>NOLC1</i> and <i>TCOF1</i> mediated by their repeated motifs, are carried out by a single gene, <i>nolc1</i>. Our study, which is mainly based on computational tools available as free web-based algorithms, could help to solve similar conflicts regarding gene orthology in zebrafish.</p>","PeriodicalId":16366,"journal":{"name":"Journal of Molecular Evolution","volume":"5 1","pages":""},"PeriodicalIF":2.1000,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Untangling Zebrafish Genetic Annotation: Addressing Complexities and Nomenclature Issues in Orthologous Evaluation of TCOF1 and NOLC1\",\"authors\":\"Guillermina Hill-Terán, Julieta Petrich, Maria Lorena Falcone Ferreyra, Manuel J. Aybar, Gabriela Coux\",\"doi\":\"10.1007/s00239-024-10200-0\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Treacher Collins syndrome (TCS) is a genetic disorder affecting facial development, primarily caused by mutations in the <i>TCOF1</i> gene. TCOF1, along with NOLC1, play important roles in ribosomal RNA transcription and processing. Previously, a zebrafish model of TCS successfully recapitulated the main characteristics of the syndrome by knocking down the expression of a gene on chromosome 13 (coding for Uniprot ID B8JIY2), which was identified as the <i>TCOF1</i> orthologue. However, database updates renamed this gene as <i>nolc1</i> and the zebrafish database (ZFIN) identified a different gene on chromosome 14 as the <i>TCOF1</i> orthologue (coding for Uniprot ID E7F9D9). NOLC1 and TCOF1 are large proteins with unstructured regions and repetitive sequences that complicate alignments and comparisons. Also, the additional whole genome duplication of teleosts sets further difficulty. In this study, we present evidence that endorses that <i>NOLC1</i> and <i>TCOF1</i> are paralogs, and that the zebrafish gene on chromosome 14 is a low-complexity LisH domain-containing factor that displays homology to NOLC1 but lacks essential sequence features to accomplish TCOF1 nucleolar functions. Our analysis also supports the idea that zebrafish, as has been suggested for other non-tetrapod vertebrates, lack the <i>TCOF1</i> gene that is associated with tripartite nucleolus. Using BLAST searches in a group of teleost genomes, we identified fish-specific sequences similar to E7F9D9 zebrafish protein. We propose naming them “LisH-containing Low Complexity Proteins” (LLCP). Interestingly, the gene on chromosome 13 <i>(nolc1</i>) displays the sequence features, developmental expression patterns, and phenotypic impact of depletion that are characteristic of <i>TCOF1</i> functions. These findings suggest that in teleost fish, the nucleolar functions described for both <i>NOLC1</i> and <i>TCOF1</i> mediated by their repeated motifs, are carried out by a single gene, <i>nolc1</i>. Our study, which is mainly based on computational tools available as free web-based algorithms, could help to solve similar conflicts regarding gene orthology in zebrafish.</p>\",\"PeriodicalId\":16366,\"journal\":{\"name\":\"Journal of Molecular Evolution\",\"volume\":\"5 1\",\"pages\":\"\"},\"PeriodicalIF\":2.1000,\"publicationDate\":\"2024-09-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Molecular Evolution\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1007/s00239-024-10200-0\",\"RegionNum\":3,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"BIOCHEMISTRY & MOLECULAR BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Molecular Evolution","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1007/s00239-024-10200-0","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

特雷撤-科林斯综合征(TCS)是一种影响面部发育的遗传性疾病,主要由 TCOF1 基因突变引起。TCOF1 和 NOLC1 在核糖体 RNA 的转录和处理中发挥着重要作用。此前,通过敲除 13 号染色体上一个基因(编码为 Uniprot ID B8JIY2)的表达,一个 TCS 斑马鱼模型成功重现了该综合征的主要特征。然而,数据库更新后将该基因重新命名为 nolc1,斑马鱼数据库(ZFIN)则将 14 号染色体上的另一个基因确定为 TCOF1 同源物(编码为 Uniprot ID E7F9D9)。NOLC1 和 TCOF1 都是大型蛋白质,其非结构化区域和重复序列使比对和比较变得复杂。此外,长臂猿额外的全基因组复制也增加了难度。在本研究中,我们提出的证据支持 NOLC1 和 TCOF1 是旁系亲属,14 号染色体上的斑马鱼基因是一种低复杂度的含 LisH 结构域的因子,与 NOLC1 同源,但缺乏完成 TCOF1 核极功能的基本序列特征。我们的分析还支持这样一种观点,即斑马鱼与其他非四足类脊椎动物一样,缺乏与三方核仁相关的 TCOF1 基因。通过在一组远志动物基因组中进行 BLAST 搜索,我们发现了与 E7F9D9 斑马鱼蛋白相似的鱼类特异性序列。我们建议将它们命名为 "含 LisH 的低复杂性蛋白"(LLCP)。有趣的是,第 13 号染色体上的基因(nolc1)显示了 TCOF1 功能所特有的序列特征、发育表达模式和缺失的表型影响。这些研究结果表明,在长臂猿鱼类中,NOLC1 和 TCOF1 通过其重复基团介导的核小体功能是由一个基因 nolc1 实现的。我们的研究主要是基于免费网络算法的计算工具,可以帮助解决斑马鱼中类似的基因同源冲突。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Untangling Zebrafish Genetic Annotation: Addressing Complexities and Nomenclature Issues in Orthologous Evaluation of TCOF1 and NOLC1

Untangling Zebrafish Genetic Annotation: Addressing Complexities and Nomenclature Issues in Orthologous Evaluation of TCOF1 and NOLC1

Treacher Collins syndrome (TCS) is a genetic disorder affecting facial development, primarily caused by mutations in the TCOF1 gene. TCOF1, along with NOLC1, play important roles in ribosomal RNA transcription and processing. Previously, a zebrafish model of TCS successfully recapitulated the main characteristics of the syndrome by knocking down the expression of a gene on chromosome 13 (coding for Uniprot ID B8JIY2), which was identified as the TCOF1 orthologue. However, database updates renamed this gene as nolc1 and the zebrafish database (ZFIN) identified a different gene on chromosome 14 as the TCOF1 orthologue (coding for Uniprot ID E7F9D9). NOLC1 and TCOF1 are large proteins with unstructured regions and repetitive sequences that complicate alignments and comparisons. Also, the additional whole genome duplication of teleosts sets further difficulty. In this study, we present evidence that endorses that NOLC1 and TCOF1 are paralogs, and that the zebrafish gene on chromosome 14 is a low-complexity LisH domain-containing factor that displays homology to NOLC1 but lacks essential sequence features to accomplish TCOF1 nucleolar functions. Our analysis also supports the idea that zebrafish, as has been suggested for other non-tetrapod vertebrates, lack the TCOF1 gene that is associated with tripartite nucleolus. Using BLAST searches in a group of teleost genomes, we identified fish-specific sequences similar to E7F9D9 zebrafish protein. We propose naming them “LisH-containing Low Complexity Proteins” (LLCP). Interestingly, the gene on chromosome 13 (nolc1) displays the sequence features, developmental expression patterns, and phenotypic impact of depletion that are characteristic of TCOF1 functions. These findings suggest that in teleost fish, the nucleolar functions described for both NOLC1 and TCOF1 mediated by their repeated motifs, are carried out by a single gene, nolc1. Our study, which is mainly based on computational tools available as free web-based algorithms, could help to solve similar conflicts regarding gene orthology in zebrafish.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Journal of Molecular Evolution
Journal of Molecular Evolution 生物-进化生物学
CiteScore
5.50
自引率
2.60%
发文量
36
审稿时长
3 months
期刊介绍: Journal of Molecular Evolution covers experimental, computational, and theoretical work aimed at deciphering features of molecular evolution and the processes bearing on these features, from the initial formation of macromolecular systems through their evolution at the molecular level, the co-evolution of their functions in cellular and organismal systems, and their influence on organismal adaptation, speciation, and ecology. Topics addressed include the evolution of informational macromolecules and their relation to more complex levels of biological organization, including populations and taxa, as well as the molecular basis for the evolution of ecological interactions of species and the use of molecular data to infer fundamental processes in evolutionary ecology. This coverage accommodates such subfields as new genome sequences, comparative structural and functional genomics, population genetics, the molecular evolution of development, the evolution of gene regulation and gene interaction networks, and in vitro evolution of DNA and RNA, molecular evolutionary ecology, and the development of methods and theory that enable molecular evolutionary inference, including but not limited to, phylogenetic methods.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信