Identification and preliminary characterization of conserved uncharacterized proteins from Chlamydomonas reinhardtii, Arabidopsis thaliana, and Setaria viridis.

IF 2.3 3区 生物学 Q2 PLANT SCIENCES
Plant Direct Pub Date : 2023-12-01 DOI:10.1002/pld3.527
Eric P Knoshaug, Peipei Sun, Ambarish Nag, Huong Nguyen, Erin M Mattoon, Ningning Zhang, Jian Liu, Chen Chen, Jianlin Cheng, Ru Zhang, Peter St John, James Umen
{"title":"Identification and preliminary characterization of conserved uncharacterized proteins from <i>Chlamydomonas reinhardtii</i>, <i>Arabidopsis thaliana</i>, and <i>Setaria viridis</i>.","authors":"Eric P Knoshaug, Peipei Sun, Ambarish Nag, Huong Nguyen, Erin M Mattoon, Ningning Zhang, Jian Liu, Chen Chen, Jianlin Cheng, Ru Zhang, Peter St John, James Umen","doi":"10.1002/pld3.527","DOIUrl":null,"url":null,"abstract":"<p><p>The rapid accumulation of sequenced plant genomes in the past decade has outpaced the still difficult problem of genome-wide protein-coding gene annotation. A substantial fraction of protein-coding genes in all plant genomes are poorly annotated or unannotated and remain functionally uncharacterized. We identified unannotated proteins in three model organisms representing distinct branches of the green lineage (Viridiplantae): <i>Arabidopsis thaliana</i> (eudicot), <i>Setaria viridis</i> (monocot), and <i>Chlamydomonas reinhardtii</i> (Chlorophyte alga). Using similarity searching, we identified a subset of unannotated proteins that were conserved between these species and defined them as Deep Green proteins. Bioinformatic, genomic, and structural predictions were performed to begin classifying Deep Green genes and proteins. Compared to whole proteomes for each species, the Deep Green set was enriched for proteins with predicted chloroplast targeting signals predictive of photosynthetic or plastid functions, a result that was consistent with enrichment for daylight phase diurnal expression patterning. Structural predictions using AlphaFold and comparisons to known structures showed that a significant proportion of Deep Green proteins may possess novel folds. Though only available for three organisms, the Deep Green genes and proteins provide a starting resource of high-value targets for further investigation of potentially new protein structures and functions conserved across the green lineage.</p>","PeriodicalId":20230,"journal":{"name":"Plant Direct","volume":"7 12","pages":"e527"},"PeriodicalIF":2.3000,"publicationDate":"2023-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10690477/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Plant Direct","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1002/pld3.527","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"PLANT SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

The rapid accumulation of sequenced plant genomes in the past decade has outpaced the still difficult problem of genome-wide protein-coding gene annotation. A substantial fraction of protein-coding genes in all plant genomes are poorly annotated or unannotated and remain functionally uncharacterized. We identified unannotated proteins in three model organisms representing distinct branches of the green lineage (Viridiplantae): Arabidopsis thaliana (eudicot), Setaria viridis (monocot), and Chlamydomonas reinhardtii (Chlorophyte alga). Using similarity searching, we identified a subset of unannotated proteins that were conserved between these species and defined them as Deep Green proteins. Bioinformatic, genomic, and structural predictions were performed to begin classifying Deep Green genes and proteins. Compared to whole proteomes for each species, the Deep Green set was enriched for proteins with predicted chloroplast targeting signals predictive of photosynthetic or plastid functions, a result that was consistent with enrichment for daylight phase diurnal expression patterning. Structural predictions using AlphaFold and comparisons to known structures showed that a significant proportion of Deep Green proteins may possess novel folds. Though only available for three organisms, the Deep Green genes and proteins provide a starting resource of high-value targets for further investigation of potentially new protein structures and functions conserved across the green lineage.

reinhardtii衣藻、拟南芥和蛇尾草中未鉴定的保守蛋白的鉴定和初步鉴定。
在过去的十年中,植物基因组测序的快速积累已经超过了全基因组蛋白质编码基因注释的难题。在所有植物基因组中,有相当一部分蛋白质编码基因的注释很差或未注释,并且在功能上仍未被表征。我们在代表绿色谱系(绿藻)不同分支的三种模式生物中鉴定了未注释的蛋白质:拟南芥(拟南芥)、绿草芥(单子叶)和莱茵衣藻(绿藻)。通过相似性搜索,我们确定了这些物种之间保守的未注释蛋白子集,并将其定义为深绿蛋白。进行生物信息学、基因组学和结构预测,开始对深绿色基因和蛋白质进行分类。与每个物种的整个蛋白质组相比,深绿色组富含预测叶绿体靶向信号的蛋白质,预测光合作用或质体功能,结果与日光期日表达模式的富集一致。使用AlphaFold进行结构预测并与已知结构进行比较表明,很大一部分深绿蛋白可能具有新的褶皱。虽然深绿色基因和蛋白质仅适用于三种生物,但它们为进一步研究绿色谱系中保守的潜在新蛋白质结构和功能提供了高价值靶点的起始资源。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Plant Direct
Plant Direct Environmental Science-Ecology
CiteScore
5.00
自引率
3.30%
发文量
101
审稿时长
14 weeks
期刊介绍: Plant Direct is a monthly, sound science journal for the plant sciences that gives prompt and equal consideration to papers reporting work dealing with a variety of subjects. Topics include but are not limited to genetics, biochemistry, development, cell biology, biotic stress, abiotic stress, genomics, phenomics, bioinformatics, physiology, molecular biology, and evolution. A collaborative journal launched by the American Society of Plant Biologists, the Society for Experimental Biology and Wiley, Plant Direct publishes papers submitted directly to the journal as well as those referred from a select group of the societies’ journals.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信