Comprehensive meta-analysis and machine learning approaches identified the role of novel drought specific genes in Oryza sativa

IF 2.2 Q3 GENETICS & HEREDITY
Raja Rajeswary Thanmalagan, Abhijeet Roy, Aiswarya Jayaprakash, P.T.V. Lakshmi
{"title":"Comprehensive meta-analysis and machine learning approaches identified the role of novel drought specific genes in Oryza sativa","authors":"Raja Rajeswary Thanmalagan,&nbsp;Abhijeet Roy,&nbsp;Aiswarya Jayaprakash,&nbsp;P.T.V. Lakshmi","doi":"10.1016/j.plgene.2022.100382","DOIUrl":null,"url":null,"abstract":"<div><p><span>Rice is a major food crop and provides nutrition for half of the world's population. Rice production is majorly affected by drought at different developmental stages and accounted for annual yield loss depending on the intensity of drought. Hence, the need to study the molecular mechanism in a holistic manner behind drought tolerance is a prerequisite to mitigating this problem. Therefore, in the current study, the drought tolerance mechanism of rice plants was elucidated through a meta-analysis on the publically available </span>transcriptomic<span> datasets by integrating these datasets using a R package to remove the batch effects and applying machine learning approaches for prediction robustness and accuracy. Thus, the classifier model identified 128 essential genes through feature selection algorithms and classification methods on training datasets. The comprehensive study revealed that Naïve Bayes<span> classification and correlation-based feature selection was robust in the prediction of essential genes. The accuracy and performance of the classification model was validated with the independent test dataset and the prediction accuracy of the classifier was 93% with ROC (0.972) and F-measures (0.927). Further, the biological significance of the identified genes in drought tolerance was assessed. The current analysis highlighted the regulatory roles of novel genes such as Os01g0844300, Os06g0246500, Os05g03733900, Os05g0550600 Os08g0442900, Os08g0104400, Os01g0256500, Os02g0259900 and Os05g0572700 in the enhancement of drought tolerance mechanisms. Thus the identified genes might be the potential targets for molecular breeding of drought-tolerant rice cultivars.</span></span></p></div>","PeriodicalId":38041,"journal":{"name":"Plant Gene","volume":"32 ","pages":"Article 100382"},"PeriodicalIF":2.2000,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Plant Gene","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2352407322000324","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 1

Abstract

Rice is a major food crop and provides nutrition for half of the world's population. Rice production is majorly affected by drought at different developmental stages and accounted for annual yield loss depending on the intensity of drought. Hence, the need to study the molecular mechanism in a holistic manner behind drought tolerance is a prerequisite to mitigating this problem. Therefore, in the current study, the drought tolerance mechanism of rice plants was elucidated through a meta-analysis on the publically available transcriptomic datasets by integrating these datasets using a R package to remove the batch effects and applying machine learning approaches for prediction robustness and accuracy. Thus, the classifier model identified 128 essential genes through feature selection algorithms and classification methods on training datasets. The comprehensive study revealed that Naïve Bayes classification and correlation-based feature selection was robust in the prediction of essential genes. The accuracy and performance of the classification model was validated with the independent test dataset and the prediction accuracy of the classifier was 93% with ROC (0.972) and F-measures (0.927). Further, the biological significance of the identified genes in drought tolerance was assessed. The current analysis highlighted the regulatory roles of novel genes such as Os01g0844300, Os06g0246500, Os05g03733900, Os05g0550600 Os08g0442900, Os08g0104400, Os01g0256500, Os02g0259900 and Os05g0572700 in the enhancement of drought tolerance mechanisms. Thus the identified genes might be the potential targets for molecular breeding of drought-tolerant rice cultivars.

综合荟萃分析和机器学习方法确定了水稻中新的干旱特异性基因的作用
水稻是一种主要的粮食作物,为世界上一半的人口提供营养。水稻生产在不同发育阶段主要受干旱影响,并根据干旱强度造成年产量损失。因此,需要从整体上研究耐旱性背后的分子机制是缓解这一问题的先决条件。因此,在本研究中,通过对公开的转录组数据集进行meta分析,阐明了水稻植物的抗旱机制,使用R软件包对这些数据集进行整合,以消除批次效应,并应用机器学习方法来预测稳健性和准确性。因此,分类器模型通过特征选择算法和分类方法在训练数据集上识别出128个必需基因。综合研究表明,Naïve贝叶斯分类和基于相关性的特征选择在预测必需基因方面具有鲁棒性。用独立的测试数据集验证了分类模型的准确性和性能,分类器的预测准确率为93%,ROC (0.972), F-measures(0.927)。此外,还评估了所鉴定基因在抗旱方面的生物学意义。目前的分析重点是Os01g0844300、Os06g0246500、Os05g03733900、Os05g0550600、Os08g0442900、Os08g0104400、Os01g0256500、Os02g0259900和Os05g0572700等新基因在增强抗旱机制中的调控作用。因此,所鉴定的基因可能是水稻抗旱品种分子育种的潜在靶点。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Plant Gene
Plant Gene Agricultural and Biological Sciences-Plant Science
CiteScore
4.50
自引率
0.00%
发文量
42
审稿时长
51 days
期刊介绍: Plant Gene publishes papers that focus on the regulation, expression, function and evolution of genes in plants, algae and other photosynthesizing organisms (e.g., cyanobacteria), and plant-associated microorganisms. Plant Gene strives to be a diverse plant journal and topics in multiple fields will be considered for publication. Although not limited to the following, some general topics include: Gene discovery and characterization, Gene regulation in response to environmental stress (e.g., salinity, drought, etc.), Genetic effects of transposable elements, Genetic control of secondary metabolic pathways and metabolic enzymes. Herbal Medicine - regulation and medicinal properties of plant products, Plant hormonal signaling, Plant evolutionary genetics, molecular evolution, population genetics, and phylogenetics, Profiling of plant gene expression and genetic variation, Plant-microbe interactions (e.g., influence of endophytes on gene expression; horizontal gene transfer studies; etc.), Agricultural genetics - biotechnology and crop improvement.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信