ASPTF:利用机器学习算法预测植物非生物胁迫响应转录因子的计算工具。

IF 2.8 3区 生物学 Q3 BIOCHEMISTRY & MOLECULAR BIOLOGY
Upendra Kumar Pradhan , Anuradha Mahapatra , Sanchita Naha , Ajit Gupta , Rajender Parsad , Vijay Gahlaut , Surya Narayan Rath , Prabina Kumar Meher
{"title":"ASPTF:利用机器学习算法预测植物非生物胁迫响应转录因子的计算工具。","authors":"Upendra Kumar Pradhan ,&nbsp;Anuradha Mahapatra ,&nbsp;Sanchita Naha ,&nbsp;Ajit Gupta ,&nbsp;Rajender Parsad ,&nbsp;Vijay Gahlaut ,&nbsp;Surya Narayan Rath ,&nbsp;Prabina Kumar Meher","doi":"10.1016/j.bbagen.2024.130597","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><p>Abiotic stresses pose serious threat to the growth and yield of crop plants. Several studies suggest that in plants, transcription factors (TFs) are important regulators of gene expression, especially when it comes to coping with abiotic stresses. Therefore, it is crucial to identify TFs associated with abiotic stress response for breeding of abiotic stress tolerant crop cultivars.</p></div><div><h3>Methods</h3><p>Based on a machine learning framework, a computational model was envisaged to predict TFs associated with abiotic stress response in plants. To numerically encode TF sequences, four distinct sequence derived features were generated. The prediction was performed using ten shallow learning and four deep learning algorithms. For prediction using more pertinent and informative features, feature selection techniques were also employed.</p></div><div><h3>Results</h3><p>Using the features chosen by the light-gradient boosting machine-variable importance measure (LGBM-VIM), the LGBM achieved the highest cross-validation performance metrics (accuracy: 86.81%, auROC: 92.98%, and auPRC: 94.03%). Further evaluation of the proposed model (LGBM prediction method + LGBM-VIM selected features) was also done using an independent test dataset, where the accuracy, auROC and auPRC were observed 81.98%, 90.65% and 91.30%, respectively.</p></div><div><h3>Conclusions</h3><p>To facilitate the adoption of the proposed strategy by users, the approach was implemented as a prediction server called ASPTF, accessible at <span>https://iasri-sg.icar.gov.in/asptf/</span><svg><path></path></svg>. The developed approach and the corresponding web application are anticipated to supplement experimental methods in the identification of transcription factors (TFs) responsive to abiotic stress in plants.</p></div>","PeriodicalId":8800,"journal":{"name":"Biochimica et biophysica acta. General subjects","volume":null,"pages":null},"PeriodicalIF":2.8000,"publicationDate":"2024-03-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"ASPTF: A computational tool to predict abiotic stress-responsive transcription factors in plants by employing machine learning algorithms\",\"authors\":\"Upendra Kumar Pradhan ,&nbsp;Anuradha Mahapatra ,&nbsp;Sanchita Naha ,&nbsp;Ajit Gupta ,&nbsp;Rajender Parsad ,&nbsp;Vijay Gahlaut ,&nbsp;Surya Narayan Rath ,&nbsp;Prabina Kumar Meher\",\"doi\":\"10.1016/j.bbagen.2024.130597\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Background</h3><p>Abiotic stresses pose serious threat to the growth and yield of crop plants. Several studies suggest that in plants, transcription factors (TFs) are important regulators of gene expression, especially when it comes to coping with abiotic stresses. Therefore, it is crucial to identify TFs associated with abiotic stress response for breeding of abiotic stress tolerant crop cultivars.</p></div><div><h3>Methods</h3><p>Based on a machine learning framework, a computational model was envisaged to predict TFs associated with abiotic stress response in plants. To numerically encode TF sequences, four distinct sequence derived features were generated. The prediction was performed using ten shallow learning and four deep learning algorithms. For prediction using more pertinent and informative features, feature selection techniques were also employed.</p></div><div><h3>Results</h3><p>Using the features chosen by the light-gradient boosting machine-variable importance measure (LGBM-VIM), the LGBM achieved the highest cross-validation performance metrics (accuracy: 86.81%, auROC: 92.98%, and auPRC: 94.03%). Further evaluation of the proposed model (LGBM prediction method + LGBM-VIM selected features) was also done using an independent test dataset, where the accuracy, auROC and auPRC were observed 81.98%, 90.65% and 91.30%, respectively.</p></div><div><h3>Conclusions</h3><p>To facilitate the adoption of the proposed strategy by users, the approach was implemented as a prediction server called ASPTF, accessible at <span>https://iasri-sg.icar.gov.in/asptf/</span><svg><path></path></svg>. The developed approach and the corresponding web application are anticipated to supplement experimental methods in the identification of transcription factors (TFs) responsive to abiotic stress in plants.</p></div>\",\"PeriodicalId\":8800,\"journal\":{\"name\":\"Biochimica et biophysica acta. General subjects\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":2.8000,\"publicationDate\":\"2024-03-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Biochimica et biophysica acta. General subjects\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0304416524000400\",\"RegionNum\":3,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"BIOCHEMISTRY & MOLECULAR BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biochimica et biophysica acta. General subjects","FirstCategoryId":"99","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0304416524000400","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

背景:非生物胁迫对农作物的生长和产量构成严重威胁。一些研究表明,在植物中,转录因子(TFs)是基因表达的重要调节因子,尤其是在应对非生物胁迫时。因此,识别与非生物胁迫响应相关的转录因子对于培育耐受非生物胁迫的作物栽培品种至关重要:方法:基于机器学习框架,设想建立一个计算模型来预测与植物非生物胁迫响应相关的TFs。为了对 TF 序列进行数字编码,生成了四种不同的序列衍生特征。预测使用了十种浅层学习算法和四种深度学习算法。为了使用更贴切、信息量更大的特征进行预测,还采用了特征选择技术:使用光梯度提升机器-变量重要性度量(LGBM-VIM)选择的特征,LGBM 实现了最高的交叉验证性能指标(准确率:86.81%;auROC:92.98%;auPRC:94.03%)。此外,还使用独立测试数据集对拟议模型(LGBM 预测方法 + LGBM-VIM 所选特征)进行了进一步评估,观察到准确率、auROC 和 auPRC 分别为 81.98%、90.65% 和 91.30%:为便于用户采用所提出的策略,我们实施了该技术,并将其作为一个名为 ASPTF 的预测服务器,可在 https://iasri-sg.icar.gov.in/asptf/ 上访问。预计所开发的方法和相应的网络应用程序将在鉴定植物对非生物胁迫有反应的转录因子(TFs)方面对实验方法起到补充作用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
ASPTF: A computational tool to predict abiotic stress-responsive transcription factors in plants by employing machine learning algorithms

Background

Abiotic stresses pose serious threat to the growth and yield of crop plants. Several studies suggest that in plants, transcription factors (TFs) are important regulators of gene expression, especially when it comes to coping with abiotic stresses. Therefore, it is crucial to identify TFs associated with abiotic stress response for breeding of abiotic stress tolerant crop cultivars.

Methods

Based on a machine learning framework, a computational model was envisaged to predict TFs associated with abiotic stress response in plants. To numerically encode TF sequences, four distinct sequence derived features were generated. The prediction was performed using ten shallow learning and four deep learning algorithms. For prediction using more pertinent and informative features, feature selection techniques were also employed.

Results

Using the features chosen by the light-gradient boosting machine-variable importance measure (LGBM-VIM), the LGBM achieved the highest cross-validation performance metrics (accuracy: 86.81%, auROC: 92.98%, and auPRC: 94.03%). Further evaluation of the proposed model (LGBM prediction method + LGBM-VIM selected features) was also done using an independent test dataset, where the accuracy, auROC and auPRC were observed 81.98%, 90.65% and 91.30%, respectively.

Conclusions

To facilitate the adoption of the proposed strategy by users, the approach was implemented as a prediction server called ASPTF, accessible at https://iasri-sg.icar.gov.in/asptf/. The developed approach and the corresponding web application are anticipated to supplement experimental methods in the identification of transcription factors (TFs) responsive to abiotic stress in plants.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Biochimica et biophysica acta. General subjects
Biochimica et biophysica acta. General subjects 生物-生化与分子生物学
CiteScore
6.40
自引率
0.00%
发文量
139
审稿时长
30 days
期刊介绍: BBA General Subjects accepts for submission either original, hypothesis-driven studies or reviews covering subjects in biochemistry and biophysics that are considered to have general interest for a wide audience. Manuscripts with interdisciplinary approaches are especially encouraged.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信