[AcidBasePred: a protein acid-base tolerance prediction platform based on deep learning].

Q4 Biochemistry, Genetics and Molecular Biology
Rong Huang, Hejian Zhang, Min Wu, Zhiyue Men, Huanyu Chu, Jie Bai, Hong Chang, Jian Cheng, Xiaoping Liao, Yuwan Liu, Yajian Song, Huifeng Jiang
{"title":"[AcidBasePred: a protein acid-base tolerance prediction platform based on deep learning].","authors":"Rong Huang, Hejian Zhang, Min Wu, Zhiyue Men, Huanyu Chu, Jie Bai, Hong Chang, Jian Cheng, Xiaoping Liao, Yuwan Liu, Yajian Song, Huifeng Jiang","doi":"10.13345/j.cjb.240255","DOIUrl":null,"url":null,"abstract":"<p><p>The structures and activities of enzymes are influenced by pH of the environment. Understanding and distinguishing the adaptation mechanisms of enzymes to extreme pH values is of great significance for elucidating the molecular mechanisms and promoting the industrial applications of enzymes. In this study, the ESM-2 protein language model was used to encode the secreted microbial proteins with the optimal performance above pH 9 and below pH 5, which yielded 47 725 high-pH protein sequences and 66 079 low-pH protein sequences, respectively. A deep learning model was constructed to identify protein acid-base tolerance based on amino acid sequences. The model showcased significantly higher accuracy than other methods, with the overall accuracy of 94.8%, precision of 91.8%, and a recall rate of 93.4% on the test set. Furthermore, we built a website (https://enzymepred.biodesign.ac.cn), which enabled users to predict the acid-base tolerance by submitting the protein sequences of enzymes. This study has accelerated the application of enzymes in various fields, including biotechnology, pharmaceuticals, and chemicals. It provides a powerful tool for the rapid screening and optimization of industrial enzymes.</p>","PeriodicalId":21778,"journal":{"name":"Sheng wu gong cheng xue bao = Chinese journal of biotechnology","volume":"40 12","pages":"4670-4681"},"PeriodicalIF":0.0000,"publicationDate":"2024-12-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Sheng wu gong cheng xue bao = Chinese journal of biotechnology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.13345/j.cjb.240255","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Biochemistry, Genetics and Molecular Biology","Score":null,"Total":0}
引用次数: 0

Abstract

The structures and activities of enzymes are influenced by pH of the environment. Understanding and distinguishing the adaptation mechanisms of enzymes to extreme pH values is of great significance for elucidating the molecular mechanisms and promoting the industrial applications of enzymes. In this study, the ESM-2 protein language model was used to encode the secreted microbial proteins with the optimal performance above pH 9 and below pH 5, which yielded 47 725 high-pH protein sequences and 66 079 low-pH protein sequences, respectively. A deep learning model was constructed to identify protein acid-base tolerance based on amino acid sequences. The model showcased significantly higher accuracy than other methods, with the overall accuracy of 94.8%, precision of 91.8%, and a recall rate of 93.4% on the test set. Furthermore, we built a website (https://enzymepred.biodesign.ac.cn), which enabled users to predict the acid-base tolerance by submitting the protein sequences of enzymes. This study has accelerated the application of enzymes in various fields, including biotechnology, pharmaceuticals, and chemicals. It provides a powerful tool for the rapid screening and optimization of industrial enzymes.

[AcidBasePred:基于深度学习的蛋白质酸碱耐受性预测平台]。
酶的结构和活性受环境pH的影响。了解和区分酶对极端pH值的适应机制,对于阐明酶的分子机制和促进酶的工业应用具有重要意义。本研究利用ESM-2蛋白语言模型编码pH值在9以上和5以下表现最佳的分泌微生物蛋白,分别获得了47 725条高pH蛋白序列和66 079条低pH蛋白序列。构建了基于氨基酸序列的蛋白质酸碱耐受性深度学习模型。该模型在测试集上的总体准确率为94.8%,精密度为91.8%,召回率为93.4%,准确率显著高于其他方法。此外,我们建立了一个网站(https://enzymepred.biodesign.ac.cn),用户可以通过提交酶的蛋白质序列来预测酶的酸碱耐受性。这项研究加速了酶在生物技术、制药和化学等各个领域的应用。它为工业酶的快速筛选和优化提供了有力的工具。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Sheng wu gong cheng xue bao = Chinese journal of biotechnology
Sheng wu gong cheng xue bao = Chinese journal of biotechnology Biochemistry, Genetics and Molecular Biology-Biotechnology
CiteScore
1.50
自引率
0.00%
发文量
298
期刊介绍: Chinese Journal of Biotechnology (Chinese edition) , sponsored by the Institute of Microbiology, Chinese Academy of Sciences and the Chinese Society for Microbiology, is a peer-reviewed international journal. The journal is cited by many scientific databases , such as Chemical Abstract (CA), Biology Abstract (BA), MEDLINE, Russian Digest , Chinese Scientific Citation Index (CSCI), Chinese Journal Citation Report (CJCR), and Chinese Academic Journal (CD version). The Journal publishes new discoveries, techniques and developments in genetic engineering, cell engineering, enzyme engineering, biochemical engineering, tissue engineering, bioinformatics, biochips and other fields of biotechnology.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信