Enzyme specificity prediction using cross attention graph neural networks.

IF 48.5 1区 综合性期刊 Q1 MULTIDISCIPLINARY SCIENCES
Nature Pub Date : 2025-10-08 DOI:10.1038/s41586-025-09697-2
Haiyang Cui,Yufeng Su,Tanner J Dean,Tianhao Yu,Zhengyi Zhang,Jian Peng,Diwakar Shukla,Huimin Zhao
{"title":"Enzyme specificity prediction using cross attention graph neural networks.","authors":"Haiyang Cui,Yufeng Su,Tanner J Dean,Tianhao Yu,Zhengyi Zhang,Jian Peng,Diwakar Shukla,Huimin Zhao","doi":"10.1038/s41586-025-09697-2","DOIUrl":null,"url":null,"abstract":"Enzymes are the molecular machines of life, and a key property that governs their function is substrate specificity-the ability of an enzyme to recognize and selectively act on particular substrates. This specificity originates from the three-dimensional (3D) structure of the enzyme active site and complicated transition state of the reaction1,2. Many enzymes can promiscuously catalyze reactions or act on substrates beyond those for which they were originally evolved1,3-5. However, millions of known enzymes still lack reliable substrate specificity information, impeding their practical applications and comprehensive understanding of the biocatalytic diversity in nature. Herein, we developed a cross-attention-empowered SE(3)-equivariant graph neural network architecture named EZSpecificity for predicting enzyme substrate specificity, which was trained on a comprehensive tailor-made database of enzyme-substrate interactions at sequence and structural levels. EZSpecificity outperformed the existing machine learning models for enzyme substrate specificity prediction, as demonstrated by both an unknown substrate and enzyme database and seven proof-of-concept protein families. Experimental validation with eight halogenases and 78 substrates revealed that EZSpecificity achieved a 91.7% accuracy in identifying the single potential reactive substrate, significantly higher than that of the state-of-the-art model ESP (58.3%). EZSpecificity represents a general machine learning model for accurate prediction of substrate specificity for enzymes related to fundamental and applied research in biology and medicine.","PeriodicalId":18787,"journal":{"name":"Nature","volume":"26 1","pages":""},"PeriodicalIF":48.5000,"publicationDate":"2025-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Nature","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1038/s41586-025-09697-2","RegionNum":1,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

Enzymes are the molecular machines of life, and a key property that governs their function is substrate specificity-the ability of an enzyme to recognize and selectively act on particular substrates. This specificity originates from the three-dimensional (3D) structure of the enzyme active site and complicated transition state of the reaction1,2. Many enzymes can promiscuously catalyze reactions or act on substrates beyond those for which they were originally evolved1,3-5. However, millions of known enzymes still lack reliable substrate specificity information, impeding their practical applications and comprehensive understanding of the biocatalytic diversity in nature. Herein, we developed a cross-attention-empowered SE(3)-equivariant graph neural network architecture named EZSpecificity for predicting enzyme substrate specificity, which was trained on a comprehensive tailor-made database of enzyme-substrate interactions at sequence and structural levels. EZSpecificity outperformed the existing machine learning models for enzyme substrate specificity prediction, as demonstrated by both an unknown substrate and enzyme database and seven proof-of-concept protein families. Experimental validation with eight halogenases and 78 substrates revealed that EZSpecificity achieved a 91.7% accuracy in identifying the single potential reactive substrate, significantly higher than that of the state-of-the-art model ESP (58.3%). EZSpecificity represents a general machine learning model for accurate prediction of substrate specificity for enzymes related to fundamental and applied research in biology and medicine.
交叉注意图神经网络的酶特异性预测。
酶是生命的分子机器,控制其功能的一个关键特性是底物特异性——酶识别和选择性作用于特定底物的能力。这种特异性源于酶活性位点的三维(3D)结构和反应的复杂过渡状态1,2。许多酶可以混杂地催化反应或作用于它们最初进化的底物以外的底物1,3-5。然而,数以百万计的已知酶仍然缺乏可靠的底物特异性信息,阻碍了它们的实际应用和对自然界生物催化多样性的全面理解。在此,我们开发了一个名为EZSpecificity的交叉注意授权SE(3)-等变图神经网络架构,用于预测酶底物特异性,该架构是在一个全面定制的酶-底物相互作用的序列和结构水平数据库上进行训练的。正如未知底物和酶数据库以及七个概念验证蛋白家族所证明的那样,ezspecific在酶底物特异性预测方面优于现有的机器学习模型。8种卤化酶和78种底物的实验验证表明,ezspecific在识别单一潜在活性底物方面的准确率达到91.7%,显著高于最先进的模型ESP(58.3%)。ezspecific是一种通用的机器学习模型,用于准确预测与生物学和医学基础研究和应用研究相关的酶的底物特异性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Nature
Nature 综合性期刊-综合性期刊
CiteScore
90.00
自引率
1.20%
发文量
3652
审稿时长
3 months
期刊介绍: Nature is a prestigious international journal that publishes peer-reviewed research in various scientific and technological fields. The selection of articles is based on criteria such as originality, importance, interdisciplinary relevance, timeliness, accessibility, elegance, and surprising conclusions. In addition to showcasing significant scientific advances, Nature delivers rapid, authoritative, insightful news, and interpretation of current and upcoming trends impacting science, scientists, and the broader public. The journal serves a dual purpose: firstly, to promptly share noteworthy scientific advances and foster discussions among scientists, and secondly, to ensure the swift dissemination of scientific results globally, emphasizing their significance for knowledge, culture, and daily life.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信