{"title":"Reactzyme: A Benchmark for Enzyme-Reaction Prediction","authors":"Chenqing Hua, Bozitao Zhong, Sitao Luan, Liang Hong, Guy Wolf, Doina Precup, Shuangjia Zheng","doi":"arxiv-2408.13659","DOIUrl":null,"url":null,"abstract":"Enzymes, with their specific catalyzed reactions, are necessary for all\naspects of life, enabling diverse biological processes and adaptations.\nPredicting enzyme functions is essential for understanding biological pathways,\nguiding drug development, enhancing bioproduct yields, and facilitating\nevolutionary studies. Addressing the inherent complexities, we introduce a new\napproach to annotating enzymes based on their catalyzed reactions. This method\nprovides detailed insights into specific reactions and is adaptable to newly\ndiscovered reactions, diverging from traditional classifications by protein\nfamily or expert-derived reaction classes. We employ machine learning\nalgorithms to analyze enzyme reaction datasets, delivering a much more refined\nview on the functionality of enzymes. Our evaluation leverages the largest\nenzyme-reaction dataset to date, derived from the SwissProt and Rhea databases\nwith entries up to January 8, 2024. We frame the enzyme-reaction prediction as\na retrieval problem, aiming to rank enzymes by their catalytic ability for\nspecific reactions. With our model, we can recruit proteins for novel reactions\nand predict reactions in novel proteins, facilitating enzyme discovery and\nfunction annotation.","PeriodicalId":501266,"journal":{"name":"arXiv - QuanBio - Quantitative Methods","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-08-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - QuanBio - Quantitative Methods","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2408.13659","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Enzymes, with their specific catalyzed reactions, are necessary for all
aspects of life, enabling diverse biological processes and adaptations.
Predicting enzyme functions is essential for understanding biological pathways,
guiding drug development, enhancing bioproduct yields, and facilitating
evolutionary studies. Addressing the inherent complexities, we introduce a new
approach to annotating enzymes based on their catalyzed reactions. This method
provides detailed insights into specific reactions and is adaptable to newly
discovered reactions, diverging from traditional classifications by protein
family or expert-derived reaction classes. We employ machine learning
algorithms to analyze enzyme reaction datasets, delivering a much more refined
view on the functionality of enzymes. Our evaluation leverages the largest
enzyme-reaction dataset to date, derived from the SwissProt and Rhea databases
with entries up to January 8, 2024. We frame the enzyme-reaction prediction as
a retrieval problem, aiming to rank enzymes by their catalytic ability for
specific reactions. With our model, we can recruit proteins for novel reactions
and predict reactions in novel proteins, facilitating enzyme discovery and
function annotation.