{"title":"DLKcat 无法预测突变体和陌生酶的 k cat 值。","authors":"Alexander Kroll, Martin J Lercher","doi":"10.1093/biomethods/bpae061","DOIUrl":null,"url":null,"abstract":"<p><p>The recently published DLKcat model, a deep learning approach for predicting enzyme turnover numbers (<i>k</i> <sub>cat</sub>), claims to enable high-throughput <i>k</i> <sub>cat</sub> predictions for metabolic enzymes from any organism and to capture <i>k</i> <sub>cat</sub> changes for mutated enzymes. Here, we critically evaluate these claims. We show that for enzymes with <60% sequence identity to the training data DLKcat predictions become worse than simply assuming a constant average <i>k</i> <sub>cat</sub> value for all reactions. Furthermore, DLKcat's ability to predict mutation effects is much weaker than implied, capturing none of the experimentally observed variation across mutants not included in the training data. These findings highlight significant limitations in DLKcat's generalizability and its practical utility for predicting <i>k</i> <sub>cat</sub> values for novel enzyme families or mutants, which are crucial applications in fields such as metabolic modeling.</p>","PeriodicalId":36528,"journal":{"name":"Biology Methods and Protocols","volume":null,"pages":null},"PeriodicalIF":2.5000,"publicationDate":"2024-08-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11427335/pdf/","citationCount":"0","resultStr":"{\"title\":\"DLKcat cannot predict meaningful <i>k</i> <sub>cat</sub> values for mutants and unfamiliar enzymes.\",\"authors\":\"Alexander Kroll, Martin J Lercher\",\"doi\":\"10.1093/biomethods/bpae061\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>The recently published DLKcat model, a deep learning approach for predicting enzyme turnover numbers (<i>k</i> <sub>cat</sub>), claims to enable high-throughput <i>k</i> <sub>cat</sub> predictions for metabolic enzymes from any organism and to capture <i>k</i> <sub>cat</sub> changes for mutated enzymes. Here, we critically evaluate these claims. We show that for enzymes with <60% sequence identity to the training data DLKcat predictions become worse than simply assuming a constant average <i>k</i> <sub>cat</sub> value for all reactions. Furthermore, DLKcat's ability to predict mutation effects is much weaker than implied, capturing none of the experimentally observed variation across mutants not included in the training data. These findings highlight significant limitations in DLKcat's generalizability and its practical utility for predicting <i>k</i> <sub>cat</sub> values for novel enzyme families or mutants, which are crucial applications in fields such as metabolic modeling.</p>\",\"PeriodicalId\":36528,\"journal\":{\"name\":\"Biology Methods and Protocols\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":2.5000,\"publicationDate\":\"2024-08-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11427335/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Biology Methods and Protocols\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1093/biomethods/bpae061\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q3\",\"JCRName\":\"BIOCHEMICAL RESEARCH METHODS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biology Methods and Protocols","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/biomethods/bpae061","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/1 0:00:00","PubModel":"eCollection","JCR":"Q3","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0
摘要
最近发表的 DLKcat 模型是一种预测酶转化率(k cat)的深度学习方法,它声称能对任何生物体的代谢酶进行高通量 k cat 预测,并能捕捉突变酶的 k cat 变化。在此,我们对这些说法进行了严格的评估。我们发现,对于所有反应都有 k cat 值的酶来说,DLKcat 可以预测它们的 k cat 值。此外,DLKcat 预测突变效应的能力比所暗示的要弱得多,它捕捉不到实验观察到的未包含在训练数据中的突变体之间的变化。这些发现凸显了 DLKcat 的普适性及其在预测新型酶家族或突变体的 k cat 值方面的实用性存在重大局限,而这正是代谢建模等领域的关键应用。
DLKcat cannot predict meaningful kcat values for mutants and unfamiliar enzymes.
The recently published DLKcat model, a deep learning approach for predicting enzyme turnover numbers (kcat), claims to enable high-throughput kcat predictions for metabolic enzymes from any organism and to capture kcat changes for mutated enzymes. Here, we critically evaluate these claims. We show that for enzymes with <60% sequence identity to the training data DLKcat predictions become worse than simply assuming a constant average kcat value for all reactions. Furthermore, DLKcat's ability to predict mutation effects is much weaker than implied, capturing none of the experimentally observed variation across mutants not included in the training data. These findings highlight significant limitations in DLKcat's generalizability and its practical utility for predicting kcat values for novel enzyme families or mutants, which are crucial applications in fields such as metabolic modeling.