避免生物催化实验中的重复:用于酶级联优化的机器学习

IF 3.8 3区 化学 Q2 CHEMISTRY, PHYSICAL
ChemCatChem Pub Date : 2024-09-16 DOI:10.1002/cctc.202400777
Regine Siedentop, Maximilian Siska, Johanna Hermes, Stephan Lütz, Eric von Lieres, Katrin Rosenthal
{"title":"避免生物催化实验中的重复:用于酶级联优化的机器学习","authors":"Regine Siedentop, Maximilian Siska, Johanna Hermes, Stephan Lütz, Eric von Lieres, Katrin Rosenthal","doi":"10.1002/cctc.202400777","DOIUrl":null,"url":null,"abstract":"The optimization of enzyme cascades is a complex and resource-demanding task due to the multitude of parameters and synergistic effects involved. Machine learning can support the identification of optimal reaction conditions, for example, in the case of Bayesian optimization (BO), by proposing new experiments based on Gaussian process regression (GPR) and expected improvement (EI). Here, we used BO to optimize the concentrations of the reaction components of an enzyme cascade. The productivity-cost-ratio was chosen as the optimization objective in order to achieve the highest possible productivity, which was normalized to the costs of the materials used to prevent convergence to ever-increasing enzyme concentrations. To reduce the experimental effort, contrary to common practice in biological experiments, we did not use replicates but instead relied on the algorithm’s proposed experiments and inherent uncertainty quantification. This approach balances parameter space exploration and exploitation, which is critical for the efficient and effective identification of optimal reaction conditions. At the optimized reaction conditions identified in our study, the productivity-cost ratio was doubled to 38.6 mmol L-1 h-1 €-1 compared to a reference experiment. The parameter optimization required only 52 experiments while being robust to outlying experimental results.","PeriodicalId":141,"journal":{"name":"ChemCatChem","volume":"12 1","pages":""},"PeriodicalIF":3.8000,"publicationDate":"2024-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Avoiding Replicates in Biocatalysis Experiments: Machine Learning for Enzyme Cascade Optimization\",\"authors\":\"Regine Siedentop, Maximilian Siska, Johanna Hermes, Stephan Lütz, Eric von Lieres, Katrin Rosenthal\",\"doi\":\"10.1002/cctc.202400777\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The optimization of enzyme cascades is a complex and resource-demanding task due to the multitude of parameters and synergistic effects involved. Machine learning can support the identification of optimal reaction conditions, for example, in the case of Bayesian optimization (BO), by proposing new experiments based on Gaussian process regression (GPR) and expected improvement (EI). Here, we used BO to optimize the concentrations of the reaction components of an enzyme cascade. The productivity-cost-ratio was chosen as the optimization objective in order to achieve the highest possible productivity, which was normalized to the costs of the materials used to prevent convergence to ever-increasing enzyme concentrations. To reduce the experimental effort, contrary to common practice in biological experiments, we did not use replicates but instead relied on the algorithm’s proposed experiments and inherent uncertainty quantification. This approach balances parameter space exploration and exploitation, which is critical for the efficient and effective identification of optimal reaction conditions. At the optimized reaction conditions identified in our study, the productivity-cost ratio was doubled to 38.6 mmol L-1 h-1 €-1 compared to a reference experiment. The parameter optimization required only 52 experiments while being robust to outlying experimental results.\",\"PeriodicalId\":141,\"journal\":{\"name\":\"ChemCatChem\",\"volume\":\"12 1\",\"pages\":\"\"},\"PeriodicalIF\":3.8000,\"publicationDate\":\"2024-09-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ChemCatChem\",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://doi.org/10.1002/cctc.202400777\",\"RegionNum\":3,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"CHEMISTRY, PHYSICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ChemCatChem","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1002/cctc.202400777","RegionNum":3,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, PHYSICAL","Score":null,"Total":0}
引用次数: 0

摘要

由于涉及众多参数和协同效应,酶级联的优化是一项复杂且需要大量资源的任务。机器学习可以支持最佳反应条件的确定,例如,在贝叶斯优化(BO)的情况下,可以根据高斯过程回归(GPR)和预期改进(EI)提出新的实验方案。在此,我们使用贝叶斯优化法来优化酶级联反应组分的浓度。为了达到尽可能高的生产率,我们选择了生产率-成本-比率作为优化目标,并对所用材料的成本进行了归一化处理,以防止酶浓度不断升高。为了减少实验工作量,与生物实验中的常见做法相反,我们没有使用重复实验,而是依靠算法提出的实验和固有的不确定性量化。这种方法兼顾了参数空间的探索和利用,对于高效率、高效益地确定最佳反应条件至关重要。在我们研究确定的优化反应条件下,与参考实验相比,生产率-成本比翻了一番,达到 38.6 mmol L-1 h-1 €-1。参数优化只需要 52 次实验,同时对偏离的实验结果具有稳健性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Avoiding Replicates in Biocatalysis Experiments: Machine Learning for Enzyme Cascade Optimization
The optimization of enzyme cascades is a complex and resource-demanding task due to the multitude of parameters and synergistic effects involved. Machine learning can support the identification of optimal reaction conditions, for example, in the case of Bayesian optimization (BO), by proposing new experiments based on Gaussian process regression (GPR) and expected improvement (EI). Here, we used BO to optimize the concentrations of the reaction components of an enzyme cascade. The productivity-cost-ratio was chosen as the optimization objective in order to achieve the highest possible productivity, which was normalized to the costs of the materials used to prevent convergence to ever-increasing enzyme concentrations. To reduce the experimental effort, contrary to common practice in biological experiments, we did not use replicates but instead relied on the algorithm’s proposed experiments and inherent uncertainty quantification. This approach balances parameter space exploration and exploitation, which is critical for the efficient and effective identification of optimal reaction conditions. At the optimized reaction conditions identified in our study, the productivity-cost ratio was doubled to 38.6 mmol L-1 h-1 €-1 compared to a reference experiment. The parameter optimization required only 52 experiments while being robust to outlying experimental results.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
ChemCatChem
ChemCatChem 化学-物理化学
CiteScore
8.10
自引率
4.40%
发文量
511
审稿时长
1.3 months
期刊介绍: With an impact factor of 4.495 (2018), ChemCatChem is one of the premier journals in the field of catalysis. The journal provides primary research papers and critical secondary information on heterogeneous, homogeneous and bio- and nanocatalysis. The journal is well placed to strengthen cross-communication within between these communities. Its authors and readers come from academia, the chemical industry, and government laboratories across the world. It is published on behalf of Chemistry Europe, an association of 16 European chemical societies, and is supported by the German Catalysis Society.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信