Regine Siedentop, Maximilian Siska, Johanna Hermes, Stephan Lütz, Eric von Lieres, Katrin Rosenthal
{"title":"避免生物催化实验中的重复:用于酶级联优化的机器学习","authors":"Regine Siedentop, Maximilian Siska, Johanna Hermes, Stephan Lütz, Eric von Lieres, Katrin Rosenthal","doi":"10.1002/cctc.202400777","DOIUrl":null,"url":null,"abstract":"The optimization of enzyme cascades is a complex and resource-demanding task due to the multitude of parameters and synergistic effects involved. Machine learning can support the identification of optimal reaction conditions, for example, in the case of Bayesian optimization (BO), by proposing new experiments based on Gaussian process regression (GPR) and expected improvement (EI). Here, we used BO to optimize the concentrations of the reaction components of an enzyme cascade. The productivity-cost-ratio was chosen as the optimization objective in order to achieve the highest possible productivity, which was normalized to the costs of the materials used to prevent convergence to ever-increasing enzyme concentrations. To reduce the experimental effort, contrary to common practice in biological experiments, we did not use replicates but instead relied on the algorithm’s proposed experiments and inherent uncertainty quantification. This approach balances parameter space exploration and exploitation, which is critical for the efficient and effective identification of optimal reaction conditions. At the optimized reaction conditions identified in our study, the productivity-cost ratio was doubled to 38.6 mmol L-1 h-1 €-1 compared to a reference experiment. The parameter optimization required only 52 experiments while being robust to outlying experimental results.","PeriodicalId":141,"journal":{"name":"ChemCatChem","volume":"12 1","pages":""},"PeriodicalIF":3.8000,"publicationDate":"2024-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Avoiding Replicates in Biocatalysis Experiments: Machine Learning for Enzyme Cascade Optimization\",\"authors\":\"Regine Siedentop, Maximilian Siska, Johanna Hermes, Stephan Lütz, Eric von Lieres, Katrin Rosenthal\",\"doi\":\"10.1002/cctc.202400777\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The optimization of enzyme cascades is a complex and resource-demanding task due to the multitude of parameters and synergistic effects involved. Machine learning can support the identification of optimal reaction conditions, for example, in the case of Bayesian optimization (BO), by proposing new experiments based on Gaussian process regression (GPR) and expected improvement (EI). Here, we used BO to optimize the concentrations of the reaction components of an enzyme cascade. The productivity-cost-ratio was chosen as the optimization objective in order to achieve the highest possible productivity, which was normalized to the costs of the materials used to prevent convergence to ever-increasing enzyme concentrations. To reduce the experimental effort, contrary to common practice in biological experiments, we did not use replicates but instead relied on the algorithm’s proposed experiments and inherent uncertainty quantification. This approach balances parameter space exploration and exploitation, which is critical for the efficient and effective identification of optimal reaction conditions. At the optimized reaction conditions identified in our study, the productivity-cost ratio was doubled to 38.6 mmol L-1 h-1 €-1 compared to a reference experiment. The parameter optimization required only 52 experiments while being robust to outlying experimental results.\",\"PeriodicalId\":141,\"journal\":{\"name\":\"ChemCatChem\",\"volume\":\"12 1\",\"pages\":\"\"},\"PeriodicalIF\":3.8000,\"publicationDate\":\"2024-09-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ChemCatChem\",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://doi.org/10.1002/cctc.202400777\",\"RegionNum\":3,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"CHEMISTRY, PHYSICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ChemCatChem","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1002/cctc.202400777","RegionNum":3,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, PHYSICAL","Score":null,"Total":0}
Avoiding Replicates in Biocatalysis Experiments: Machine Learning for Enzyme Cascade Optimization
The optimization of enzyme cascades is a complex and resource-demanding task due to the multitude of parameters and synergistic effects involved. Machine learning can support the identification of optimal reaction conditions, for example, in the case of Bayesian optimization (BO), by proposing new experiments based on Gaussian process regression (GPR) and expected improvement (EI). Here, we used BO to optimize the concentrations of the reaction components of an enzyme cascade. The productivity-cost-ratio was chosen as the optimization objective in order to achieve the highest possible productivity, which was normalized to the costs of the materials used to prevent convergence to ever-increasing enzyme concentrations. To reduce the experimental effort, contrary to common practice in biological experiments, we did not use replicates but instead relied on the algorithm’s proposed experiments and inherent uncertainty quantification. This approach balances parameter space exploration and exploitation, which is critical for the efficient and effective identification of optimal reaction conditions. At the optimized reaction conditions identified in our study, the productivity-cost ratio was doubled to 38.6 mmol L-1 h-1 €-1 compared to a reference experiment. The parameter optimization required only 52 experiments while being robust to outlying experimental results.
期刊介绍:
With an impact factor of 4.495 (2018), ChemCatChem is one of the premier journals in the field of catalysis. The journal provides primary research papers and critical secondary information on heterogeneous, homogeneous and bio- and nanocatalysis. The journal is well placed to strengthen cross-communication within between these communities. Its authors and readers come from academia, the chemical industry, and government laboratories across the world. It is published on behalf of Chemistry Europe, an association of 16 European chemical societies, and is supported by the German Catalysis Society.