Perman Jorayev, Sebastian Soritz, Simon Sung, Mohammed I. Jeraal, Danilo Russo, Alexandre Barthelme, Frédéric C. Toussaint, Matthew J. Gaunt, Alexei A. Lapkin
{"title":"连续流光氧化还原胺合成的机器学习驱动优化","authors":"Perman Jorayev, Sebastian Soritz, Simon Sung, Mohammed I. Jeraal, Danilo Russo, Alexandre Barthelme, Frédéric C. Toussaint, Matthew J. Gaunt, Alexei A. Lapkin","doi":"10.1021/acs.oprd.4c00533","DOIUrl":null,"url":null,"abstract":"Photoredox catalysis plays an important role in the synthesis of pharmaceutically relevant compounds such as C(sp<sup>3</sup>)-rich tertiary amines. The difficulty of identifying underlying mechanistic models for such novel transformations, coupled with the large reaction space of this reaction class, means that developing a robust process is challenging. In this work, we demonstrate the machine learning-driven optimization of a photoredox tertiary amine synthesis with six continuous variables (e.g., concentration, temperature, residence time) and solvent choice as a discrete variable, in a semiautomated continuous flow setup. Starting with a large library of solvents, the workflow included multiple steps of <i>a priori</i> knowledge generation (e.g., solubility predictions) to narrow the discrete space. A novel Bayesian optimization algorithm, nomadic exploratory multiobjective optimization (NEMO), was then deployed to identify and populate the Pareto front for the two reaction objectives─yield and reaction cost. Permutation feature importance and partial dependence plots identified the most important parameters for high yield, sig3, the asymmetry of the s-profile for the discrete space, and equivalences of alkene and Hantzsch ester for the continuous variables. Catalyst loading and residence time were found to be correlated to absorbed photon equivalence, while catalyst loading was additionally the main parameter to drive cost. Even though productivity was not an optimization objective, the best result achieved in flow was ∼25 times higher than reactions in batch, which equals to ∼12 g per day throughput.","PeriodicalId":55,"journal":{"name":"Organic Process Research & Development","volume":"11 1","pages":""},"PeriodicalIF":3.1000,"publicationDate":"2025-05-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Machine Learning-Driven Optimization of Continuous-Flow Photoredox Amine Synthesis\",\"authors\":\"Perman Jorayev, Sebastian Soritz, Simon Sung, Mohammed I. Jeraal, Danilo Russo, Alexandre Barthelme, Frédéric C. Toussaint, Matthew J. Gaunt, Alexei A. Lapkin\",\"doi\":\"10.1021/acs.oprd.4c00533\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Photoredox catalysis plays an important role in the synthesis of pharmaceutically relevant compounds such as C(sp<sup>3</sup>)-rich tertiary amines. The difficulty of identifying underlying mechanistic models for such novel transformations, coupled with the large reaction space of this reaction class, means that developing a robust process is challenging. In this work, we demonstrate the machine learning-driven optimization of a photoredox tertiary amine synthesis with six continuous variables (e.g., concentration, temperature, residence time) and solvent choice as a discrete variable, in a semiautomated continuous flow setup. Starting with a large library of solvents, the workflow included multiple steps of <i>a priori</i> knowledge generation (e.g., solubility predictions) to narrow the discrete space. A novel Bayesian optimization algorithm, nomadic exploratory multiobjective optimization (NEMO), was then deployed to identify and populate the Pareto front for the two reaction objectives─yield and reaction cost. Permutation feature importance and partial dependence plots identified the most important parameters for high yield, sig3, the asymmetry of the s-profile for the discrete space, and equivalences of alkene and Hantzsch ester for the continuous variables. Catalyst loading and residence time were found to be correlated to absorbed photon equivalence, while catalyst loading was additionally the main parameter to drive cost. Even though productivity was not an optimization objective, the best result achieved in flow was ∼25 times higher than reactions in batch, which equals to ∼12 g per day throughput.\",\"PeriodicalId\":55,\"journal\":{\"name\":\"Organic Process Research & Development\",\"volume\":\"11 1\",\"pages\":\"\"},\"PeriodicalIF\":3.1000,\"publicationDate\":\"2025-05-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Organic Process Research & Development\",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://doi.org/10.1021/acs.oprd.4c00533\",\"RegionNum\":3,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"CHEMISTRY, APPLIED\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Organic Process Research & Development","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1021/acs.oprd.4c00533","RegionNum":3,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, APPLIED","Score":null,"Total":0}
Machine Learning-Driven Optimization of Continuous-Flow Photoredox Amine Synthesis
Photoredox catalysis plays an important role in the synthesis of pharmaceutically relevant compounds such as C(sp3)-rich tertiary amines. The difficulty of identifying underlying mechanistic models for such novel transformations, coupled with the large reaction space of this reaction class, means that developing a robust process is challenging. In this work, we demonstrate the machine learning-driven optimization of a photoredox tertiary amine synthesis with six continuous variables (e.g., concentration, temperature, residence time) and solvent choice as a discrete variable, in a semiautomated continuous flow setup. Starting with a large library of solvents, the workflow included multiple steps of a priori knowledge generation (e.g., solubility predictions) to narrow the discrete space. A novel Bayesian optimization algorithm, nomadic exploratory multiobjective optimization (NEMO), was then deployed to identify and populate the Pareto front for the two reaction objectives─yield and reaction cost. Permutation feature importance and partial dependence plots identified the most important parameters for high yield, sig3, the asymmetry of the s-profile for the discrete space, and equivalences of alkene and Hantzsch ester for the continuous variables. Catalyst loading and residence time were found to be correlated to absorbed photon equivalence, while catalyst loading was additionally the main parameter to drive cost. Even though productivity was not an optimization objective, the best result achieved in flow was ∼25 times higher than reactions in batch, which equals to ∼12 g per day throughput.
期刊介绍:
The journal Organic Process Research & Development serves as a communication tool between industrial chemists and chemists working in universities and research institutes. As such, it reports original work from the broad field of industrial process chemistry but also presents academic results that are relevant, or potentially relevant, to industrial applications. Process chemistry is the science that enables the safe, environmentally benign and ultimately economical manufacturing of organic compounds that are required in larger amounts to help address the needs of society. Consequently, the Journal encompasses every aspect of organic chemistry, including all aspects of catalysis, synthetic methodology development and synthetic strategy exploration, but also includes aspects from analytical and solid-state chemistry and chemical engineering, such as work-up tools,process safety, or flow-chemistry. The goal of development and optimization of chemical reactions and processes is their transfer to a larger scale; original work describing such studies and the actual implementation on scale is highly relevant to the journal. However, studies on new developments from either industry, research institutes or academia that have not yet been demonstrated on scale, but where an industrial utility can be expected and where the study has addressed important prerequisites for a scale-up and has given confidence into the reliability and practicality of the chemistry, also serve the mission of OPR&D as a communication tool between the different contributors to the field.