Yingying Wang, Xinyi Sun, Yuanyuan Li, Li Wang and Jinglai Zhang
{"title":"An accurate and interpretable deep learning model for yield prediction using hybrid molecular representations†","authors":"Yingying Wang, Xinyi Sun, Yuanyuan Li, Li Wang and Jinglai Zhang","doi":"10.1039/D5RE00205B","DOIUrl":null,"url":null,"abstract":"<p >In recent years, imidazolium-based ionic liquids (ILs) and pyrazolium-based ILs have shown efficient catalytic abilities in CO<small><sub>2</sub></small> cycloaddition reactions. However, these catalysts require stringent conditions for the reactions in the absence of co-catalysts, thereby limiting their applicability. Therefore, there is an increasing demand for developing new IL catalysts capable of operating under milder conditions. Traditional methods for designing these ILs, whether through theoretical calculations or experimental exploration, are both costly and challenging. This study presents a deep learning model for predicting the yield of CO<small><sub>2</sub></small> cycloaddition reactions catalyzed by imidazolium-based and pyrazolium-based ILs. The model utilizes hybrid fingerprint features to describe the structural information of molecules, achieving a squared correlation coefficient (<em>R</em><small><sup>2</sup></small>) value of 0.85. Moreover, the SHapley Additive exPlanations (SHAP) technique is employed to identify the key factors influencing yield. Additionally, a molecular generation scheme is established to create new IL structures. Through a two-step screening strategy involving yield prediction using the deep learning model and energy barrier calculations <em>via</em> density functional theory (DFT), 14 promising imidazolium-based ILs are identified as potential efficient catalysts for CO<small><sub>2</sub></small> cycloaddition reactions with epichlorohydrin under mild conditions. This work introduces a novel machine learning approach for designing imidazolium-based IL and pyrazolium-based IL catalysts, aimed at reducing the experimental burden and exploration costs associated with catalyst development.</p>","PeriodicalId":101,"journal":{"name":"Reaction Chemistry & Engineering","volume":" 10","pages":" 2334-2344"},"PeriodicalIF":3.1000,"publicationDate":"2025-07-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Reaction Chemistry & Engineering","FirstCategoryId":"92","ListUrlMain":"https://pubs.rsc.org/en/content/articlelanding/2025/re/d5re00205b","RegionNum":3,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
In recent years, imidazolium-based ionic liquids (ILs) and pyrazolium-based ILs have shown efficient catalytic abilities in CO2 cycloaddition reactions. However, these catalysts require stringent conditions for the reactions in the absence of co-catalysts, thereby limiting their applicability. Therefore, there is an increasing demand for developing new IL catalysts capable of operating under milder conditions. Traditional methods for designing these ILs, whether through theoretical calculations or experimental exploration, are both costly and challenging. This study presents a deep learning model for predicting the yield of CO2 cycloaddition reactions catalyzed by imidazolium-based and pyrazolium-based ILs. The model utilizes hybrid fingerprint features to describe the structural information of molecules, achieving a squared correlation coefficient (R2) value of 0.85. Moreover, the SHapley Additive exPlanations (SHAP) technique is employed to identify the key factors influencing yield. Additionally, a molecular generation scheme is established to create new IL structures. Through a two-step screening strategy involving yield prediction using the deep learning model and energy barrier calculations via density functional theory (DFT), 14 promising imidazolium-based ILs are identified as potential efficient catalysts for CO2 cycloaddition reactions with epichlorohydrin under mild conditions. This work introduces a novel machine learning approach for designing imidazolium-based IL and pyrazolium-based IL catalysts, aimed at reducing the experimental burden and exploration costs associated with catalyst development.
期刊介绍:
Reaction Chemistry & Engineering is a new journal reporting cutting edge research into all aspects of making molecules for the benefit of fundamental research, applied processes and wider society.
From fundamental, molecular-level chemistry to large scale chemical production, Reaction Chemistry & Engineering brings together communities of chemists and chemical engineers working to ensure the crucial role of reaction chemistry in today’s world.