An accurate and interpretable deep learning model for yield prediction using hybrid molecular representations†

IF 3.1 3区 化学 Q2 CHEMISTRY, MULTIDISCIPLINARY
Yingying Wang, Xinyi Sun, Yuanyuan Li, Li Wang and Jinglai Zhang
{"title":"An accurate and interpretable deep learning model for yield prediction using hybrid molecular representations†","authors":"Yingying Wang, Xinyi Sun, Yuanyuan Li, Li Wang and Jinglai Zhang","doi":"10.1039/D5RE00205B","DOIUrl":null,"url":null,"abstract":"<p >In recent years, imidazolium-based ionic liquids (ILs) and pyrazolium-based ILs have shown efficient catalytic abilities in CO<small><sub>2</sub></small> cycloaddition reactions. However, these catalysts require stringent conditions for the reactions in the absence of co-catalysts, thereby limiting their applicability. Therefore, there is an increasing demand for developing new IL catalysts capable of operating under milder conditions. Traditional methods for designing these ILs, whether through theoretical calculations or experimental exploration, are both costly and challenging. This study presents a deep learning model for predicting the yield of CO<small><sub>2</sub></small> cycloaddition reactions catalyzed by imidazolium-based and pyrazolium-based ILs. The model utilizes hybrid fingerprint features to describe the structural information of molecules, achieving a squared correlation coefficient (<em>R</em><small><sup>2</sup></small>) value of 0.85. Moreover, the SHapley Additive exPlanations (SHAP) technique is employed to identify the key factors influencing yield. Additionally, a molecular generation scheme is established to create new IL structures. Through a two-step screening strategy involving yield prediction using the deep learning model and energy barrier calculations <em>via</em> density functional theory (DFT), 14 promising imidazolium-based ILs are identified as potential efficient catalysts for CO<small><sub>2</sub></small> cycloaddition reactions with epichlorohydrin under mild conditions. This work introduces a novel machine learning approach for designing imidazolium-based IL and pyrazolium-based IL catalysts, aimed at reducing the experimental burden and exploration costs associated with catalyst development.</p>","PeriodicalId":101,"journal":{"name":"Reaction Chemistry & Engineering","volume":" 10","pages":" 2334-2344"},"PeriodicalIF":3.1000,"publicationDate":"2025-07-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Reaction Chemistry & Engineering","FirstCategoryId":"92","ListUrlMain":"https://pubs.rsc.org/en/content/articlelanding/2025/re/d5re00205b","RegionNum":3,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

Abstract

In recent years, imidazolium-based ionic liquids (ILs) and pyrazolium-based ILs have shown efficient catalytic abilities in CO2 cycloaddition reactions. However, these catalysts require stringent conditions for the reactions in the absence of co-catalysts, thereby limiting their applicability. Therefore, there is an increasing demand for developing new IL catalysts capable of operating under milder conditions. Traditional methods for designing these ILs, whether through theoretical calculations or experimental exploration, are both costly and challenging. This study presents a deep learning model for predicting the yield of CO2 cycloaddition reactions catalyzed by imidazolium-based and pyrazolium-based ILs. The model utilizes hybrid fingerprint features to describe the structural information of molecules, achieving a squared correlation coefficient (R2) value of 0.85. Moreover, the SHapley Additive exPlanations (SHAP) technique is employed to identify the key factors influencing yield. Additionally, a molecular generation scheme is established to create new IL structures. Through a two-step screening strategy involving yield prediction using the deep learning model and energy barrier calculations via density functional theory (DFT), 14 promising imidazolium-based ILs are identified as potential efficient catalysts for CO2 cycloaddition reactions with epichlorohydrin under mild conditions. This work introduces a novel machine learning approach for designing imidazolium-based IL and pyrazolium-based IL catalysts, aimed at reducing the experimental burden and exploration costs associated with catalyst development.

Abstract Image

一个准确的和可解释的深度学习模型产量预测使用混合分子表示†
近年来,咪唑基离子液体和吡唑基离子液体在CO2环加成反应中表现出了高效的催化能力。然而,这些催化剂在没有助催化剂的情况下需要严格的反应条件,从而限制了它们的适用性。因此,开发能够在较温和条件下工作的新型IL催化剂的需求越来越大。无论是通过理论计算还是实验探索,设计这些il的传统方法既昂贵又具有挑战性。本研究提出了一种预测咪唑基和吡唑基il催化CO2环加成反应产率的深度学习模型。该模型利用混合指纹特征描述分子结构信息,相关系数平方(R2)值为0.85。此外,采用SHapley加性解释(SHAP)技术识别影响产量的关键因素。此外,还建立了一种分子生成方案来生成新的IL结构。通过两步筛选策略,包括利用深度学习模型进行产率预测和利用密度泛函数理论(DFT)进行能量势垒计算,确定了14种咪唑基il作为温和条件下CO2与环氧氯丙烷环加成反应的潜在高效催化剂。这项工作介绍了一种新的机器学习方法来设计咪唑基和吡唑基IL催化剂,旨在减少与催化剂开发相关的实验负担和勘探成本。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Reaction Chemistry & Engineering
Reaction Chemistry & Engineering Chemistry-Chemistry (miscellaneous)
CiteScore
6.60
自引率
7.70%
发文量
227
期刊介绍: Reaction Chemistry & Engineering is a new journal reporting cutting edge research into all aspects of making molecules for the benefit of fundamental research, applied processes and wider society. From fundamental, molecular-level chemistry to large scale chemical production, Reaction Chemistry & Engineering brings together communities of chemists and chemical engineers working to ensure the crucial role of reaction chemistry in today’s world.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信