Pooling solvent mixtures for solvation free energy predictions

IF 13.3 1区 工程技术 Q1 ENGINEERING, CHEMICAL
Roel J. Leenhouts, Nathan Morgan, Emad Al Ibrahim, William H. Green, Florence H. Vermeire
{"title":"Pooling solvent mixtures for solvation free energy predictions","authors":"Roel J. Leenhouts, Nathan Morgan, Emad Al Ibrahim, William H. Green, Florence H. Vermeire","doi":"10.1016/j.cej.2025.162232","DOIUrl":null,"url":null,"abstract":"Solvation free energy is an important design parameter in reaction kinetics and separation processes, making it a critical property to predict during process development. In previous research, directed message passing neural networks (D-MPNN) have successfully been used to predict solvation free energies and enthalpies in organic solvents. However, solvent mixtures provide greater flexibility for optimizing solvent interactions than monosolvents. This work aims to extend our previous models to mixtures. To handle mixtures in a permutation invariant manner we propose a pooling function; MolPool. With this pooling function, the machine learning models can learn and predict solvation energy and enthalpy for an arbitrary number of molecules in the mixed solvent. The novel SolProp-mix software that applies MolPool to D-MPNN was compared to state-of-the-art architectures for predicting mixture properties and validated with our new database of COSMOtherm calculations; BinarySolv-QM. To improve predictions towards experimental accuracy, the network was then fine-tuned on experimental data in monosolvents. To demonstrate the benefit of this transfer learning methodology, experimental datasets of solvation free energies in binary (BinarySolv-Exp) and ternary (TernarySolv-Exp) solvent mixtures were compiled from data on vapor–liquid equilibria and activity coefficients. The neural network performed comparable in accuracy to the benchmark of COSMOtherm calculations with an MAE of 0.29 kcal/mol and an RMSE of 0.45 kcal/mol for binary mixed solvents. Additionally, the ability to capture trends for a varying mixture composition was validated successfully. Our model’s ability to accurately predict mixture properties from the combination of <em>in silico</em> data and pure component experimental data is promising given the scarcity of experimental data for mixtures in many fields.","PeriodicalId":270,"journal":{"name":"Chemical Engineering Journal","volume":"183 1","pages":""},"PeriodicalIF":13.3000,"publicationDate":"2025-04-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Chemical Engineering Journal","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1016/j.cej.2025.162232","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, CHEMICAL","Score":null,"Total":0}
引用次数: 0

Abstract

Solvation free energy is an important design parameter in reaction kinetics and separation processes, making it a critical property to predict during process development. In previous research, directed message passing neural networks (D-MPNN) have successfully been used to predict solvation free energies and enthalpies in organic solvents. However, solvent mixtures provide greater flexibility for optimizing solvent interactions than monosolvents. This work aims to extend our previous models to mixtures. To handle mixtures in a permutation invariant manner we propose a pooling function; MolPool. With this pooling function, the machine learning models can learn and predict solvation energy and enthalpy for an arbitrary number of molecules in the mixed solvent. The novel SolProp-mix software that applies MolPool to D-MPNN was compared to state-of-the-art architectures for predicting mixture properties and validated with our new database of COSMOtherm calculations; BinarySolv-QM. To improve predictions towards experimental accuracy, the network was then fine-tuned on experimental data in monosolvents. To demonstrate the benefit of this transfer learning methodology, experimental datasets of solvation free energies in binary (BinarySolv-Exp) and ternary (TernarySolv-Exp) solvent mixtures were compiled from data on vapor–liquid equilibria and activity coefficients. The neural network performed comparable in accuracy to the benchmark of COSMOtherm calculations with an MAE of 0.29 kcal/mol and an RMSE of 0.45 kcal/mol for binary mixed solvents. Additionally, the ability to capture trends for a varying mixture composition was validated successfully. Our model’s ability to accurately predict mixture properties from the combination of in silico data and pure component experimental data is promising given the scarcity of experimental data for mixtures in many fields.

Abstract Image

求助全文
约1分钟内获得全文 求助全文
来源期刊
Chemical Engineering Journal
Chemical Engineering Journal 工程技术-工程:化工
CiteScore
21.70
自引率
9.30%
发文量
6781
审稿时长
2.4 months
期刊介绍: The Chemical Engineering Journal is an international research journal that invites contributions of original and novel fundamental research. It aims to provide an international platform for presenting original fundamental research, interpretative reviews, and discussions on new developments in chemical engineering. The journal welcomes papers that describe novel theory and its practical application, as well as those that demonstrate the transfer of techniques from other disciplines. It also welcomes reports on carefully conducted experimental work that is soundly interpreted. The main focus of the journal is on original and rigorous research results that have broad significance. The Catalysis section within the Chemical Engineering Journal focuses specifically on Experimental and Theoretical studies in the fields of heterogeneous catalysis, molecular catalysis, and biocatalysis. These studies have industrial impact on various sectors such as chemicals, energy, materials, foods, healthcare, and environmental protection.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信