离子液体对二氧化碳吸收的结构洞见:机器学习、关联规则和元学习对亨利定律常数建模的比较研究

IF 7.3 1区 化学 Q1 CHEMISTRY, MULTIDISCIPLINARY
Karol Baran*,  and , Adam Kloskowski, 
{"title":"离子液体对二氧化碳吸收的结构洞见:机器学习、关联规则和元学习对亨利定律常数建模的比较研究","authors":"Karol Baran*,&nbsp; and ,&nbsp;Adam Kloskowski,&nbsp;","doi":"10.1021/acssuschemeng.5c06122","DOIUrl":null,"url":null,"abstract":"<p >The discovery of novel solvents is crucial for advancing green chemistry, and chemoinformatics can accelerate this process. Ionic liquids (ILs) have diverse applications, but understanding the link between their structure and properties is key to optimizing their performance. In this study, the use of emerging algorithms in chemoinformatics was explored, such as the CN2 rule algorithm and model-agnostic meta-learning (MAML), to improve the predictive power of quantitative structure–property relationship (QSPR) models for ILs. The data set on carbon dioxide solubility was leveraged to develop predictive artificial intelligence (AI) models, as these techniques are particularly well-suited for addressing complex problems of high scientific interest, such as mitigating CO<sub>2</sub>-related environmental issues. The effectiveness of using molecular fingerprints (MFs) and molecular descriptors (MDs) to represent ILs’ molecular structures was evaluated to find that MFs are more suitable for rule mining. By incorporating fractional free volume (FFV) alongside MDs, nonlinear QSPR models with higher predictive power were obtained. Henry’s law constant prediction utilized a data set composed of 76 ILs evaluated under the same temperature conditions. Model training utilized 80% of the data, while the rest was used for testing. Moreover, integrating COSMO-RS simulated data with MAML allowed for enhanced neural network performance. The gradient boosting model utilizing FFV was found to be the best performing. The findings on chemical data interpretation with rules can inform the development of more efficient solvents. MAML algorithm was further evaluated on a data set regarding solubility expressed in mole fraction for over 6000 data points for meta-training on tasks different than carbon dioxide solubility and fine-tuned on a fraction of over 9000 data points regarding CO<sub>2</sub> mole fraction in a two-component system with ILs. MAML allowed us to obtain a stable model even while utilizing 128 data points for fine-tuning and about 1800 data points for testing.</p><p >AI improves understanding of ionic liquid structure, enabling sustainable solvent tailoring and carbon dioxide capture in environment-friendly industrial processes.</p>","PeriodicalId":25,"journal":{"name":"ACS Sustainable Chemistry & Engineering","volume":"13 31","pages":"12805–12817"},"PeriodicalIF":7.3000,"publicationDate":"2025-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.acs.org/doi/pdf/10.1021/acssuschemeng.5c06122","citationCount":"0","resultStr":"{\"title\":\"Unlocking Structural Insights into CO2 Absorption with Ionic Liquids: A Comparative Study of Machine Learning, Association Rules, and Meta-Learning for Modeling Henry’s Law Constant\",\"authors\":\"Karol Baran*,&nbsp; and ,&nbsp;Adam Kloskowski,&nbsp;\",\"doi\":\"10.1021/acssuschemeng.5c06122\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p >The discovery of novel solvents is crucial for advancing green chemistry, and chemoinformatics can accelerate this process. Ionic liquids (ILs) have diverse applications, but understanding the link between their structure and properties is key to optimizing their performance. In this study, the use of emerging algorithms in chemoinformatics was explored, such as the CN2 rule algorithm and model-agnostic meta-learning (MAML), to improve the predictive power of quantitative structure–property relationship (QSPR) models for ILs. The data set on carbon dioxide solubility was leveraged to develop predictive artificial intelligence (AI) models, as these techniques are particularly well-suited for addressing complex problems of high scientific interest, such as mitigating CO<sub>2</sub>-related environmental issues. The effectiveness of using molecular fingerprints (MFs) and molecular descriptors (MDs) to represent ILs’ molecular structures was evaluated to find that MFs are more suitable for rule mining. By incorporating fractional free volume (FFV) alongside MDs, nonlinear QSPR models with higher predictive power were obtained. Henry’s law constant prediction utilized a data set composed of 76 ILs evaluated under the same temperature conditions. Model training utilized 80% of the data, while the rest was used for testing. Moreover, integrating COSMO-RS simulated data with MAML allowed for enhanced neural network performance. The gradient boosting model utilizing FFV was found to be the best performing. The findings on chemical data interpretation with rules can inform the development of more efficient solvents. MAML algorithm was further evaluated on a data set regarding solubility expressed in mole fraction for over 6000 data points for meta-training on tasks different than carbon dioxide solubility and fine-tuned on a fraction of over 9000 data points regarding CO<sub>2</sub> mole fraction in a two-component system with ILs. MAML allowed us to obtain a stable model even while utilizing 128 data points for fine-tuning and about 1800 data points for testing.</p><p >AI improves understanding of ionic liquid structure, enabling sustainable solvent tailoring and carbon dioxide capture in environment-friendly industrial processes.</p>\",\"PeriodicalId\":25,\"journal\":{\"name\":\"ACS Sustainable Chemistry & Engineering\",\"volume\":\"13 31\",\"pages\":\"12805–12817\"},\"PeriodicalIF\":7.3000,\"publicationDate\":\"2025-07-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://pubs.acs.org/doi/pdf/10.1021/acssuschemeng.5c06122\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ACS Sustainable Chemistry & Engineering\",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://pubs.acs.org/doi/10.1021/acssuschemeng.5c06122\",\"RegionNum\":1,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"CHEMISTRY, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Sustainable Chemistry & Engineering","FirstCategoryId":"92","ListUrlMain":"https://pubs.acs.org/doi/10.1021/acssuschemeng.5c06122","RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

摘要

新型溶剂的发现对于推进绿色化学至关重要,而化学信息学可以加速这一进程。离子液体具有多种用途,但了解其结构和性质之间的联系是优化其性能的关键。本研究探讨了化学信息学中新兴算法的应用,如CN2规则算法和模型不可知元学习(MAML),以提高定量结构-性质关系(QSPR)模型对化学信息学的预测能力。二氧化碳溶解度数据集被用于开发预测人工智能(AI)模型,因为这些技术特别适合解决高科学兴趣的复杂问题,例如减轻二氧化碳相关的环境问题。通过对分子指纹(mf)和分子描述符(MDs)表示il分子结构的有效性进行评价,发现mf更适合于规则挖掘。通过将分数自由体积(FFV)与md结合,得到了具有较高预测能力的非线性QSPR模型。亨利定律常数预测使用了在相同温度条件下评估的76个ILs组成的数据集。模型训练利用了80%的数据,其余的用于测试。此外,将cosmos - rs模拟数据与MAML相结合可以增强神经网络的性能。结果表明,利用FFV的梯度推进模型效果最好。用规则解释化学数据的研究结果可以为开发更有效的溶剂提供信息。MAML算法在一个以摩尔分数表示的溶解度的数据集上进一步进行了评估,该数据集超过6000个数据点,用于对不同于二氧化碳溶解度的任务进行元训练,并在一个具有ILs的双组分系统中对二氧化碳摩尔分数的9000个数据点进行了微调。MAML允许我们在使用128个数据点进行微调和大约1800个数据点进行测试时获得稳定的模型。人工智能提高了对离子液体结构的理解,在环保工业过程中实现了可持续的溶剂定制和二氧化碳捕获。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Unlocking Structural Insights into CO2 Absorption with Ionic Liquids: A Comparative Study of Machine Learning, Association Rules, and Meta-Learning for Modeling Henry’s Law Constant

The discovery of novel solvents is crucial for advancing green chemistry, and chemoinformatics can accelerate this process. Ionic liquids (ILs) have diverse applications, but understanding the link between their structure and properties is key to optimizing their performance. In this study, the use of emerging algorithms in chemoinformatics was explored, such as the CN2 rule algorithm and model-agnostic meta-learning (MAML), to improve the predictive power of quantitative structure–property relationship (QSPR) models for ILs. The data set on carbon dioxide solubility was leveraged to develop predictive artificial intelligence (AI) models, as these techniques are particularly well-suited for addressing complex problems of high scientific interest, such as mitigating CO2-related environmental issues. The effectiveness of using molecular fingerprints (MFs) and molecular descriptors (MDs) to represent ILs’ molecular structures was evaluated to find that MFs are more suitable for rule mining. By incorporating fractional free volume (FFV) alongside MDs, nonlinear QSPR models with higher predictive power were obtained. Henry’s law constant prediction utilized a data set composed of 76 ILs evaluated under the same temperature conditions. Model training utilized 80% of the data, while the rest was used for testing. Moreover, integrating COSMO-RS simulated data with MAML allowed for enhanced neural network performance. The gradient boosting model utilizing FFV was found to be the best performing. The findings on chemical data interpretation with rules can inform the development of more efficient solvents. MAML algorithm was further evaluated on a data set regarding solubility expressed in mole fraction for over 6000 data points for meta-training on tasks different than carbon dioxide solubility and fine-tuned on a fraction of over 9000 data points regarding CO2 mole fraction in a two-component system with ILs. MAML allowed us to obtain a stable model even while utilizing 128 data points for fine-tuning and about 1800 data points for testing.

AI improves understanding of ionic liquid structure, enabling sustainable solvent tailoring and carbon dioxide capture in environment-friendly industrial processes.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
ACS Sustainable Chemistry & Engineering
ACS Sustainable Chemistry & Engineering CHEMISTRY, MULTIDISCIPLINARY-ENGINEERING, CHEMICAL
CiteScore
13.80
自引率
4.80%
发文量
1470
审稿时长
1.7 months
期刊介绍: ACS Sustainable Chemistry & Engineering is a prestigious weekly peer-reviewed scientific journal published by the American Chemical Society. Dedicated to advancing the principles of green chemistry and green engineering, it covers a wide array of research topics including green chemistry, green engineering, biomass, alternative energy, and life cycle assessment. The journal welcomes submissions in various formats, including Letters, Articles, Features, and Perspectives (Reviews), that address the challenges of sustainability in the chemical enterprise and contribute to the advancement of sustainable practices. Join us in shaping the future of sustainable chemistry and engineering.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信