{"title":"离子液体对二氧化碳吸收的结构洞见:机器学习、关联规则和元学习对亨利定律常数建模的比较研究","authors":"Karol Baran*, and , Adam Kloskowski, ","doi":"10.1021/acssuschemeng.5c06122","DOIUrl":null,"url":null,"abstract":"<p >The discovery of novel solvents is crucial for advancing green chemistry, and chemoinformatics can accelerate this process. Ionic liquids (ILs) have diverse applications, but understanding the link between their structure and properties is key to optimizing their performance. In this study, the use of emerging algorithms in chemoinformatics was explored, such as the CN2 rule algorithm and model-agnostic meta-learning (MAML), to improve the predictive power of quantitative structure–property relationship (QSPR) models for ILs. The data set on carbon dioxide solubility was leveraged to develop predictive artificial intelligence (AI) models, as these techniques are particularly well-suited for addressing complex problems of high scientific interest, such as mitigating CO<sub>2</sub>-related environmental issues. The effectiveness of using molecular fingerprints (MFs) and molecular descriptors (MDs) to represent ILs’ molecular structures was evaluated to find that MFs are more suitable for rule mining. By incorporating fractional free volume (FFV) alongside MDs, nonlinear QSPR models with higher predictive power were obtained. Henry’s law constant prediction utilized a data set composed of 76 ILs evaluated under the same temperature conditions. Model training utilized 80% of the data, while the rest was used for testing. Moreover, integrating COSMO-RS simulated data with MAML allowed for enhanced neural network performance. The gradient boosting model utilizing FFV was found to be the best performing. The findings on chemical data interpretation with rules can inform the development of more efficient solvents. MAML algorithm was further evaluated on a data set regarding solubility expressed in mole fraction for over 6000 data points for meta-training on tasks different than carbon dioxide solubility and fine-tuned on a fraction of over 9000 data points regarding CO<sub>2</sub> mole fraction in a two-component system with ILs. MAML allowed us to obtain a stable model even while utilizing 128 data points for fine-tuning and about 1800 data points for testing.</p><p >AI improves understanding of ionic liquid structure, enabling sustainable solvent tailoring and carbon dioxide capture in environment-friendly industrial processes.</p>","PeriodicalId":25,"journal":{"name":"ACS Sustainable Chemistry & Engineering","volume":"13 31","pages":"12805–12817"},"PeriodicalIF":7.3000,"publicationDate":"2025-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.acs.org/doi/pdf/10.1021/acssuschemeng.5c06122","citationCount":"0","resultStr":"{\"title\":\"Unlocking Structural Insights into CO2 Absorption with Ionic Liquids: A Comparative Study of Machine Learning, Association Rules, and Meta-Learning for Modeling Henry’s Law Constant\",\"authors\":\"Karol Baran*, and , Adam Kloskowski, \",\"doi\":\"10.1021/acssuschemeng.5c06122\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p >The discovery of novel solvents is crucial for advancing green chemistry, and chemoinformatics can accelerate this process. Ionic liquids (ILs) have diverse applications, but understanding the link between their structure and properties is key to optimizing their performance. In this study, the use of emerging algorithms in chemoinformatics was explored, such as the CN2 rule algorithm and model-agnostic meta-learning (MAML), to improve the predictive power of quantitative structure–property relationship (QSPR) models for ILs. The data set on carbon dioxide solubility was leveraged to develop predictive artificial intelligence (AI) models, as these techniques are particularly well-suited for addressing complex problems of high scientific interest, such as mitigating CO<sub>2</sub>-related environmental issues. The effectiveness of using molecular fingerprints (MFs) and molecular descriptors (MDs) to represent ILs’ molecular structures was evaluated to find that MFs are more suitable for rule mining. By incorporating fractional free volume (FFV) alongside MDs, nonlinear QSPR models with higher predictive power were obtained. Henry’s law constant prediction utilized a data set composed of 76 ILs evaluated under the same temperature conditions. Model training utilized 80% of the data, while the rest was used for testing. Moreover, integrating COSMO-RS simulated data with MAML allowed for enhanced neural network performance. The gradient boosting model utilizing FFV was found to be the best performing. The findings on chemical data interpretation with rules can inform the development of more efficient solvents. MAML algorithm was further evaluated on a data set regarding solubility expressed in mole fraction for over 6000 data points for meta-training on tasks different than carbon dioxide solubility and fine-tuned on a fraction of over 9000 data points regarding CO<sub>2</sub> mole fraction in a two-component system with ILs. MAML allowed us to obtain a stable model even while utilizing 128 data points for fine-tuning and about 1800 data points for testing.</p><p >AI improves understanding of ionic liquid structure, enabling sustainable solvent tailoring and carbon dioxide capture in environment-friendly industrial processes.</p>\",\"PeriodicalId\":25,\"journal\":{\"name\":\"ACS Sustainable Chemistry & Engineering\",\"volume\":\"13 31\",\"pages\":\"12805–12817\"},\"PeriodicalIF\":7.3000,\"publicationDate\":\"2025-07-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://pubs.acs.org/doi/pdf/10.1021/acssuschemeng.5c06122\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ACS Sustainable Chemistry & Engineering\",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://pubs.acs.org/doi/10.1021/acssuschemeng.5c06122\",\"RegionNum\":1,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"CHEMISTRY, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Sustainable Chemistry & Engineering","FirstCategoryId":"92","ListUrlMain":"https://pubs.acs.org/doi/10.1021/acssuschemeng.5c06122","RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
Unlocking Structural Insights into CO2 Absorption with Ionic Liquids: A Comparative Study of Machine Learning, Association Rules, and Meta-Learning for Modeling Henry’s Law Constant
The discovery of novel solvents is crucial for advancing green chemistry, and chemoinformatics can accelerate this process. Ionic liquids (ILs) have diverse applications, but understanding the link between their structure and properties is key to optimizing their performance. In this study, the use of emerging algorithms in chemoinformatics was explored, such as the CN2 rule algorithm and model-agnostic meta-learning (MAML), to improve the predictive power of quantitative structure–property relationship (QSPR) models for ILs. The data set on carbon dioxide solubility was leveraged to develop predictive artificial intelligence (AI) models, as these techniques are particularly well-suited for addressing complex problems of high scientific interest, such as mitigating CO2-related environmental issues. The effectiveness of using molecular fingerprints (MFs) and molecular descriptors (MDs) to represent ILs’ molecular structures was evaluated to find that MFs are more suitable for rule mining. By incorporating fractional free volume (FFV) alongside MDs, nonlinear QSPR models with higher predictive power were obtained. Henry’s law constant prediction utilized a data set composed of 76 ILs evaluated under the same temperature conditions. Model training utilized 80% of the data, while the rest was used for testing. Moreover, integrating COSMO-RS simulated data with MAML allowed for enhanced neural network performance. The gradient boosting model utilizing FFV was found to be the best performing. The findings on chemical data interpretation with rules can inform the development of more efficient solvents. MAML algorithm was further evaluated on a data set regarding solubility expressed in mole fraction for over 6000 data points for meta-training on tasks different than carbon dioxide solubility and fine-tuned on a fraction of over 9000 data points regarding CO2 mole fraction in a two-component system with ILs. MAML allowed us to obtain a stable model even while utilizing 128 data points for fine-tuning and about 1800 data points for testing.
AI improves understanding of ionic liquid structure, enabling sustainable solvent tailoring and carbon dioxide capture in environment-friendly industrial processes.
期刊介绍:
ACS Sustainable Chemistry & Engineering is a prestigious weekly peer-reviewed scientific journal published by the American Chemical Society. Dedicated to advancing the principles of green chemistry and green engineering, it covers a wide array of research topics including green chemistry, green engineering, biomass, alternative energy, and life cycle assessment.
The journal welcomes submissions in various formats, including Letters, Articles, Features, and Perspectives (Reviews), that address the challenges of sustainability in the chemical enterprise and contribute to the advancement of sustainable practices. Join us in shaping the future of sustainable chemistry and engineering.