{"title":"ML-driven models for predicting CO2 uptake in metal–organic frameworks (MOFs)","authors":"Sofiene Achour, Zied Hosni","doi":"10.1002/cjce.25509","DOIUrl":null,"url":null,"abstract":"<p>This study advances the discourse on the application of machine learning (ML) algorithms for the predictive analysis of CO<sub>2</sub> uptake in metal–organic frameworks (MOFs), with a nuanced focus on the CATBoost model's capability to navigate the complexities inherent in MOFs' heterogeneous landscape. Building upon and extending the comparative analysis, our investigation underscores the CATBoost model's remarkable prediction robustness, characterized by a significant reduction in root mean square error (RMSE) and an enhanced R-squared (R<sup>2</sup>) value, thereby affirming its superior accuracy and reliability in forecasting CO<sub>2</sub> adsorption. A pivotal aspect of our research is the integration of SHapley Additive exPlanations (SHAP) values for a detailed assessment of feature importance, which not only corroborated ‘pressure’ and ‘surface area’ as pivotal determinants of CO<sub>2</sub> uptake but also illuminated the model's advanced analytical capabilities in handling categorical features and mitigating overfitting, even within a dataset marked by intricate and non-linear patterns. Our quantitative and conceptual analysis, showcasing up to a 15% improvement in RMSE over previous models, reveals the CATBoost model's unparalleled efficiency in discerning the multifaceted interplay of factors influencing CO<sub>2</sub> adsorption. This is crucial for the strategic engineering of MOFs with optimized properties. Beyond ‘pressure’ and ‘surface area’, our SHAP analysis highlighted other descriptors with substantial values, elucidating their contributions to CO<sub>2</sub> uptake and providing invaluable insights for the MOF design process.</p>","PeriodicalId":9400,"journal":{"name":"Canadian Journal of Chemical Engineering","volume":"103 5","pages":"2161-2173"},"PeriodicalIF":1.6000,"publicationDate":"2024-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/cjce.25509","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Canadian Journal of Chemical Engineering","FirstCategoryId":"5","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/cjce.25509","RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, CHEMICAL","Score":null,"Total":0}
引用次数: 0
Abstract
This study advances the discourse on the application of machine learning (ML) algorithms for the predictive analysis of CO2 uptake in metal–organic frameworks (MOFs), with a nuanced focus on the CATBoost model's capability to navigate the complexities inherent in MOFs' heterogeneous landscape. Building upon and extending the comparative analysis, our investigation underscores the CATBoost model's remarkable prediction robustness, characterized by a significant reduction in root mean square error (RMSE) and an enhanced R-squared (R2) value, thereby affirming its superior accuracy and reliability in forecasting CO2 adsorption. A pivotal aspect of our research is the integration of SHapley Additive exPlanations (SHAP) values for a detailed assessment of feature importance, which not only corroborated ‘pressure’ and ‘surface area’ as pivotal determinants of CO2 uptake but also illuminated the model's advanced analytical capabilities in handling categorical features and mitigating overfitting, even within a dataset marked by intricate and non-linear patterns. Our quantitative and conceptual analysis, showcasing up to a 15% improvement in RMSE over previous models, reveals the CATBoost model's unparalleled efficiency in discerning the multifaceted interplay of factors influencing CO2 adsorption. This is crucial for the strategic engineering of MOFs with optimized properties. Beyond ‘pressure’ and ‘surface area’, our SHAP analysis highlighted other descriptors with substantial values, elucidating their contributions to CO2 uptake and providing invaluable insights for the MOF design process.
期刊介绍:
The Canadian Journal of Chemical Engineering (CJChE) publishes original research articles, new theoretical interpretation or experimental findings and critical reviews in the science or industrial practice of chemical and biochemical processes. Preference is given to papers having a clearly indicated scope and applicability in any of the following areas: Fluid mechanics, heat and mass transfer, multiphase flows, separations processes, thermodynamics, process systems engineering, reactors and reaction kinetics, catalysis, interfacial phenomena, electrochemical phenomena, bioengineering, minerals processing and natural products and environmental and energy engineering. Papers that merely describe or present a conventional or routine analysis of existing processes will not be considered.