Mohamed Riad Youcefi , Fahd Mohamad Alqahtani , Menad Nait Amar , Hakim Djema , Mohammad Ghasemi
{"title":"改进了可解释的多基因遗传编程相关性,用于预测二氧化碳在各种盐水中的溶解度","authors":"Mohamed Riad Youcefi , Fahd Mohamad Alqahtani , Menad Nait Amar , Hakim Djema , Mohammad Ghasemi","doi":"10.1016/j.desal.2025.118917","DOIUrl":null,"url":null,"abstract":"<div><div>Storing carbon dioxide (CO<sub>2</sub>) in deep saline aquifers has gained significant attention as an effective approach to reducing greenhouse gas emissions. The success of carbon capture and storage (CCS) in deep saline aquifers relies on accurately assessing CO<sub>2</sub> solubility in brine under real operating conditions. Gaining detailed insight into CO<sub>2</sub> behavior in subsurface environments is essential for effectively implementing this method. In this study, we have made substantial efforts to compile a comprehensive dataset on CO<sub>2</sub> solubility in diverse aqueous electrolyte solutions of CaCl<sub>2</sub>, NaCl, MgCl<sub>2</sub>, Na<sub>2</sub>SO<sub>4</sub>, and KCl, encompassing a wide interval of operating conditions and widespread salt concentrations. To leverage this extensive dataset effectively, we applied a robust white-box machine learning technique, namely the multi-gene genetic programming (MGGP) to establish user-friendly explicit correlations for accurately predicting CO<sub>2</sub> solubility in numerous brines under subsurface conditions. Our evaluation revealed that the derived correlations provided significantly more precise predictions of CO<sub>2</sub> solubility. In this context, the MGGP-based correlations demonstrated trustworthy accuracy with total root mean square error (RMSE) values of only 0.0235, 0.0304, 0.0196, 0.0289, and 0.0313 for CaCl<sub>2</sub>, NaCl, MgCl<sub>2</sub>, Na<sub>2</sub>SO<sub>4</sub>, and KCl solutions, respectively. Additionally, the trend analysis showed that the proposed correlations effectively captured the behavior of CO<sub>2</sub> solubility across a wide range of operating pressures, temperatures, and solvent salinities, demonstrating their robustness and reliability. Furthermore, Shapley Additive Explanations (SHAP) provided useful insights into how different inputs contribute and interact, making the proposed correlations easier to understand and interpret. Lastly, the newly introduced MGGP-based correlations exhibit notable improvements in accuracy, user-friendliness, generalization, and explainability, ensuring superior performance across diverse subsurface conditions. These advancements mark a significant step forward in the cost-effective and precise estimation of CO<sub>2</sub> solubility in various brines, making the proposed correlations highly valuable for applications in carbon capture and storage, environmental impact assessment, petroleum geology, reservoir engineering, and other CO<sub>2</sub>-related domains.</div></div>","PeriodicalId":299,"journal":{"name":"Desalination","volume":"610 ","pages":"Article 118917"},"PeriodicalIF":8.3000,"publicationDate":"2025-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Improved explainable multi-gene genetic programming correlations for predicting carbon dioxide solubility in various brines\",\"authors\":\"Mohamed Riad Youcefi , Fahd Mohamad Alqahtani , Menad Nait Amar , Hakim Djema , Mohammad Ghasemi\",\"doi\":\"10.1016/j.desal.2025.118917\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Storing carbon dioxide (CO<sub>2</sub>) in deep saline aquifers has gained significant attention as an effective approach to reducing greenhouse gas emissions. The success of carbon capture and storage (CCS) in deep saline aquifers relies on accurately assessing CO<sub>2</sub> solubility in brine under real operating conditions. Gaining detailed insight into CO<sub>2</sub> behavior in subsurface environments is essential for effectively implementing this method. In this study, we have made substantial efforts to compile a comprehensive dataset on CO<sub>2</sub> solubility in diverse aqueous electrolyte solutions of CaCl<sub>2</sub>, NaCl, MgCl<sub>2</sub>, Na<sub>2</sub>SO<sub>4</sub>, and KCl, encompassing a wide interval of operating conditions and widespread salt concentrations. To leverage this extensive dataset effectively, we applied a robust white-box machine learning technique, namely the multi-gene genetic programming (MGGP) to establish user-friendly explicit correlations for accurately predicting CO<sub>2</sub> solubility in numerous brines under subsurface conditions. Our evaluation revealed that the derived correlations provided significantly more precise predictions of CO<sub>2</sub> solubility. In this context, the MGGP-based correlations demonstrated trustworthy accuracy with total root mean square error (RMSE) values of only 0.0235, 0.0304, 0.0196, 0.0289, and 0.0313 for CaCl<sub>2</sub>, NaCl, MgCl<sub>2</sub>, Na<sub>2</sub>SO<sub>4</sub>, and KCl solutions, respectively. Additionally, the trend analysis showed that the proposed correlations effectively captured the behavior of CO<sub>2</sub> solubility across a wide range of operating pressures, temperatures, and solvent salinities, demonstrating their robustness and reliability. Furthermore, Shapley Additive Explanations (SHAP) provided useful insights into how different inputs contribute and interact, making the proposed correlations easier to understand and interpret. Lastly, the newly introduced MGGP-based correlations exhibit notable improvements in accuracy, user-friendliness, generalization, and explainability, ensuring superior performance across diverse subsurface conditions. These advancements mark a significant step forward in the cost-effective and precise estimation of CO<sub>2</sub> solubility in various brines, making the proposed correlations highly valuable for applications in carbon capture and storage, environmental impact assessment, petroleum geology, reservoir engineering, and other CO<sub>2</sub>-related domains.</div></div>\",\"PeriodicalId\":299,\"journal\":{\"name\":\"Desalination\",\"volume\":\"610 \",\"pages\":\"Article 118917\"},\"PeriodicalIF\":8.3000,\"publicationDate\":\"2025-04-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Desalination\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0011916425003923\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, CHEMICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Desalination","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0011916425003923","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, CHEMICAL","Score":null,"Total":0}
Improved explainable multi-gene genetic programming correlations for predicting carbon dioxide solubility in various brines
Storing carbon dioxide (CO2) in deep saline aquifers has gained significant attention as an effective approach to reducing greenhouse gas emissions. The success of carbon capture and storage (CCS) in deep saline aquifers relies on accurately assessing CO2 solubility in brine under real operating conditions. Gaining detailed insight into CO2 behavior in subsurface environments is essential for effectively implementing this method. In this study, we have made substantial efforts to compile a comprehensive dataset on CO2 solubility in diverse aqueous electrolyte solutions of CaCl2, NaCl, MgCl2, Na2SO4, and KCl, encompassing a wide interval of operating conditions and widespread salt concentrations. To leverage this extensive dataset effectively, we applied a robust white-box machine learning technique, namely the multi-gene genetic programming (MGGP) to establish user-friendly explicit correlations for accurately predicting CO2 solubility in numerous brines under subsurface conditions. Our evaluation revealed that the derived correlations provided significantly more precise predictions of CO2 solubility. In this context, the MGGP-based correlations demonstrated trustworthy accuracy with total root mean square error (RMSE) values of only 0.0235, 0.0304, 0.0196, 0.0289, and 0.0313 for CaCl2, NaCl, MgCl2, Na2SO4, and KCl solutions, respectively. Additionally, the trend analysis showed that the proposed correlations effectively captured the behavior of CO2 solubility across a wide range of operating pressures, temperatures, and solvent salinities, demonstrating their robustness and reliability. Furthermore, Shapley Additive Explanations (SHAP) provided useful insights into how different inputs contribute and interact, making the proposed correlations easier to understand and interpret. Lastly, the newly introduced MGGP-based correlations exhibit notable improvements in accuracy, user-friendliness, generalization, and explainability, ensuring superior performance across diverse subsurface conditions. These advancements mark a significant step forward in the cost-effective and precise estimation of CO2 solubility in various brines, making the proposed correlations highly valuable for applications in carbon capture and storage, environmental impact assessment, petroleum geology, reservoir engineering, and other CO2-related domains.
期刊介绍:
Desalination is a scholarly journal that focuses on the field of desalination materials, processes, and associated technologies. It encompasses a wide range of disciplines and aims to publish exceptional papers in this area.
The journal invites submissions that explicitly revolve around water desalting and its applications to various sources such as seawater, groundwater, and wastewater. It particularly encourages research on diverse desalination methods including thermal, membrane, sorption, and hybrid processes.
By providing a platform for innovative studies, Desalination aims to advance the understanding and development of desalination technologies, promoting sustainable solutions for water scarcity challenges.