使用先进的机器学习模型预测二氧化碳在水和盐水中的溶解度

IF 4.6 0 ENERGY & FUELS
Kun Liu , Xiao-Qiang Bian , Jing Chen , Jian Li , Yu-Peng Wang
{"title":"使用先进的机器学习模型预测二氧化碳在水和盐水中的溶解度","authors":"Kun Liu ,&nbsp;Xiao-Qiang Bian ,&nbsp;Jing Chen ,&nbsp;Jian Li ,&nbsp;Yu-Peng Wang","doi":"10.1016/j.geoen.2025.214234","DOIUrl":null,"url":null,"abstract":"<div><div>Carbon capture and storage (CCS) is a crucial technology for reducing industrial emissions, and its effectiveness is related to the prediction of CO<sub>2</sub> solubility. However, existing studies are typically limited to modeling specific water/brine systems and lack practical engineering validation. Therefore, this study proposes a CO<sub>2</sub> solubility model applicable to pure water, single-salt solutions, and mixed-salt solutions. A database containing 3383 experimental data entries was constructed from the published literature. Input variables include temperature, pressure, and salt concentrations of NaCl, KCl, Na<sub>2</sub>SO<sub>4</sub>, MgCl<sub>2</sub>, and CaCl<sub>2</sub>, and the output variable is CO<sub>2</sub> solubility. To improve the accuracy of CO<sub>2</sub> solubility prediction, we apply three optimization algorithms—Artificial Lemming Algorithm (ALA), Black-winged Kite Algorithm (BKA), and IVY Algorithm (IVYA)—to fine-tune two machine learning (ML) models: Light Gradient Boosting Machine (LightGBM) and eXtreme Gradient Boosting (XGBoost). The results were compared with the Cubic-Plus-Association (CPA) equation of state (EoS) combined with the MHV1 mixing rule (CPA-MHV1). Findings indicate that among all the models considered in this paper, BKA-LightGBM stands out for its high accuracy and low computational cost, with R<sup>2</sup> = 0.9930, RMSE = 0.0007 and AARD = 7.41 %, outperforming the CPA-MHV1 model in prediction accuracy. In addition, SHapley Additive exPlanations (SHAP) indicated that pressure is the most influential input parameter for model output. The leverage approach based on the Williams plot verified the reliability of the data, with 90.63 % of the samples distributed within a leverage threshold of 0.3, effectively minimizing the influence of outliers. Cross-validation and external validation demonstrated that the BKA-LightGBM model can be effectively applied to CCS engineering, for applications such as CO<sub>2</sub> capture via dissolution. These results suggest that the BKA-LightGBM model has strong potential to support the development of efficient and practical <span>CCS</span> technologies.</div></div>","PeriodicalId":100578,"journal":{"name":"Geoenergy Science and Engineering","volume":"257 ","pages":"Article 214234"},"PeriodicalIF":4.6000,"publicationDate":"2025-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Predicting CO2 solubility in water and brines using advanced machine learning models\",\"authors\":\"Kun Liu ,&nbsp;Xiao-Qiang Bian ,&nbsp;Jing Chen ,&nbsp;Jian Li ,&nbsp;Yu-Peng Wang\",\"doi\":\"10.1016/j.geoen.2025.214234\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Carbon capture and storage (CCS) is a crucial technology for reducing industrial emissions, and its effectiveness is related to the prediction of CO<sub>2</sub> solubility. However, existing studies are typically limited to modeling specific water/brine systems and lack practical engineering validation. Therefore, this study proposes a CO<sub>2</sub> solubility model applicable to pure water, single-salt solutions, and mixed-salt solutions. A database containing 3383 experimental data entries was constructed from the published literature. Input variables include temperature, pressure, and salt concentrations of NaCl, KCl, Na<sub>2</sub>SO<sub>4</sub>, MgCl<sub>2</sub>, and CaCl<sub>2</sub>, and the output variable is CO<sub>2</sub> solubility. To improve the accuracy of CO<sub>2</sub> solubility prediction, we apply three optimization algorithms—Artificial Lemming Algorithm (ALA), Black-winged Kite Algorithm (BKA), and IVY Algorithm (IVYA)—to fine-tune two machine learning (ML) models: Light Gradient Boosting Machine (LightGBM) and eXtreme Gradient Boosting (XGBoost). The results were compared with the Cubic-Plus-Association (CPA) equation of state (EoS) combined with the MHV1 mixing rule (CPA-MHV1). Findings indicate that among all the models considered in this paper, BKA-LightGBM stands out for its high accuracy and low computational cost, with R<sup>2</sup> = 0.9930, RMSE = 0.0007 and AARD = 7.41 %, outperforming the CPA-MHV1 model in prediction accuracy. In addition, SHapley Additive exPlanations (SHAP) indicated that pressure is the most influential input parameter for model output. The leverage approach based on the Williams plot verified the reliability of the data, with 90.63 % of the samples distributed within a leverage threshold of 0.3, effectively minimizing the influence of outliers. Cross-validation and external validation demonstrated that the BKA-LightGBM model can be effectively applied to CCS engineering, for applications such as CO<sub>2</sub> capture via dissolution. These results suggest that the BKA-LightGBM model has strong potential to support the development of efficient and practical <span>CCS</span> technologies.</div></div>\",\"PeriodicalId\":100578,\"journal\":{\"name\":\"Geoenergy Science and Engineering\",\"volume\":\"257 \",\"pages\":\"Article 214234\"},\"PeriodicalIF\":4.6000,\"publicationDate\":\"2025-10-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Geoenergy Science and Engineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2949891025005925\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"0\",\"JCRName\":\"ENERGY & FUELS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Geoenergy Science and Engineering","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2949891025005925","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"ENERGY & FUELS","Score":null,"Total":0}
引用次数: 0

摘要

碳捕集与封存(CCS)技术是减少工业排放的关键技术,其有效性与CO2溶解度的预测有关。然而,现有的研究通常局限于模拟特定的水/盐水系统,缺乏实际的工程验证。因此,本研究提出了适用于纯水、单一盐溶液和混合盐溶液的CO2溶解度模型。从已发表的文献中构建包含3383个实验数据条目的数据库。输入变量包括温度、压力和NaCl、KCl、Na2SO4、MgCl2和CaCl2的盐浓度,输出变量是CO2溶解度。为了提高CO2溶解度预测的准确性,我们采用了人工Lemming算法(ALA)、黑翼风筝算法(BKA)和IVY算法(IVYA)三种优化算法,对Light Gradient Boosting machine (LightGBM)和eXtreme Gradient Boosting (XGBoost)两个机器学习(ML)模型进行了微调。结果与立方-加-关联(CPA)状态方程(EoS)结合MHV1混合规则(CPA-MHV1)进行了比较。研究结果表明,在本文考虑的所有模型中,BKA-LightGBM具有较高的预测精度和较低的计算成本,R2 = 0.9930, RMSE = 0.0007, AARD = 7.41%,在预测精度上优于CPA-MHV1模型。此外,SHapley加性解释(SHAP)表明,压力是对模型输出影响最大的输入参数。基于Williams图的杠杆法验证了数据的可靠性,90.63%的样本分布在0.3的杠杆阈值内,有效地减少了异常值的影响。交叉验证和外部验证表明,BKA-LightGBM模型可以有效地应用于CCS工程,例如通过溶解捕获二氧化碳。这些结果表明,BKA-LightGBM模型具有强大的潜力,可以支持高效实用的CCS技术的发展。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Predicting CO2 solubility in water and brines using advanced machine learning models
Carbon capture and storage (CCS) is a crucial technology for reducing industrial emissions, and its effectiveness is related to the prediction of CO2 solubility. However, existing studies are typically limited to modeling specific water/brine systems and lack practical engineering validation. Therefore, this study proposes a CO2 solubility model applicable to pure water, single-salt solutions, and mixed-salt solutions. A database containing 3383 experimental data entries was constructed from the published literature. Input variables include temperature, pressure, and salt concentrations of NaCl, KCl, Na2SO4, MgCl2, and CaCl2, and the output variable is CO2 solubility. To improve the accuracy of CO2 solubility prediction, we apply three optimization algorithms—Artificial Lemming Algorithm (ALA), Black-winged Kite Algorithm (BKA), and IVY Algorithm (IVYA)—to fine-tune two machine learning (ML) models: Light Gradient Boosting Machine (LightGBM) and eXtreme Gradient Boosting (XGBoost). The results were compared with the Cubic-Plus-Association (CPA) equation of state (EoS) combined with the MHV1 mixing rule (CPA-MHV1). Findings indicate that among all the models considered in this paper, BKA-LightGBM stands out for its high accuracy and low computational cost, with R2 = 0.9930, RMSE = 0.0007 and AARD = 7.41 %, outperforming the CPA-MHV1 model in prediction accuracy. In addition, SHapley Additive exPlanations (SHAP) indicated that pressure is the most influential input parameter for model output. The leverage approach based on the Williams plot verified the reliability of the data, with 90.63 % of the samples distributed within a leverage threshold of 0.3, effectively minimizing the influence of outliers. Cross-validation and external validation demonstrated that the BKA-LightGBM model can be effectively applied to CCS engineering, for applications such as CO2 capture via dissolution. These results suggest that the BKA-LightGBM model has strong potential to support the development of efficient and practical CCS technologies.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
1.00
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信