Shixuan Cheng , Kai Zhang , Yihao Li , Zhangxin Chen
{"title":"Real-time prediction and inverse design of multiphase CO2 trapping in deep saline aquifers using machine learning enhanced by SHAP analysis","authors":"Shixuan Cheng , Kai Zhang , Yihao Li , Zhangxin Chen","doi":"10.1016/j.jgsce.2025.205782","DOIUrl":null,"url":null,"abstract":"<div><div>Accurate forecasting of multiphase CO<sub>2</sub> behavior is critical for safe large-scale deployment of carbon capture and storage projects. A total of 9835 high fidelity CMG GEM simulations, covering geological and operational uncertainties in deep saline aquifers, were generated and subsequently used to benchmark thirteen c algorithms for the concurrent prediction of CO<sub>2</sub> trapping mechanisms. Attention based tabular networks TabTransformerLite and SAINTLite together with the decision ensemble NODELite outperformed linear, support vector, tree ensemble, and fully connected neural network baselines. TabTransformerLite secured the best performance with nRMSE of 0.033, R<sup>2</sup> of 0.92, and minimal fold to fold variability. Shapley value analysis placed depth and permeability as primary controls on dissolved, residual, and mineral precipitated CO<sub>2</sub> compared to the injection rate and porosity. An inverse design workflow used the trained model to identify operating windows that lift mineral trapping above the ninetieth percentile, indicating optimal depth near 2.25 km, permeability above 1.6 Darcy, and injection rates between 7.5 and 14.7 × 10<sup>3</sup> m<sup>3</sup>/day. The resulting framework connects model predictions to geographic information systems for site screening and to real time digital twin optimization, providing a scalable millisecond speed alternative to physics-based simulation for CO<sub>2</sub> storage planning and monitoring.</div></div>","PeriodicalId":100568,"journal":{"name":"Gas Science and Engineering","volume":"145 ","pages":"Article 205782"},"PeriodicalIF":5.5000,"publicationDate":"2025-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Gas Science and Engineering","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2949908925002468","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"ENERGY & FUELS","Score":null,"Total":0}
引用次数: 0
Abstract
Accurate forecasting of multiphase CO2 behavior is critical for safe large-scale deployment of carbon capture and storage projects. A total of 9835 high fidelity CMG GEM simulations, covering geological and operational uncertainties in deep saline aquifers, were generated and subsequently used to benchmark thirteen c algorithms for the concurrent prediction of CO2 trapping mechanisms. Attention based tabular networks TabTransformerLite and SAINTLite together with the decision ensemble NODELite outperformed linear, support vector, tree ensemble, and fully connected neural network baselines. TabTransformerLite secured the best performance with nRMSE of 0.033, R2 of 0.92, and minimal fold to fold variability. Shapley value analysis placed depth and permeability as primary controls on dissolved, residual, and mineral precipitated CO2 compared to the injection rate and porosity. An inverse design workflow used the trained model to identify operating windows that lift mineral trapping above the ninetieth percentile, indicating optimal depth near 2.25 km, permeability above 1.6 Darcy, and injection rates between 7.5 and 14.7 × 103 m3/day. The resulting framework connects model predictions to geographic information systems for site screening and to real time digital twin optimization, providing a scalable millisecond speed alternative to physics-based simulation for CO2 storage planning and monitoring.