{"title":"Data-driven analysis and insights into sorption-induced kerogen deformation in shale","authors":"Baixi Chen , Jian Wu , Yuyao Zhang","doi":"10.1016/j.jgsce.2025.205579","DOIUrl":null,"url":null,"abstract":"<div><div>Kerogen is a nanoporous material present in the organic matrix of shale reservoirs. Its swelling can cause the closure of pores and microfractures in the rock matrix. As kerogen is the main gas production and storage site in shales, this may hinder gas extraction in reservoirs and, in the case of CO<sub>2</sub> injection, leads to decreased reservoir permeability and gas injectivity. This study presents a data-driven model aiming at efficiently predicting sorption-induced kerogen deformation. A diversely featured dataset comprising 306 entries gathered from different sources was utilized for training. Following holdout validation and testing, the optimal data-driven model was developed and identified among four prominent machine learning algorithms: regression tree (RT), support vector machine (SVM), Gaussian process regression (GPR), and artificial neural network (ANN), with multiple configurations. The data-driven model constructed using the GPR algorithm with the exponential kernel function demonstrates exceptional performance (<em>R</em><sup><em>2</em></sup> = 0.999 on the test dataset) as it adapts effectively to the limited dataset size and captures the deformation trend in different scenarios. This data-driven model exhibits greater accuracy and versatility than the two theoretical models derived from poromechanics and surface energy change, as impacted by large parameter uncertainties. Furthermore, the interpretability analysis of the data-driven model reveals that geological conditions, kerogen porosity, and absorbate molecule kinetic diameter play key roles in kerogen deformation. By incorporating additional data involving new kerogen and adsorbate types or geological conditions, this data-driven model can be retrained to address a broader range of shale reservoirs more accurately.</div></div>","PeriodicalId":100568,"journal":{"name":"Gas Science and Engineering","volume":"136 ","pages":"Article 205579"},"PeriodicalIF":0.0000,"publicationDate":"2025-02-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Gas Science and Engineering","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2949908925000433","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"ENERGY & FUELS","Score":null,"Total":0}
引用次数: 0
Abstract
Kerogen is a nanoporous material present in the organic matrix of shale reservoirs. Its swelling can cause the closure of pores and microfractures in the rock matrix. As kerogen is the main gas production and storage site in shales, this may hinder gas extraction in reservoirs and, in the case of CO2 injection, leads to decreased reservoir permeability and gas injectivity. This study presents a data-driven model aiming at efficiently predicting sorption-induced kerogen deformation. A diversely featured dataset comprising 306 entries gathered from different sources was utilized for training. Following holdout validation and testing, the optimal data-driven model was developed and identified among four prominent machine learning algorithms: regression tree (RT), support vector machine (SVM), Gaussian process regression (GPR), and artificial neural network (ANN), with multiple configurations. The data-driven model constructed using the GPR algorithm with the exponential kernel function demonstrates exceptional performance (R2 = 0.999 on the test dataset) as it adapts effectively to the limited dataset size and captures the deformation trend in different scenarios. This data-driven model exhibits greater accuracy and versatility than the two theoretical models derived from poromechanics and surface energy change, as impacted by large parameter uncertainties. Furthermore, the interpretability analysis of the data-driven model reveals that geological conditions, kerogen porosity, and absorbate molecule kinetic diameter play key roles in kerogen deformation. By incorporating additional data involving new kerogen and adsorbate types or geological conditions, this data-driven model can be retrained to address a broader range of shale reservoirs more accurately.