利用机器学习方法进行孔隙压力预测，测井数据采用高斯混合聚类模型

IF 4.6 0 ENERGY & FUELS

Geoenergy Science and Engineering Pub Date : 2025-09-13 DOI:10.1016/j.geoen.2025.214188

Jiajia Gao , Weidong Yang , Fuzhi Chen , Long Chang , Hai Lin , Yutian Feng , Gengchen Bian , Hengyi Jiang

{"title":"利用机器学习方法进行孔隙压力预测，测井数据采用高斯混合聚类模型","authors":"Jiajia Gao , Weidong Yang , Fuzhi Chen , Long Chang , Hai Lin , Yutian Feng , Gengchen Bian , Hengyi Jiang","doi":"10.1016/j.geoen.2025.214188","DOIUrl":null,"url":null,"abstract":"<div><div>This work proposes an innovative framework that combines intelligent clustering and machine learning methods to address the limitations of insufficient accuracy and complex operation in traditional pore pressure prediction methods. Firstly, the Gaussian mixture clustering model (GMCM) automatically identifies normal compaction clusters and eliminates the subjective error associated with manual division. Secondly, based on the clustering results and the CPO algorithm, the empirical coefficient of the Eaton method is dynamically optimized to generate high-precision pore pressure samples. Finally, the training set is constructed by integrating the preferred samples and logging data. The performances of the four machine learning models, including LSTM, XGBoost, SVR, and CNN-BiLSTM, are systematically evaluated using the ReliefF feature analysis. The empirical study demonstrates that the GMCM-CPO-Eaton method significantly enhances prediction accuracy in formations with abnormally high pressures. The errors of the AC-Eaton method and the RT-Eaton method are reduced by 11.7 % and 89.8 %, respectively. The predicted pore pressure curve highly matches the measured point height. The comprehensive performance of the models is in descending order: XGBoost, CNN-BiLSTM, LSTM, and SVR. The XGBoost model is susceptible to overfitting, which decreases its generalization ability, as evidenced by the MSE of the verification wells being above 0.12. The CNN-BiLSTM exhibits excellent stability, with its performance least affected by variations in data quantity and characteristic fluctuations. The determination coefficient R<sup>2</sup> remains stable above 0.95 in large samples and above 0.8 in small samples, performing well in adjacent well prediction with MSE values of 0.1005 and 0.0971 for the two studied wells. This further indicates that the CNN-BiSLTM model exhibits high generalization in predicting pore pressure.</div></div>","PeriodicalId":100578,"journal":{"name":"Geoenergy Science and Engineering","volume":"257 ","pages":"Article 214188"},"PeriodicalIF":4.6000,"publicationDate":"2025-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Pore pressure prediction using machine learning methods and logging data considering Gaussian mixture clustering model\",\"authors\":\"Jiajia Gao , Weidong Yang , Fuzhi Chen , Long Chang , Hai Lin , Yutian Feng , Gengchen Bian , Hengyi Jiang\",\"doi\":\"10.1016/j.geoen.2025.214188\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>This work proposes an innovative framework that combines intelligent clustering and machine learning methods to address the limitations of insufficient accuracy and complex operation in traditional pore pressure prediction methods. Firstly, the Gaussian mixture clustering model (GMCM) automatically identifies normal compaction clusters and eliminates the subjective error associated with manual division. Secondly, based on the clustering results and the CPO algorithm, the empirical coefficient of the Eaton method is dynamically optimized to generate high-precision pore pressure samples. Finally, the training set is constructed by integrating the preferred samples and logging data. The performances of the four machine learning models, including LSTM, XGBoost, SVR, and CNN-BiLSTM, are systematically evaluated using the ReliefF feature analysis. The empirical study demonstrates that the GMCM-CPO-Eaton method significantly enhances prediction accuracy in formations with abnormally high pressures. The errors of the AC-Eaton method and the RT-Eaton method are reduced by 11.7 % and 89.8 %, respectively. The predicted pore pressure curve highly matches the measured point height. The comprehensive performance of the models is in descending order: XGBoost, CNN-BiLSTM, LSTM, and SVR. The XGBoost model is susceptible to overfitting, which decreases its generalization ability, as evidenced by the MSE of the verification wells being above 0.12. The CNN-BiLSTM exhibits excellent stability, with its performance least affected by variations in data quantity and characteristic fluctuations. The determination coefficient R<sup>2</sup> remains stable above 0.95 in large samples and above 0.8 in small samples, performing well in adjacent well prediction with MSE values of 0.1005 and 0.0971 for the two studied wells. This further indicates that the CNN-BiSLTM model exhibits high generalization in predicting pore pressure.</div></div>\",\"PeriodicalId\":100578,\"journal\":{\"name\":\"Geoenergy Science and Engineering\",\"volume\":\"257 \",\"pages\":\"Article 214188\"},\"PeriodicalIF\":4.6000,\"publicationDate\":\"2025-09-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Geoenergy Science and Engineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2949891025005469\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"0\",\"JCRName\":\"ENERGY & FUELS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Geoenergy Science and Engineering","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2949891025005469","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"ENERGY & FUELS","Score":null,"Total":0}

引用次数: 0

摘要

本工作提出了一种结合智能聚类和机器学习方法的创新框架，以解决传统孔隙压力预测方法精度不足和操作复杂的局限性。首先，高斯混合聚类模型（GMCM）自动识别正常压缩聚类，消除人工分割带来的主观误差；其次，基于聚类结果和CPO算法，对Eaton方法的经验系数进行动态优化，生成高精度孔隙压力样本；最后，将优选样本与测井数据相结合，构造训练集。利用ReliefF特征分析对LSTM、XGBoost、SVR和CNN-BiLSTM四种机器学习模型的性能进行了系统评价。实证研究表明，GMCM-CPO-Eaton方法显著提高了异常高压地层的预测精度。AC-Eaton法和RT-Eaton法的误差分别降低了11.7%和89.8%。预测的孔隙压力曲线与实测点高度高度吻合。模型综合性能由高到低依次为：XGBoost、CNN-BiLSTM、LSTM、SVR。验证井的MSE均在0.12以上，证明了XGBoost模型容易出现过拟合，降低了模型的泛化能力。CNN-BiLSTM具有良好的稳定性，其性能受数据量变化和特征波动的影响最小。确定系数R2在大样本下稳定在0.95以上，在小样本下稳定在0.8以上，对邻井预测效果良好，两口井的MSE分别为0.1005和0.0971。这进一步表明CNN-BiSLTM模型在预测孔隙压力方面具有较高的通用性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Pore pressure prediction using machine learning methods and logging data considering Gaussian mixture clustering model

This work proposes an innovative framework that combines intelligent clustering and machine learning methods to address the limitations of insufficient accuracy and complex operation in traditional pore pressure prediction methods. Firstly, the Gaussian mixture clustering model (GMCM) automatically identifies normal compaction clusters and eliminates the subjective error associated with manual division. Secondly, based on the clustering results and the CPO algorithm, the empirical coefficient of the Eaton method is dynamically optimized to generate high-precision pore pressure samples. Finally, the training set is constructed by integrating the preferred samples and logging data. The performances of the four machine learning models, including LSTM, XGBoost, SVR, and CNN-BiLSTM, are systematically evaluated using the ReliefF feature analysis. The empirical study demonstrates that the GMCM-CPO-Eaton method significantly enhances prediction accuracy in formations with abnormally high pressures. The errors of the AC-Eaton method and the RT-Eaton method are reduced by 11.7 % and 89.8 %, respectively. The predicted pore pressure curve highly matches the measured point height. The comprehensive performance of the models is in descending order: XGBoost, CNN-BiLSTM, LSTM, and SVR. The XGBoost model is susceptible to overfitting, which decreases its generalization ability, as evidenced by the MSE of the verification wells being above 0.12. The CNN-BiLSTM exhibits excellent stability, with its performance least affected by variations in data quantity and characteristic fluctuations. The determination coefficient R² remains stable above 0.95 in large samples and above 0.8 in small samples, performing well in adjacent well prediction with MSE values of 0.1005 and 0.0971 for the two studied wells. This further indicates that the CNN-BiSLTM model exhibits high generalization in predicting pore pressure.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Geoenergy Science and Engineering

CiteScore

1.00

自引率

0.00%

发文量