Jiaheng Wang, Nong Li, Xiangyu Huo, Mingli Yang, Li Zhang
{"title":"利用极端梯度提升决策树方法预测页岩层的天然气存储容量","authors":"Jiaheng Wang, Nong Li, Xiangyu Huo, Mingli Yang, Li Zhang","doi":"10.1002/ente.202400377","DOIUrl":null,"url":null,"abstract":"<p>Accurate shale gas reserves estimation is essential for development. Existing machine learning (ML) models for predicting gas isothermal adsorption are limited by small datasets and lack verified generalization. We constructed an “original dataset” containing 2112 data points from 11 measurements on samples from 8 formations in 3 countries to develop ML-based prediction models. Similar to previous ML models, total organic matter, pressure, and temperature are characterized as the three most significant features using the mean impurity method. In contrast to previous ML models, the study reveals that these three features are inadequate to be used to make reasonable predictions for the datasets from the measurements different from those used to train the models. Instead, the extreme gradient boosting decision trees (XGBoost) model with two more features (specific surface area and moisture) exhibits good robustness, generalization, and precision in the prediction of gas isothermal adsorption. Overall, An XGBoost model with optimal input features is developed in this work, which exhibits both good performance in gas adsorption prediction and good potential for the estimation of gas storage in shale gas development.</p>","PeriodicalId":11573,"journal":{"name":"Energy technology","volume":"12 10","pages":""},"PeriodicalIF":3.6000,"publicationDate":"2024-08-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Predicting the Gas Storage Capacity in Shale Formations Using the Extreme Gradient Boosting Decision Trees Method\",\"authors\":\"Jiaheng Wang, Nong Li, Xiangyu Huo, Mingli Yang, Li Zhang\",\"doi\":\"10.1002/ente.202400377\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Accurate shale gas reserves estimation is essential for development. Existing machine learning (ML) models for predicting gas isothermal adsorption are limited by small datasets and lack verified generalization. We constructed an “original dataset” containing 2112 data points from 11 measurements on samples from 8 formations in 3 countries to develop ML-based prediction models. Similar to previous ML models, total organic matter, pressure, and temperature are characterized as the three most significant features using the mean impurity method. In contrast to previous ML models, the study reveals that these three features are inadequate to be used to make reasonable predictions for the datasets from the measurements different from those used to train the models. Instead, the extreme gradient boosting decision trees (XGBoost) model with two more features (specific surface area and moisture) exhibits good robustness, generalization, and precision in the prediction of gas isothermal adsorption. Overall, An XGBoost model with optimal input features is developed in this work, which exhibits both good performance in gas adsorption prediction and good potential for the estimation of gas storage in shale gas development.</p>\",\"PeriodicalId\":11573,\"journal\":{\"name\":\"Energy technology\",\"volume\":\"12 10\",\"pages\":\"\"},\"PeriodicalIF\":3.6000,\"publicationDate\":\"2024-08-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Energy technology\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/ente.202400377\",\"RegionNum\":4,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"ENERGY & FUELS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Energy technology","FirstCategoryId":"5","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/ente.202400377","RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENERGY & FUELS","Score":null,"Total":0}
引用次数: 0
摘要
准确估算页岩气储量对开发至关重要。用于预测天然气等温吸附的现有机器学习(ML)模型受限于数据集较小,且缺乏经过验证的通用性。我们构建了一个 "原始数据集",其中包含来自 3 个国家 8 个地层样本的 11 次测量的 2112 个数据点,用于开发基于 ML 的预测模型。与以往的 ML 模型类似,使用平均杂质法将总有机质、压力和温度作为三个最重要的特征。与以往的 ML 模型不同的是,研究发现这三个特征不足以用来对来自不同于用于训练模型的测量数据集进行合理预测。相反,带有另外两个特征(比表面积和水分)的极端梯度提升决策树(XGBoost)模型在预测气体等温吸附时表现出良好的鲁棒性、泛化和精确性。总之,本研究建立了一个具有最佳输入特征的 XGBoost 模型,该模型在气体吸附预测中表现出了良好的性能,在页岩气开发中的气体储量估算方面也具有良好的潜力。
Predicting the Gas Storage Capacity in Shale Formations Using the Extreme Gradient Boosting Decision Trees Method
Accurate shale gas reserves estimation is essential for development. Existing machine learning (ML) models for predicting gas isothermal adsorption are limited by small datasets and lack verified generalization. We constructed an “original dataset” containing 2112 data points from 11 measurements on samples from 8 formations in 3 countries to develop ML-based prediction models. Similar to previous ML models, total organic matter, pressure, and temperature are characterized as the three most significant features using the mean impurity method. In contrast to previous ML models, the study reveals that these three features are inadequate to be used to make reasonable predictions for the datasets from the measurements different from those used to train the models. Instead, the extreme gradient boosting decision trees (XGBoost) model with two more features (specific surface area and moisture) exhibits good robustness, generalization, and precision in the prediction of gas isothermal adsorption. Overall, An XGBoost model with optimal input features is developed in this work, which exhibits both good performance in gas adsorption prediction and good potential for the estimation of gas storage in shale gas development.
期刊介绍:
Energy Technology provides a forum for researchers and engineers from all relevant disciplines concerned with the generation, conversion, storage, and distribution of energy.
This new journal shall publish articles covering all technical aspects of energy process engineering from different perspectives, e.g.,
new concepts of energy generation and conversion;
design, operation, control, and optimization of processes for energy generation (e.g., carbon capture) and conversion of energy carriers;
improvement of existing processes;
combination of single components to systems for energy generation;
design of systems for energy storage;
production processes of fuels, e.g., hydrogen, electricity, petroleum, biobased fuels;
concepts and design of devices for energy distribution.