Haoxin Shi , Yanjun Zhang , Yuxiang Cheng , Jixiang Guo , Jianqiao Zheng , Xin Zhang , Yude Lei , Yongjie Ma , Lin Bai
{"title":"A novel machine learning approach for reservoir temperature prediction","authors":"Haoxin Shi , Yanjun Zhang , Yuxiang Cheng , Jixiang Guo , Jianqiao Zheng , Xin Zhang , Yude Lei , Yongjie Ma , Lin Bai","doi":"10.1016/j.geothermics.2024.103204","DOIUrl":null,"url":null,"abstract":"<div><div>Accurately assessing geothermal potential is a significant global challenge, and the development of reservoir temperature prediction models is a key aspect of evaluating this potential. Machine learning modeling serves as an effective tool in this process. However, before modeling, the inability to fully screen complex and nonlinear input features, combined with the insufficiency of datasets, often impacts the predictive accuracy of the models. This study collected hydrochemical test data from 65 groundwater samples in the Guide area of Qinghai Province from 2009 to 2016. To address the issue of missing data, we employed the LRTC-TNN method to supplement the dataset. Subsequently, we conducted correlation analysis on the data features using normalization and Pearson correlation coefficients to identify important features. Based on the processed dataset, we constructed XGBoost and LightGBM models and used 5-fold cross-validation and Bayesian optimization model to select the optimal combination of model parameters. In the modeling analysis, we explored the advantages and disadvantages of both models and evaluated their performance in terms of accuracy, robustness, and generalization capability. The results indicate that the model performs best when 80% of the training data is used. The LRTC-TNN model effectively fills in missing data, achieving an accuracy exceeding 95%. When applying the XGBoost and LightGBM models to the training set, test set, and complete dataset, the XGBoost model consistently yielded significant predictive results, specifically an R² value of 98.09%, a RMSE of 0.546, and a MAE of 0.396. Robustness analysis showed that the XGBoost model is more robust, while feature importance and sensitivity analysis revealed that chloride ions are the key independent variable affecting reservoir temperature predictions. Furthermore, generalization capability validation indicated that the model can adapt well to different datasets and provide accurate predictive results. In conclusion, the XGBoost model, which considers complementary data, demonstrates excellent generality in reservoir temperature prediction, providing a reliable solution for accurately determining underground reservoir temperatures.</div></div>","PeriodicalId":55095,"journal":{"name":"Geothermics","volume":"125 ","pages":"Article 103204"},"PeriodicalIF":3.5000,"publicationDate":"2024-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Geothermics","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0375650524002906","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENERGY & FUELS","Score":null,"Total":0}
引用次数: 0
Abstract
Accurately assessing geothermal potential is a significant global challenge, and the development of reservoir temperature prediction models is a key aspect of evaluating this potential. Machine learning modeling serves as an effective tool in this process. However, before modeling, the inability to fully screen complex and nonlinear input features, combined with the insufficiency of datasets, often impacts the predictive accuracy of the models. This study collected hydrochemical test data from 65 groundwater samples in the Guide area of Qinghai Province from 2009 to 2016. To address the issue of missing data, we employed the LRTC-TNN method to supplement the dataset. Subsequently, we conducted correlation analysis on the data features using normalization and Pearson correlation coefficients to identify important features. Based on the processed dataset, we constructed XGBoost and LightGBM models and used 5-fold cross-validation and Bayesian optimization model to select the optimal combination of model parameters. In the modeling analysis, we explored the advantages and disadvantages of both models and evaluated their performance in terms of accuracy, robustness, and generalization capability. The results indicate that the model performs best when 80% of the training data is used. The LRTC-TNN model effectively fills in missing data, achieving an accuracy exceeding 95%. When applying the XGBoost and LightGBM models to the training set, test set, and complete dataset, the XGBoost model consistently yielded significant predictive results, specifically an R² value of 98.09%, a RMSE of 0.546, and a MAE of 0.396. Robustness analysis showed that the XGBoost model is more robust, while feature importance and sensitivity analysis revealed that chloride ions are the key independent variable affecting reservoir temperature predictions. Furthermore, generalization capability validation indicated that the model can adapt well to different datasets and provide accurate predictive results. In conclusion, the XGBoost model, which considers complementary data, demonstrates excellent generality in reservoir temperature prediction, providing a reliable solution for accurately determining underground reservoir temperatures.
期刊介绍:
Geothermics is an international journal devoted to the research and development of geothermal energy. The International Board of Editors of Geothermics, which comprises specialists in the various aspects of geothermal resources, exploration and development, guarantees the balanced, comprehensive view of scientific and technological developments in this promising energy field.
It promulgates the state of the art and science of geothermal energy, its exploration and exploitation through a regular exchange of information from all parts of the world. The journal publishes articles dealing with the theory, exploration techniques and all aspects of the utilization of geothermal resources. Geothermics serves as the scientific house, or exchange medium, through which the growing community of geothermal specialists can provide and receive information.