{"title":"A sparrow search algorithm optimized GAN-stacking model for the evaluation of geothermal resource potential assessment","authors":"Haibin Li , Yang Yang , Qiang Xu","doi":"10.1016/j.geothermics.2025.103398","DOIUrl":null,"url":null,"abstract":"<div><div>Currently, in the field of geothermal resource potential prediction, smaller datasets and complexity of geothermal energy formation are the problems that should be effectively addressed. In this work, a GAN-stacking model optimized using sparrow search algorithm (SSA) is constructed, based on artificial neural network (ANN), random forest (RF), support vector machine (SVM), XGBoost, Catboost model, and GIS platform, to predict and evaluate the potential of geothermal resources in Sichuan Province, China. The features include distance from water systems, distance from fracture zones, distance from mountain ranges, seismic kernel density, geothermal heat flow, magnetic anomaly data, distance from igneous rocks, and gravitational gradient anomalies. The receiver operating characteristic curve (ROC) is used for validation. The AUROC values for ANN, RF, SVM, XGBoost, and Catboost are 0.785, 0.834, 0.787, 0.811, and 0.818, respectively, and the AUROC values after enhancing the training set using GAN are 0.793, 0.853, 0.791, 0.840, and 0.839, respectively. The accuracies of all models improve when using extra samples. In addition, the SHAP algorithm is added to analyze the models. The SHAP values of the vast majority of the samples are higher than the SHAP values of the five machine learning models compared in this work, except for the individual extra samples. Subsequently, the sparrow search algorithm-optimized GAN-stacking model is constructed for prediction, and its obtained AUROC value is 0.95. This study shows that the use of GAN to generate extra samples improves the accuracy of geothermal potential prediction models to a certain extent, and the extra samples generated by GANS have a certain degree of confidence. The GAN-stacking model constructed in this study has high accuracy in this study area, and can provide a decision-making basis for the exploration and development of geothermal resources.</div></div>","PeriodicalId":55095,"journal":{"name":"Geothermics","volume":"131 ","pages":"Article 103398"},"PeriodicalIF":3.5000,"publicationDate":"2025-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Geothermics","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S037565052500149X","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENERGY & FUELS","Score":null,"Total":0}
引用次数: 0
Abstract
Currently, in the field of geothermal resource potential prediction, smaller datasets and complexity of geothermal energy formation are the problems that should be effectively addressed. In this work, a GAN-stacking model optimized using sparrow search algorithm (SSA) is constructed, based on artificial neural network (ANN), random forest (RF), support vector machine (SVM), XGBoost, Catboost model, and GIS platform, to predict and evaluate the potential of geothermal resources in Sichuan Province, China. The features include distance from water systems, distance from fracture zones, distance from mountain ranges, seismic kernel density, geothermal heat flow, magnetic anomaly data, distance from igneous rocks, and gravitational gradient anomalies. The receiver operating characteristic curve (ROC) is used for validation. The AUROC values for ANN, RF, SVM, XGBoost, and Catboost are 0.785, 0.834, 0.787, 0.811, and 0.818, respectively, and the AUROC values after enhancing the training set using GAN are 0.793, 0.853, 0.791, 0.840, and 0.839, respectively. The accuracies of all models improve when using extra samples. In addition, the SHAP algorithm is added to analyze the models. The SHAP values of the vast majority of the samples are higher than the SHAP values of the five machine learning models compared in this work, except for the individual extra samples. Subsequently, the sparrow search algorithm-optimized GAN-stacking model is constructed for prediction, and its obtained AUROC value is 0.95. This study shows that the use of GAN to generate extra samples improves the accuracy of geothermal potential prediction models to a certain extent, and the extra samples generated by GANS have a certain degree of confidence. The GAN-stacking model constructed in this study has high accuracy in this study area, and can provide a decision-making basis for the exploration and development of geothermal resources.
期刊介绍:
Geothermics is an international journal devoted to the research and development of geothermal energy. The International Board of Editors of Geothermics, which comprises specialists in the various aspects of geothermal resources, exploration and development, guarantees the balanced, comprehensive view of scientific and technological developments in this promising energy field.
It promulgates the state of the art and science of geothermal energy, its exploration and exploitation through a regular exchange of information from all parts of the world. The journal publishes articles dealing with the theory, exploration techniques and all aspects of the utilization of geothermal resources. Geothermics serves as the scientific house, or exchange medium, through which the growing community of geothermal specialists can provide and receive information.