Iongel Duran-Llacer , Víctor Gómez-Escalonilla Canales , Marcelo Aliaga-Alvarado , José Luis Arumí , Francisco Zambrano , Lien Rodríguez-López , Rebeca Martínez-Retureta , Pedro Martínez-Santos
{"title":"Approach to mapping groundwater-dependent ecosystems through machine learning in central Chile","authors":"Iongel Duran-Llacer , Víctor Gómez-Escalonilla Canales , Marcelo Aliaga-Alvarado , José Luis Arumí , Francisco Zambrano , Lien Rodríguez-López , Rebeca Martínez-Retureta , Pedro Martínez-Santos","doi":"10.1016/j.gsd.2025.101526","DOIUrl":null,"url":null,"abstract":"<div><div>Groundwater depletion can significantly impact the ecological integrity of groundwater-dependent ecosystems (GDEs). Identifying and mapping these ecosystems is essential for their effective management and conservation. This study presents a new probabilistic approach that uses machine learning techniques to predict the presence of GDEs zones in the Ligua and Petorca basins, central Chile. A comprehensive set of 21 spatially distributed explanatory variables related to GDEs occurrence was compiled. These include geology, topography, climate, and satellite-based indices. Using a dataset of 3067 GDEs presence/absence points, 16 supervised classification algorithms were trained and evaluated with randomly selected subsets containing 100 %, 75 %, 50 %, and 25 % of the original dataset. This analysis involved collinearity assessment, cross-validation, feature selection, and hyperparameter tuning. Tree-based ensemble models, including Random Forest (RFC), AdaBoost (ABC), Gradient Boosting (GBC), and ExtraTrees (ETC), consistently outperformed other classifiers. In all subsets, regardless of the number of samples included, the models obtained raw scores above 0.90 for metrics such as test score, F1 score and the area under the curve (AUC), with key predictor variables identified as distance to rivers, rainfall, and land use/land cover. The models show high predictive performance consistently exceeding 0.95 on the above metrics. The resulting GDEs map manages to identify areas with a high probability of GDEs presence, clearly differentiating these ecosystems from adjacent agricultural areas. This study provides a robust methodological framework for GDEs mapping and serves as a valuable tool to manage and protect groundwater and GDEs in arid and semi-arid environments.</div></div>","PeriodicalId":37879,"journal":{"name":"Groundwater for Sustainable Development","volume":"31 ","pages":"Article 101526"},"PeriodicalIF":4.9000,"publicationDate":"2025-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Groundwater for Sustainable Development","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2352801X25001237","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ENVIRONMENTAL","Score":null,"Total":0}
引用次数: 0
Abstract
Groundwater depletion can significantly impact the ecological integrity of groundwater-dependent ecosystems (GDEs). Identifying and mapping these ecosystems is essential for their effective management and conservation. This study presents a new probabilistic approach that uses machine learning techniques to predict the presence of GDEs zones in the Ligua and Petorca basins, central Chile. A comprehensive set of 21 spatially distributed explanatory variables related to GDEs occurrence was compiled. These include geology, topography, climate, and satellite-based indices. Using a dataset of 3067 GDEs presence/absence points, 16 supervised classification algorithms were trained and evaluated with randomly selected subsets containing 100 %, 75 %, 50 %, and 25 % of the original dataset. This analysis involved collinearity assessment, cross-validation, feature selection, and hyperparameter tuning. Tree-based ensemble models, including Random Forest (RFC), AdaBoost (ABC), Gradient Boosting (GBC), and ExtraTrees (ETC), consistently outperformed other classifiers. In all subsets, regardless of the number of samples included, the models obtained raw scores above 0.90 for metrics such as test score, F1 score and the area under the curve (AUC), with key predictor variables identified as distance to rivers, rainfall, and land use/land cover. The models show high predictive performance consistently exceeding 0.95 on the above metrics. The resulting GDEs map manages to identify areas with a high probability of GDEs presence, clearly differentiating these ecosystems from adjacent agricultural areas. This study provides a robust methodological framework for GDEs mapping and serves as a valuable tool to manage and protect groundwater and GDEs in arid and semi-arid environments.
期刊介绍:
Groundwater for Sustainable Development is directed to different stakeholders and professionals, including government and non-governmental organizations, international funding agencies, universities, public water institutions, public health and other public/private sector professionals, and other relevant institutions. It is aimed at professionals, academics and students in the fields of disciplines such as: groundwater and its connection to surface hydrology and environment, soil sciences, engineering, ecology, microbiology, atmospheric sciences, analytical chemistry, hydro-engineering, water technology, environmental ethics, economics, public health, policy, as well as social sciences, legal disciplines, or any other area connected with water issues. The objectives of this journal are to facilitate: • The improvement of effective and sustainable management of water resources across the globe. • The improvement of human access to groundwater resources in adequate quantity and good quality. • The meeting of the increasing demand for drinking and irrigation water needed for food security to contribute to a social and economically sound human development. • The creation of a global inter- and multidisciplinary platform and forum to improve our understanding of groundwater resources and to advocate their effective and sustainable management and protection against contamination. • Interdisciplinary information exchange and to stimulate scientific research in the fields of groundwater related sciences and social and health sciences required to achieve the United Nations Millennium Development Goals for sustainable development.