阿瓦什河上游流域(埃塞俄比亚)地下水质量对土地覆盖和岩性的预测响应。

IF 8.4 2区 环境科学与生态学 Q1 ENVIRONMENTAL SCIENCES
Bewuket B Tefera, Jane Southworth, Joann Mossa, Mashoukur Rahaman, Mohammad Safaei, Di Yang, Shankar Karuppannan
{"title":"阿瓦什河上游流域(埃塞俄比亚)地下水质量对土地覆盖和岩性的预测响应。","authors":"Bewuket B Tefera, Jane Southworth, Joann Mossa, Mashoukur Rahaman, Mohammad Safaei, Di Yang, Shankar Karuppannan","doi":"10.1016/j.jenvman.2025.127572","DOIUrl":null,"url":null,"abstract":"<p><p>Groundwater resources are vital for human and environmental needs, especially in humid and semi-arid regions. Conventional groundwater quality models, including statistical and single-algorithm machine learning techniques, often lack accuracy, interpretability, and scalability. This study presents an advanced ensemble machine learning framework for assessing groundwater quality in Ethiopia's Upper Awash River Basin, Africa. The Entropy Weighted Water Quality Index (EWQI) consolidates 13 hydrochemical parameters, including electrical conductivity, total dissolved solids, pH, and major ions. Data preprocessing involved imputation, standardization, and partitioning into training sets (70 %) and testing sets (30 %). Predictors include elevation, slope, land cover, lithology, and soil characteristics (type, moisture, and temperature). A novel stacking ensemble model was developed using Random Forest, Gradient Boosting, Support Vector Regression, K-Nearest Neighbors, and EXtreme Gradient Boosting. The stacking model outperformed individual models, achieving training metrics of MSE 17.96, RMSE 4.24, and R<sup>2</sup> 0.97, as well as testing metrics of MSE 76.29, RMSE 8.73, and R<sup>2</sup> 0.87. The validation results showed an MSE of 67.18, an RMSE of 8.2, and an R<sup>2</sup> of 0.89. Beyond accuracy, SHAP interpretation shows that soil temperature, land cover, and soil moisture are the dominant drivers of EWQI, exceeding terrain and lithologic controls. By coupling an objective EWQI target with broadly available covariates and an interpretable stacked ensemble, the study links prediction to actionable land and water management in a data-scarce basin and outlines a transferable workflow.</p>","PeriodicalId":356,"journal":{"name":"Journal of Environmental Management","volume":"394 ","pages":"127572"},"PeriodicalIF":8.4000,"publicationDate":"2025-10-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Predictive groundwater quality responses to land cover and lithology in the upper Awash River basin (Ethiopia) with stacking ensembles.\",\"authors\":\"Bewuket B Tefera, Jane Southworth, Joann Mossa, Mashoukur Rahaman, Mohammad Safaei, Di Yang, Shankar Karuppannan\",\"doi\":\"10.1016/j.jenvman.2025.127572\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Groundwater resources are vital for human and environmental needs, especially in humid and semi-arid regions. Conventional groundwater quality models, including statistical and single-algorithm machine learning techniques, often lack accuracy, interpretability, and scalability. This study presents an advanced ensemble machine learning framework for assessing groundwater quality in Ethiopia's Upper Awash River Basin, Africa. The Entropy Weighted Water Quality Index (EWQI) consolidates 13 hydrochemical parameters, including electrical conductivity, total dissolved solids, pH, and major ions. Data preprocessing involved imputation, standardization, and partitioning into training sets (70 %) and testing sets (30 %). Predictors include elevation, slope, land cover, lithology, and soil characteristics (type, moisture, and temperature). A novel stacking ensemble model was developed using Random Forest, Gradient Boosting, Support Vector Regression, K-Nearest Neighbors, and EXtreme Gradient Boosting. The stacking model outperformed individual models, achieving training metrics of MSE 17.96, RMSE 4.24, and R<sup>2</sup> 0.97, as well as testing metrics of MSE 76.29, RMSE 8.73, and R<sup>2</sup> 0.87. The validation results showed an MSE of 67.18, an RMSE of 8.2, and an R<sup>2</sup> of 0.89. Beyond accuracy, SHAP interpretation shows that soil temperature, land cover, and soil moisture are the dominant drivers of EWQI, exceeding terrain and lithologic controls. By coupling an objective EWQI target with broadly available covariates and an interpretable stacked ensemble, the study links prediction to actionable land and water management in a data-scarce basin and outlines a transferable workflow.</p>\",\"PeriodicalId\":356,\"journal\":{\"name\":\"Journal of Environmental Management\",\"volume\":\"394 \",\"pages\":\"127572\"},\"PeriodicalIF\":8.4000,\"publicationDate\":\"2025-10-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Environmental Management\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://doi.org/10.1016/j.jenvman.2025.127572\",\"RegionNum\":2,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENVIRONMENTAL SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Environmental Management","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.1016/j.jenvman.2025.127572","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0

摘要

地下水资源对人类和环境需求至关重要,特别是在潮湿和半干旱地区。传统的地下水质量模型,包括统计和单算法机器学习技术,往往缺乏准确性、可解释性和可扩展性。本研究提出了一个先进的集成机器学习框架,用于评估非洲埃塞俄比亚上阿瓦什河流域的地下水质量。熵加权水质指数(EWQI)综合了13个水化学参数,包括电导率、总溶解固体、pH和主要离子。数据预处理包括输入、标准化和划分为训练集(70%)和测试集(30%)。预测因素包括海拔、坡度、土地覆盖、岩性和土壤特征(类型、湿度和温度)。利用随机森林、梯度增强、支持向量回归、k近邻和极端梯度增强等方法建立了一种新的叠加集成模型。叠加模型优于单个模型,训练指标MSE为17.96,RMSE为4.24,R2为0.97;检验指标MSE为76.29,RMSE为8.73,R2为0.87。验证结果显示,MSE为67.18,RMSE为8.2,R2为0.89。除了精度之外,SHAP解释还表明,土壤温度、土地覆盖和土壤湿度是EWQI的主要驱动因素,超过了地形和岩性的控制。通过将客观的EWQI目标与广泛可用的协变量和可解释的堆栈集合相结合,该研究将数据稀缺盆地的预测与可操作的土地和水管理联系起来,并概述了可转移的工作流程。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Predictive groundwater quality responses to land cover and lithology in the upper Awash River basin (Ethiopia) with stacking ensembles.

Groundwater resources are vital for human and environmental needs, especially in humid and semi-arid regions. Conventional groundwater quality models, including statistical and single-algorithm machine learning techniques, often lack accuracy, interpretability, and scalability. This study presents an advanced ensemble machine learning framework for assessing groundwater quality in Ethiopia's Upper Awash River Basin, Africa. The Entropy Weighted Water Quality Index (EWQI) consolidates 13 hydrochemical parameters, including electrical conductivity, total dissolved solids, pH, and major ions. Data preprocessing involved imputation, standardization, and partitioning into training sets (70 %) and testing sets (30 %). Predictors include elevation, slope, land cover, lithology, and soil characteristics (type, moisture, and temperature). A novel stacking ensemble model was developed using Random Forest, Gradient Boosting, Support Vector Regression, K-Nearest Neighbors, and EXtreme Gradient Boosting. The stacking model outperformed individual models, achieving training metrics of MSE 17.96, RMSE 4.24, and R2 0.97, as well as testing metrics of MSE 76.29, RMSE 8.73, and R2 0.87. The validation results showed an MSE of 67.18, an RMSE of 8.2, and an R2 of 0.89. Beyond accuracy, SHAP interpretation shows that soil temperature, land cover, and soil moisture are the dominant drivers of EWQI, exceeding terrain and lithologic controls. By coupling an objective EWQI target with broadly available covariates and an interpretable stacked ensemble, the study links prediction to actionable land and water management in a data-scarce basin and outlines a transferable workflow.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Journal of Environmental Management
Journal of Environmental Management 环境科学-环境科学
CiteScore
13.70
自引率
5.70%
发文量
2477
审稿时长
84 days
期刊介绍: The Journal of Environmental Management is a journal for the publication of peer reviewed, original research for all aspects of management and the managed use of the environment, both natural and man-made.Critical review articles are also welcome; submission of these is strongly encouraged.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信