Performance of tree-based ensemble techniques in predicting groundwater quality for irrigation purposes

IF 2.8 4区 环境科学与生态学 Q3 ENVIRONMENTAL SCIENCES
Anas El Ouali, Kayhan Bayhan, Rachid Mohamed Mouhoumed, Pınar Spor, Cemre Sude Atan, Eyyup Ensar Başakın, Ömer Ekmekcioğlu
{"title":"Performance of tree-based ensemble techniques in predicting groundwater quality for irrigation purposes","authors":"Anas El Ouali,&nbsp;Kayhan Bayhan,&nbsp;Rachid Mohamed Mouhoumed,&nbsp;Pınar Spor,&nbsp;Cemre Sude Atan,&nbsp;Eyyup Ensar Başakın,&nbsp;Ömer Ekmekcioğlu","doi":"10.1007/s12665-025-12469-w","DOIUrl":null,"url":null,"abstract":"<div><p>This study evaluates the performance of eight different machine learning (ML) methods to predict the Irrigation Water Quality Index (IWQI), an important metric for assessing groundwater quality for agricultural purposes. The study domain was selected as the Saïss Plain in northern Morocco as the region stands out as an area with intense agricultural activities where groundwater quality is of critical importance for irrigation. Groundwater quality is affected by natural factors such as salinity and ion concentrations, as well as anthropogenic activities such as agricultural and industrial practices. Among eight ML approaches, the XGBoost model outperformed its counterparts, including other tree-based ML algorithms and benchmarking models, and yielded the highest prediction accuracy with Nash Sutcliffe Efficiency (NSE) index of 0.963 and 0.892 for training and testing sets, respectively. Other tree-based models such as Random Forest, AdaBoost, and Extra Trees also showed strong performance, while benchmarking models such as ANN, KNN, and SVR were less effective due to the size and non-linear nature of the dataset. The analysis revealed that chloride (Cl⁻) and sodium (Na+) ions are the most critical factors in IWQI estimations. This study highlights the importance of robust ML models in groundwater quality management and provides insights to guide future research for sustainable irrigation practices.</p></div>","PeriodicalId":542,"journal":{"name":"Environmental Earth Sciences","volume":"84 16","pages":""},"PeriodicalIF":2.8000,"publicationDate":"2025-08-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental Earth Sciences","FirstCategoryId":"93","ListUrlMain":"https://link.springer.com/article/10.1007/s12665-025-12469-w","RegionNum":4,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

This study evaluates the performance of eight different machine learning (ML) methods to predict the Irrigation Water Quality Index (IWQI), an important metric for assessing groundwater quality for agricultural purposes. The study domain was selected as the Saïss Plain in northern Morocco as the region stands out as an area with intense agricultural activities where groundwater quality is of critical importance for irrigation. Groundwater quality is affected by natural factors such as salinity and ion concentrations, as well as anthropogenic activities such as agricultural and industrial practices. Among eight ML approaches, the XGBoost model outperformed its counterparts, including other tree-based ML algorithms and benchmarking models, and yielded the highest prediction accuracy with Nash Sutcliffe Efficiency (NSE) index of 0.963 and 0.892 for training and testing sets, respectively. Other tree-based models such as Random Forest, AdaBoost, and Extra Trees also showed strong performance, while benchmarking models such as ANN, KNN, and SVR were less effective due to the size and non-linear nature of the dataset. The analysis revealed that chloride (Cl⁻) and sodium (Na+) ions are the most critical factors in IWQI estimations. This study highlights the importance of robust ML models in groundwater quality management and provides insights to guide future research for sustainable irrigation practices.

基于树木的集合技术在灌溉用地下水质量预测中的应用
本研究评估了八种不同的机器学习(ML)方法来预测灌溉水质指数(IWQI)的性能,IWQI是评估用于农业目的的地下水质量的重要指标。研究领域被选为摩洛哥北部的Saïss平原,因为该地区是农业活动密集的地区,地下水质量对灌溉至关重要。地下水质量受到盐度和离子浓度等自然因素以及农业和工业实践等人为活动的影响。在8种机器学习方法中,XGBoost模型的表现优于其他基于树的机器学习算法和基准模型,在训练集和测试集上的纳什萨克利夫效率(NSE)指数分别为0.963和0.892,预测精度最高。其他基于树的模型,如Random Forest、AdaBoost和Extra Trees也表现出了很强的性能,而基准模型,如ANN、KNN和SVR,由于数据集的大小和非线性性质,效果较差。分析显示氯(Cl -毒血症)和钠(Na+)离子是IWQI估计中最关键的因素。本研究强调了强大的ML模型在地下水质量管理中的重要性,并为指导未来可持续灌溉实践的研究提供了见解。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Environmental Earth Sciences
Environmental Earth Sciences 环境科学-地球科学综合
CiteScore
5.10
自引率
3.60%
发文量
494
审稿时长
8.3 months
期刊介绍: Environmental Earth Sciences is an international multidisciplinary journal concerned with all aspects of interaction between humans, natural resources, ecosystems, special climates or unique geographic zones, and the earth: Water and soil contamination caused by waste management and disposal practices Environmental problems associated with transportation by land, air, or water Geological processes that may impact biosystems or humans Man-made or naturally occurring geological or hydrological hazards Environmental problems associated with the recovery of materials from the earth Environmental problems caused by extraction of minerals, coal, and ores, as well as oil and gas, water and alternative energy sources Environmental impacts of exploration and recultivation – Environmental impacts of hazardous materials Management of environmental data and information in data banks and information systems Dissemination of knowledge on techniques, methods, approaches and experiences to improve and remediate the environment In pursuit of these topics, the geoscientific disciplines are invited to contribute their knowledge and experience. Major disciplines include: hydrogeology, hydrochemistry, geochemistry, geophysics, engineering geology, remediation science, natural resources management, environmental climatology and biota, environmental geography, soil science and geomicrobiology.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信