{"title":"A generalized machine learning approach for cost-effective monitoring of irrigation suitability: A demonstration case in El Fahs aquifer (Tunisia)","authors":"","doi":"10.1016/j.gsd.2024.101324","DOIUrl":null,"url":null,"abstract":"<div><p>This study develops and evaluates the performance of machine learning (ML) regression models, namely the Support Vector Regression (SVR), Random Forest (RF), k-Nearest Neighbors (kNN) and eXtreme Gradient Boosting (XgBoost), for estimating the Irrigation Water Quality Index (IWQI) based on groundwater samples collected from El Fahs aquifer in Tunisia. The groundwater data are used as predictors for training and testing the machine learning models. Results indicate that sodium concentration has the most influence on the IWQI estimations for all ML models. The best-performing algorithms are found to be the SVR and XgBoost. Different datasets are also collected from existing studies, and merged to generate a single dataset that is used to develop generalized machine learning models. Despite regional variations, the generalized models performed adequately when tested against the unseen data, particularly the data collected from El Fahs case site. This work highlights the potential of using ML models, together with established metrics, as screening tools for predicting and classifying the irrigation suitability of groundwater based on available literature, even when metrics are not universally applicable. The findings of this study can be used by the water authorities in data scarce environments as cost-effective monitoring tools to assess water suitability for irrigation purposes.</p></div>","PeriodicalId":37879,"journal":{"name":"Groundwater for Sustainable Development","volume":null,"pages":null},"PeriodicalIF":4.9000,"publicationDate":"2024-08-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2352801X24002479/pdfft?md5=f607bd0aa02da4ea05e044ade4f0ce94&pid=1-s2.0-S2352801X24002479-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Groundwater for Sustainable Development","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2352801X24002479","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ENVIRONMENTAL","Score":null,"Total":0}
引用次数: 0
Abstract
This study develops and evaluates the performance of machine learning (ML) regression models, namely the Support Vector Regression (SVR), Random Forest (RF), k-Nearest Neighbors (kNN) and eXtreme Gradient Boosting (XgBoost), for estimating the Irrigation Water Quality Index (IWQI) based on groundwater samples collected from El Fahs aquifer in Tunisia. The groundwater data are used as predictors for training and testing the machine learning models. Results indicate that sodium concentration has the most influence on the IWQI estimations for all ML models. The best-performing algorithms are found to be the SVR and XgBoost. Different datasets are also collected from existing studies, and merged to generate a single dataset that is used to develop generalized machine learning models. Despite regional variations, the generalized models performed adequately when tested against the unseen data, particularly the data collected from El Fahs case site. This work highlights the potential of using ML models, together with established metrics, as screening tools for predicting and classifying the irrigation suitability of groundwater based on available literature, even when metrics are not universally applicable. The findings of this study can be used by the water authorities in data scarce environments as cost-effective monitoring tools to assess water suitability for irrigation purposes.
本研究开发并评估了机器学习(ML)回归模型的性能,即支持向量回归(SVR)、随机森林(RF)、k-近邻(kNN)和极梯度提升(XgBoost),用于根据从突尼斯法赫斯含水层采集的地下水样本估算灌溉水质量指数(IWQI)。地下水数据被用作机器学习模型训练和测试的预测因子。结果表明,钠浓度对所有 ML 模型的 IWQI 估算影响最大。表现最好的算法是 SVR 和 XgBoost。此外,还从现有研究中收集了不同的数据集,并将其合并生成单一数据集,用于开发通用机器学习模型。尽管存在地区差异,但在对未见过的数据进行测试时,特别是对从 El Fahs 案例点收集的数据进行测试时,通用模型表现良好。这项工作凸显了使用 ML 模型和既定指标作为筛选工具的潜力,可根据现有文献预测地下水的灌溉适宜性并对其进行分类,即使指标并非普遍适用。在数据匮乏的环境中,水利部门可将本研究的结果用作评估灌溉用水适宜性的经济有效的监测工具。
期刊介绍:
Groundwater for Sustainable Development is directed to different stakeholders and professionals, including government and non-governmental organizations, international funding agencies, universities, public water institutions, public health and other public/private sector professionals, and other relevant institutions. It is aimed at professionals, academics and students in the fields of disciplines such as: groundwater and its connection to surface hydrology and environment, soil sciences, engineering, ecology, microbiology, atmospheric sciences, analytical chemistry, hydro-engineering, water technology, environmental ethics, economics, public health, policy, as well as social sciences, legal disciplines, or any other area connected with water issues. The objectives of this journal are to facilitate: • The improvement of effective and sustainable management of water resources across the globe. • The improvement of human access to groundwater resources in adequate quantity and good quality. • The meeting of the increasing demand for drinking and irrigation water needed for food security to contribute to a social and economically sound human development. • The creation of a global inter- and multidisciplinary platform and forum to improve our understanding of groundwater resources and to advocate their effective and sustainable management and protection against contamination. • Interdisciplinary information exchange and to stimulate scientific research in the fields of groundwater related sciences and social and health sciences required to achieve the United Nations Millennium Development Goals for sustainable development.