Assaad Kassem , Ahmed Sefelnasr , Abdel Azim Ebraheem , Luqman Ali , Faisal Baig , Mohsen Sherif
{"title":"阿联酋富查伊拉超干旱沿海含水层海水入侵的机器学习预测与分类","authors":"Assaad Kassem , Ahmed Sefelnasr , Abdel Azim Ebraheem , Luqman Ali , Faisal Baig , Mohsen Sherif","doi":"10.1016/j.ejrh.2025.102664","DOIUrl":null,"url":null,"abstract":"<div><h3>Study region</h3><div>This study focuses on an unconfined coastal aquifer located in the Emirate of Fujairah, United Arab Emirates—a hyper-arid region where the coastal aquifer is vulnerable to seawater intrusion (SWI), posing a threat to groundwater quality.</div></div><div><h3>Study focus</h3><div>Fifteen machine learning (ML) algorithms were evaluated to predict and classify total dissolved solids (TDS) as an indicator of SWI. The models were trained using six hydrogeological parameters: rainfall, hydraulic head, distance from the coastline, aquifer saturated thickness, hydraulic conductivity, and specific yield. Model performance was assessed for both prediction and classification tasks. LightGBM yielded the highest prediction accuracy (R² = 0.9574), followed by CatBoost (R² = 0.9565) and XGBoost (R² = 0.9517). For classification, CatBoost and Gradient Boosting achieved the best performance with 97.6 % accuracy and AUC = 0.9986. The top-performing prediction models were utilized to derive empirical formulas for estimating groundwater TDS, informed by variable importance analysis of the six hydrogeological parameters.</div></div><div><h3>New hydrological insights for the region</h3><div>Variable importance analysis highlighted hydraulic head and distance from the coast as key predictors of TDS, consistent with established SWI mechanisms. The study demonstrates that ML can be an effective alternative to traditional modeling approaches in data-limited environments, offering a robust SWI assessment. The developed empirical equations provide practical tools for local groundwater TDS estimation. Future work should address temporal dynamics and salinity vertical distribution patterns.</div></div>","PeriodicalId":48620,"journal":{"name":"Journal of Hydrology-Regional Studies","volume":"61 ","pages":"Article 102664"},"PeriodicalIF":5.0000,"publicationDate":"2025-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Machine learning-based prediction and classification of seawater intrusion in the hyper-arid coastal aquifer of Fujairah, UAE\",\"authors\":\"Assaad Kassem , Ahmed Sefelnasr , Abdel Azim Ebraheem , Luqman Ali , Faisal Baig , Mohsen Sherif\",\"doi\":\"10.1016/j.ejrh.2025.102664\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Study region</h3><div>This study focuses on an unconfined coastal aquifer located in the Emirate of Fujairah, United Arab Emirates—a hyper-arid region where the coastal aquifer is vulnerable to seawater intrusion (SWI), posing a threat to groundwater quality.</div></div><div><h3>Study focus</h3><div>Fifteen machine learning (ML) algorithms were evaluated to predict and classify total dissolved solids (TDS) as an indicator of SWI. The models were trained using six hydrogeological parameters: rainfall, hydraulic head, distance from the coastline, aquifer saturated thickness, hydraulic conductivity, and specific yield. Model performance was assessed for both prediction and classification tasks. LightGBM yielded the highest prediction accuracy (R² = 0.9574), followed by CatBoost (R² = 0.9565) and XGBoost (R² = 0.9517). For classification, CatBoost and Gradient Boosting achieved the best performance with 97.6 % accuracy and AUC = 0.9986. The top-performing prediction models were utilized to derive empirical formulas for estimating groundwater TDS, informed by variable importance analysis of the six hydrogeological parameters.</div></div><div><h3>New hydrological insights for the region</h3><div>Variable importance analysis highlighted hydraulic head and distance from the coast as key predictors of TDS, consistent with established SWI mechanisms. The study demonstrates that ML can be an effective alternative to traditional modeling approaches in data-limited environments, offering a robust SWI assessment. The developed empirical equations provide practical tools for local groundwater TDS estimation. Future work should address temporal dynamics and salinity vertical distribution patterns.</div></div>\",\"PeriodicalId\":48620,\"journal\":{\"name\":\"Journal of Hydrology-Regional Studies\",\"volume\":\"61 \",\"pages\":\"Article 102664\"},\"PeriodicalIF\":5.0000,\"publicationDate\":\"2025-08-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Hydrology-Regional Studies\",\"FirstCategoryId\":\"89\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2214581825004938\",\"RegionNum\":2,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"WATER RESOURCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Hydrology-Regional Studies","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2214581825004938","RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"WATER RESOURCES","Score":null,"Total":0}
Machine learning-based prediction and classification of seawater intrusion in the hyper-arid coastal aquifer of Fujairah, UAE
Study region
This study focuses on an unconfined coastal aquifer located in the Emirate of Fujairah, United Arab Emirates—a hyper-arid region where the coastal aquifer is vulnerable to seawater intrusion (SWI), posing a threat to groundwater quality.
Study focus
Fifteen machine learning (ML) algorithms were evaluated to predict and classify total dissolved solids (TDS) as an indicator of SWI. The models were trained using six hydrogeological parameters: rainfall, hydraulic head, distance from the coastline, aquifer saturated thickness, hydraulic conductivity, and specific yield. Model performance was assessed for both prediction and classification tasks. LightGBM yielded the highest prediction accuracy (R² = 0.9574), followed by CatBoost (R² = 0.9565) and XGBoost (R² = 0.9517). For classification, CatBoost and Gradient Boosting achieved the best performance with 97.6 % accuracy and AUC = 0.9986. The top-performing prediction models were utilized to derive empirical formulas for estimating groundwater TDS, informed by variable importance analysis of the six hydrogeological parameters.
New hydrological insights for the region
Variable importance analysis highlighted hydraulic head and distance from the coast as key predictors of TDS, consistent with established SWI mechanisms. The study demonstrates that ML can be an effective alternative to traditional modeling approaches in data-limited environments, offering a robust SWI assessment. The developed empirical equations provide practical tools for local groundwater TDS estimation. Future work should address temporal dynamics and salinity vertical distribution patterns.
期刊介绍:
Journal of Hydrology: Regional Studies publishes original research papers enhancing the science of hydrology and aiming at region-specific problems, past and future conditions, analysis, review and solutions. The journal particularly welcomes research papers that deliver new insights into region-specific hydrological processes and responses to changing conditions, as well as contributions that incorporate interdisciplinarity and translational science.