{"title":"水文地质、测井和机器学习对干旱地区含水层水力特性预测的贡献:以埃及Farafra绿洲努比亚砂岩含水层为例","authors":"Ahmed Nosair, Muhammad Nabih, Ahmed Bakry","doi":"10.1007/s13201-025-02547-6","DOIUrl":null,"url":null,"abstract":"<div><p>In hydrogeology, assessing key aquifer hydraulic parameters such as transmissivity (T), hydraulic conductivity (K), and porosity (PHIE) is crucial for effective groundwater management. Traditionally, these parameters are obtained through pumping tests and well log data. However, porosity logs are often lacking in most groundwater wells. While neutron density logs are commonly used for porosity estimation, our study uniquely employs resistivity logs to calculate porosity due to the scarcity of recorded logs in groundwater exploration. Consequently, this research aims to use conventional well log and hydrogeological data to predict T, K, and PHIE using machine learning (ML) algorithms, including random forest (RF), gradient boosting (GB), linear regression (LR), and support vector machines (SVM). This methodology is applied as a case study in the Nubian Sandstone Aquifer (NSA) in Farafra Oasis, Egypt. Firstly, T and k values were determined by analysis of the long duration pumping test records for ten wells penetrated the NSA. The performance of the ML algorithms in predicting transmissivity and hydraulic conductivity was rigorously evaluated using test wells. The RF model demonstrated superior accuracy, with predicted values of T and K being 113.11 m<sup>2</sup>/h and 0.2271 m/h in well W-6, and 104.15 m<sup>2</sup>/h and 0.1867 m/h in well W-8, respectively. The close agreement among actual and predicted values underscores the RF model’s reliability in estimating these parameters, effectively identifying the fundamental trends within the dataset. For porosity prediction, the RF and GB models exhibited excellent correlation with log-derived PHIE, achieving correlation coefficients of 0.95 and 0.96, respectively. In contrast, the LR model showed acceptable performance, while the SVM model had comparatively lower correlation. These findings highlight the potential of ML models, particularly RF and GB, in accurately predicting key aquifer hydraulic parameters, thereby enhancing the understanding and management of the groundwater aquifers.</p></div>","PeriodicalId":8374,"journal":{"name":"Applied Water Science","volume":"15 8","pages":""},"PeriodicalIF":5.7000,"publicationDate":"2025-07-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s13201-025-02547-6.pdf","citationCount":"0","resultStr":"{\"title\":\"Contribution of hydrogeological, well logs and machine learning in predicting the aquifer hydraulic properties in arid regions: a case study of Nubian Sandstone aquifer, Farafra Oasis, Egypt\",\"authors\":\"Ahmed Nosair, Muhammad Nabih, Ahmed Bakry\",\"doi\":\"10.1007/s13201-025-02547-6\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>In hydrogeology, assessing key aquifer hydraulic parameters such as transmissivity (T), hydraulic conductivity (K), and porosity (PHIE) is crucial for effective groundwater management. Traditionally, these parameters are obtained through pumping tests and well log data. However, porosity logs are often lacking in most groundwater wells. While neutron density logs are commonly used for porosity estimation, our study uniquely employs resistivity logs to calculate porosity due to the scarcity of recorded logs in groundwater exploration. Consequently, this research aims to use conventional well log and hydrogeological data to predict T, K, and PHIE using machine learning (ML) algorithms, including random forest (RF), gradient boosting (GB), linear regression (LR), and support vector machines (SVM). This methodology is applied as a case study in the Nubian Sandstone Aquifer (NSA) in Farafra Oasis, Egypt. Firstly, T and k values were determined by analysis of the long duration pumping test records for ten wells penetrated the NSA. The performance of the ML algorithms in predicting transmissivity and hydraulic conductivity was rigorously evaluated using test wells. The RF model demonstrated superior accuracy, with predicted values of T and K being 113.11 m<sup>2</sup>/h and 0.2271 m/h in well W-6, and 104.15 m<sup>2</sup>/h and 0.1867 m/h in well W-8, respectively. The close agreement among actual and predicted values underscores the RF model’s reliability in estimating these parameters, effectively identifying the fundamental trends within the dataset. For porosity prediction, the RF and GB models exhibited excellent correlation with log-derived PHIE, achieving correlation coefficients of 0.95 and 0.96, respectively. In contrast, the LR model showed acceptable performance, while the SVM model had comparatively lower correlation. These findings highlight the potential of ML models, particularly RF and GB, in accurately predicting key aquifer hydraulic parameters, thereby enhancing the understanding and management of the groundwater aquifers.</p></div>\",\"PeriodicalId\":8374,\"journal\":{\"name\":\"Applied Water Science\",\"volume\":\"15 8\",\"pages\":\"\"},\"PeriodicalIF\":5.7000,\"publicationDate\":\"2025-07-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://link.springer.com/content/pdf/10.1007/s13201-025-02547-6.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Applied Water Science\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s13201-025-02547-6\",\"RegionNum\":3,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"WATER RESOURCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Water Science","FirstCategoryId":"93","ListUrlMain":"https://link.springer.com/article/10.1007/s13201-025-02547-6","RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"WATER RESOURCES","Score":null,"Total":0}
Contribution of hydrogeological, well logs and machine learning in predicting the aquifer hydraulic properties in arid regions: a case study of Nubian Sandstone aquifer, Farafra Oasis, Egypt
In hydrogeology, assessing key aquifer hydraulic parameters such as transmissivity (T), hydraulic conductivity (K), and porosity (PHIE) is crucial for effective groundwater management. Traditionally, these parameters are obtained through pumping tests and well log data. However, porosity logs are often lacking in most groundwater wells. While neutron density logs are commonly used for porosity estimation, our study uniquely employs resistivity logs to calculate porosity due to the scarcity of recorded logs in groundwater exploration. Consequently, this research aims to use conventional well log and hydrogeological data to predict T, K, and PHIE using machine learning (ML) algorithms, including random forest (RF), gradient boosting (GB), linear regression (LR), and support vector machines (SVM). This methodology is applied as a case study in the Nubian Sandstone Aquifer (NSA) in Farafra Oasis, Egypt. Firstly, T and k values were determined by analysis of the long duration pumping test records for ten wells penetrated the NSA. The performance of the ML algorithms in predicting transmissivity and hydraulic conductivity was rigorously evaluated using test wells. The RF model demonstrated superior accuracy, with predicted values of T and K being 113.11 m2/h and 0.2271 m/h in well W-6, and 104.15 m2/h and 0.1867 m/h in well W-8, respectively. The close agreement among actual and predicted values underscores the RF model’s reliability in estimating these parameters, effectively identifying the fundamental trends within the dataset. For porosity prediction, the RF and GB models exhibited excellent correlation with log-derived PHIE, achieving correlation coefficients of 0.95 and 0.96, respectively. In contrast, the LR model showed acceptable performance, while the SVM model had comparatively lower correlation. These findings highlight the potential of ML models, particularly RF and GB, in accurately predicting key aquifer hydraulic parameters, thereby enhancing the understanding and management of the groundwater aquifers.