Prediction of Groundwater Level and its Correlation with Land Subsidence and Groundwater Quality in Cangzhou, North China Plain, Using Time-Series Long Short-Term Memory Neural Network and Hybrid Models
Mouigni Baraka Nafouanti, Junxia Li, Hamada Chakira, Edwin E. Nyakilla, Denice Cleophace Fabiani, Jane Ferah Gondwe, Ismaila Sallah
{"title":"Prediction of Groundwater Level and its Correlation with Land Subsidence and Groundwater Quality in Cangzhou, North China Plain, Using Time-Series Long Short-Term Memory Neural Network and Hybrid Models","authors":"Mouigni Baraka Nafouanti, Junxia Li, Hamada Chakira, Edwin E. Nyakilla, Denice Cleophace Fabiani, Jane Ferah Gondwe, Ismaila Sallah","doi":"10.1007/s11053-025-10474-1","DOIUrl":null,"url":null,"abstract":"<p>Groundwater is the primary source of drinking water in the world, but its contamination and reduction cause environmental problems. Traditional hydraulic and numerical models for assessing groundwater and land subsidence are time-consuming and expensive. Thus, this study used the long short-term memory (LSTM) neural network to predict groundwater level and employed linear regression analysis and the hybrid random forest linear regression to find the correlation between groundwater and land subsidence. The impact of groundwater level on groundwater quality was investigated by forecasting the fluoride in groundwater using the hybrid models of random forest and k-nearest neighbor (RF–KNN), random forest linear model (HRFLM), and gradient boosting support vector regression (GBR–SVR) for the prediction of groundwater fluoride. The LSTM model yielded an <i>R</i><sup>2</sup> of 0.96 in forecasting groundwater level, and the time series results from 2018 to 2022 showed a variation in groundwater level, with a decline in 2022. The LSTM model suggested that from 2024 to 2040, the groundwater level would recover progressively. The regression analysis showed an <i>R</i><sup>2</sup> of 0.99 and a <i>p</i> value of 0.01 for the correlation between groundwater level and land subsidence, and the HRFLM model yielded an <i>R</i><sup>2</sup> of 0.94. For predicting groundwater fluoride contamination, the hybrid RF–KNN had the highest <i>R</i><sup>2</sup> of 0.97 compared to HRFLM and GBR–SVR, with <i>R</i><sup>2</sup> of 0.95 and 0.93, respectively. This research demonstrated that hybrid models and deep learning are advanced techniques that can be applied in Cangzhou to evaluate groundwater level and land subsidence and they can be applied in areas facing similar challenges.</p>","PeriodicalId":54284,"journal":{"name":"Natural Resources Research","volume":"12 1","pages":""},"PeriodicalIF":4.8000,"publicationDate":"2025-03-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Natural Resources Research","FirstCategoryId":"89","ListUrlMain":"https://doi.org/10.1007/s11053-025-10474-1","RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GEOSCIENCES, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
Groundwater is the primary source of drinking water in the world, but its contamination and reduction cause environmental problems. Traditional hydraulic and numerical models for assessing groundwater and land subsidence are time-consuming and expensive. Thus, this study used the long short-term memory (LSTM) neural network to predict groundwater level and employed linear regression analysis and the hybrid random forest linear regression to find the correlation between groundwater and land subsidence. The impact of groundwater level on groundwater quality was investigated by forecasting the fluoride in groundwater using the hybrid models of random forest and k-nearest neighbor (RF–KNN), random forest linear model (HRFLM), and gradient boosting support vector regression (GBR–SVR) for the prediction of groundwater fluoride. The LSTM model yielded an R2 of 0.96 in forecasting groundwater level, and the time series results from 2018 to 2022 showed a variation in groundwater level, with a decline in 2022. The LSTM model suggested that from 2024 to 2040, the groundwater level would recover progressively. The regression analysis showed an R2 of 0.99 and a p value of 0.01 for the correlation between groundwater level and land subsidence, and the HRFLM model yielded an R2 of 0.94. For predicting groundwater fluoride contamination, the hybrid RF–KNN had the highest R2 of 0.97 compared to HRFLM and GBR–SVR, with R2 of 0.95 and 0.93, respectively. This research demonstrated that hybrid models and deep learning are advanced techniques that can be applied in Cangzhou to evaluate groundwater level and land subsidence and they can be applied in areas facing similar challenges.
期刊介绍:
This journal publishes quantitative studies of natural (mainly but not limited to mineral) resources exploration, evaluation and exploitation, including environmental and risk-related aspects. Typical articles use geoscientific data or analyses to assess, test, or compare resource-related aspects. NRR covers a wide variety of resources including minerals, coal, hydrocarbon, geothermal, water, and vegetation. Case studies are welcome.