{"title":"设计基于特征选择和不确定性量化的深度学习方法预测大坝水库叶绿素a和水华风险","authors":"Akram Seifi , Hossien Riahi Madvar , Rouhollah Davarpanah , Mumtaz Ali , Abdul-Wahab Mashat","doi":"10.1016/j.jwpe.2025.108341","DOIUrl":null,"url":null,"abstract":"<div><div>Predicting water quality indicators accurately is vital for the sustainable management of aquatic ecosystems, particularly in dam reservoirs that are highly vulnerable to environmental phenomena. Dissolved oxygen (DO) and chlorophyll-a (Chl-a) are essential indicators for evaluating ecosystem stability and water quality. In this study, an innovative and robust intelligent framework is designed using integrated uncertainty quantification and feature selection to predict DO, Chl-a, and bloom risk evaluation of dams. First, the individual machine learning and deep learning models, including Extreme Gradient Boosting (XGBoost), Convolutional Neural Networks (CNNs), Gated Recurrent Units (GRUs), Least Square Support Vector Regression (LSSVR), and Multi-Layer Perceptron (MLP) were assessed. Subsequently, the most effective models are then integrated to enhance predictive accuracy. The Boruta Feature Selection Approach (BFSA), Gamma Test, and Shapley Additive Explanations (SHAP) are used to select the most suitable and relevant features. Then the Monte-Carlo simulation is implemented for uncertainty analysis to evaluate the reliability of models' prediction by determining probability distribution functions. The hybrid XGBoost-CNNs achieved the highest performance in terms of R<sup>2</sup> = 0.923, RMSE = 0.547 μg/l for Chl-a prediction, and CNNs obtained R<sup>2</sup> = 0.995, RMSE = 0.143 ppm for DO prediction. The 95 % Prediction Uncertainty (95PPU) varied from 79.37 to 100, which shows strong predictive reliability. Also, d-factor values lower than 0.77 confirmed the model uncertainty is low. Furthermore, water bloom risk was assessed using the predicted Chl-a concentration. The analysis indicated no risk levels at reservoir depths of 0–5.5 m and 13.5–32 m, while low-risk levels were identified between 5.5 and 13.5 m. The maximum risk probability was 20.66 % when Chl-a concentrations were below 40 μg/l. The results highlight the effectiveness of hybrid artificial intelligence frameworks in enabling real-time water quality monitoring, early detection of harmful algal blooms, and promoting sustainable reservoir management.</div></div>","PeriodicalId":17528,"journal":{"name":"Journal of water process engineering","volume":"77 ","pages":"Article 108341"},"PeriodicalIF":6.3000,"publicationDate":"2025-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Designing advanced feature selection and uncertainty quantification-based deep learning approach to predict chlorophyll-a and water bloom risks in dam reservoir\",\"authors\":\"Akram Seifi , Hossien Riahi Madvar , Rouhollah Davarpanah , Mumtaz Ali , Abdul-Wahab Mashat\",\"doi\":\"10.1016/j.jwpe.2025.108341\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Predicting water quality indicators accurately is vital for the sustainable management of aquatic ecosystems, particularly in dam reservoirs that are highly vulnerable to environmental phenomena. Dissolved oxygen (DO) and chlorophyll-a (Chl-a) are essential indicators for evaluating ecosystem stability and water quality. In this study, an innovative and robust intelligent framework is designed using integrated uncertainty quantification and feature selection to predict DO, Chl-a, and bloom risk evaluation of dams. First, the individual machine learning and deep learning models, including Extreme Gradient Boosting (XGBoost), Convolutional Neural Networks (CNNs), Gated Recurrent Units (GRUs), Least Square Support Vector Regression (LSSVR), and Multi-Layer Perceptron (MLP) were assessed. Subsequently, the most effective models are then integrated to enhance predictive accuracy. The Boruta Feature Selection Approach (BFSA), Gamma Test, and Shapley Additive Explanations (SHAP) are used to select the most suitable and relevant features. Then the Monte-Carlo simulation is implemented for uncertainty analysis to evaluate the reliability of models' prediction by determining probability distribution functions. The hybrid XGBoost-CNNs achieved the highest performance in terms of R<sup>2</sup> = 0.923, RMSE = 0.547 μg/l for Chl-a prediction, and CNNs obtained R<sup>2</sup> = 0.995, RMSE = 0.143 ppm for DO prediction. The 95 % Prediction Uncertainty (95PPU) varied from 79.37 to 100, which shows strong predictive reliability. Also, d-factor values lower than 0.77 confirmed the model uncertainty is low. Furthermore, water bloom risk was assessed using the predicted Chl-a concentration. The analysis indicated no risk levels at reservoir depths of 0–5.5 m and 13.5–32 m, while low-risk levels were identified between 5.5 and 13.5 m. The maximum risk probability was 20.66 % when Chl-a concentrations were below 40 μg/l. The results highlight the effectiveness of hybrid artificial intelligence frameworks in enabling real-time water quality monitoring, early detection of harmful algal blooms, and promoting sustainable reservoir management.</div></div>\",\"PeriodicalId\":17528,\"journal\":{\"name\":\"Journal of water process engineering\",\"volume\":\"77 \",\"pages\":\"Article 108341\"},\"PeriodicalIF\":6.3000,\"publicationDate\":\"2025-07-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of water process engineering\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2214714425014138\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, CHEMICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of water process engineering","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2214714425014138","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, CHEMICAL","Score":null,"Total":0}
Designing advanced feature selection and uncertainty quantification-based deep learning approach to predict chlorophyll-a and water bloom risks in dam reservoir
Predicting water quality indicators accurately is vital for the sustainable management of aquatic ecosystems, particularly in dam reservoirs that are highly vulnerable to environmental phenomena. Dissolved oxygen (DO) and chlorophyll-a (Chl-a) are essential indicators for evaluating ecosystem stability and water quality. In this study, an innovative and robust intelligent framework is designed using integrated uncertainty quantification and feature selection to predict DO, Chl-a, and bloom risk evaluation of dams. First, the individual machine learning and deep learning models, including Extreme Gradient Boosting (XGBoost), Convolutional Neural Networks (CNNs), Gated Recurrent Units (GRUs), Least Square Support Vector Regression (LSSVR), and Multi-Layer Perceptron (MLP) were assessed. Subsequently, the most effective models are then integrated to enhance predictive accuracy. The Boruta Feature Selection Approach (BFSA), Gamma Test, and Shapley Additive Explanations (SHAP) are used to select the most suitable and relevant features. Then the Monte-Carlo simulation is implemented for uncertainty analysis to evaluate the reliability of models' prediction by determining probability distribution functions. The hybrid XGBoost-CNNs achieved the highest performance in terms of R2 = 0.923, RMSE = 0.547 μg/l for Chl-a prediction, and CNNs obtained R2 = 0.995, RMSE = 0.143 ppm for DO prediction. The 95 % Prediction Uncertainty (95PPU) varied from 79.37 to 100, which shows strong predictive reliability. Also, d-factor values lower than 0.77 confirmed the model uncertainty is low. Furthermore, water bloom risk was assessed using the predicted Chl-a concentration. The analysis indicated no risk levels at reservoir depths of 0–5.5 m and 13.5–32 m, while low-risk levels were identified between 5.5 and 13.5 m. The maximum risk probability was 20.66 % when Chl-a concentrations were below 40 μg/l. The results highlight the effectiveness of hybrid artificial intelligence frameworks in enabling real-time water quality monitoring, early detection of harmful algal blooms, and promoting sustainable reservoir management.
期刊介绍:
The Journal of Water Process Engineering aims to publish refereed, high-quality research papers with significant novelty and impact in all areas of the engineering of water and wastewater processing . Papers on advanced and novel treatment processes and technologies are particularly welcome. The Journal considers papers in areas such as nanotechnology and biotechnology applications in water, novel oxidation and separation processes, membrane processes (except those for desalination) , catalytic processes for the removal of water contaminants, sustainable processes, water reuse and recycling, water use and wastewater minimization, integrated/hybrid technology, process modeling of water treatment and novel treatment processes. Submissions on the subject of adsorbents, including standard measurements of adsorption kinetics and equilibrium will only be considered if there is a genuine case for novelty and contribution, for example highly novel, sustainable adsorbents and their use: papers on activated carbon-type materials derived from natural matter, or surfactant-modified clays and related minerals, would not fulfil this criterion. The Journal particularly welcomes contributions involving environmentally, economically and socially sustainable technology for water treatment, including those which are energy-efficient, with minimal or no chemical consumption, and capable of water recycling and reuse that minimizes the direct disposal of wastewater to the aquatic environment. Papers that describe novel ideas for solving issues related to water quality and availability are also welcome, as are those that show the transfer of techniques from other disciplines. The Journal will consider papers dealing with processes for various water matrices including drinking water (except desalination), domestic, urban and industrial wastewaters, in addition to their residues. It is expected that the journal will be of particular relevance to chemical and process engineers working in the field. The Journal welcomes Full Text papers, Short Communications, State-of-the-Art Reviews and Letters to Editors and Case Studies