Zhenyu Gao , Guoqiang Wang , Yi Zhu , Jinyue Chen , Lei Fang , Shilong Ren , Jie Li , Yinglan A , Wanting Wang , Qiao Wang
{"title":"基于可解释深度学习模型的半干旱区典型河流水质参数预测及污染超标分析","authors":"Zhenyu Gao , Guoqiang Wang , Yi Zhu , Jinyue Chen , Lei Fang , Shilong Ren , Jie Li , Yinglan A , Wanting Wang , Qiao Wang","doi":"10.1016/j.envpol.2025.126801","DOIUrl":null,"url":null,"abstract":"<div><div>Deep learning models that integrate environmental characteristics provide a powerful means for high-precision water quality prediction; however, their black-box nature can limit interpretability and reliability. We proposed an interpretable Attention-Gated Recurrent Unit (AT-GRU) model that integrates water quality, meteorological, and hydrological data from the semi-arid Dahei River Basin, to improve prediction accuracy and transparency of results. The model achieved superior daily-scale prediction accuracy (average R<sup>2</sup> = 0.907) over traditional machine learning and deep learning approaches. To enhance interpretability, SHapley Additive exPlanations (SHAP) analysis was conducted to identify key drivers behind the predictions. Results indicated that ammonia nitrogen (NH<sub>3</sub>N), population count, and river flow were the dominant predictors of total nitrogen (TN) and total phosphorus (TP), while meteorological factors had limited influence under high-pollution conditions. Extreme precipitation events were found to temporarily elevate nutrient concentrations. Analysis of exceedances and extremes further highlighted specific periods of most effective regulatory interventions. Overall, our study contributes a data- and mechanism-informed modeling framework that supports targeted pollution control, early warning, and adaptive water quality management strategies in semi-arid regions.</div></div>","PeriodicalId":311,"journal":{"name":"Environmental Pollution","volume":"383 ","pages":"Article 126801"},"PeriodicalIF":7.3000,"publicationDate":"2025-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Prediction of water quality parameters and pollution exceedance analysis in typical rivers of semi-arid regions based on interpretable deep learning models\",\"authors\":\"Zhenyu Gao , Guoqiang Wang , Yi Zhu , Jinyue Chen , Lei Fang , Shilong Ren , Jie Li , Yinglan A , Wanting Wang , Qiao Wang\",\"doi\":\"10.1016/j.envpol.2025.126801\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Deep learning models that integrate environmental characteristics provide a powerful means for high-precision water quality prediction; however, their black-box nature can limit interpretability and reliability. We proposed an interpretable Attention-Gated Recurrent Unit (AT-GRU) model that integrates water quality, meteorological, and hydrological data from the semi-arid Dahei River Basin, to improve prediction accuracy and transparency of results. The model achieved superior daily-scale prediction accuracy (average R<sup>2</sup> = 0.907) over traditional machine learning and deep learning approaches. To enhance interpretability, SHapley Additive exPlanations (SHAP) analysis was conducted to identify key drivers behind the predictions. Results indicated that ammonia nitrogen (NH<sub>3</sub>N), population count, and river flow were the dominant predictors of total nitrogen (TN) and total phosphorus (TP), while meteorological factors had limited influence under high-pollution conditions. Extreme precipitation events were found to temporarily elevate nutrient concentrations. Analysis of exceedances and extremes further highlighted specific periods of most effective regulatory interventions. Overall, our study contributes a data- and mechanism-informed modeling framework that supports targeted pollution control, early warning, and adaptive water quality management strategies in semi-arid regions.</div></div>\",\"PeriodicalId\":311,\"journal\":{\"name\":\"Environmental Pollution\",\"volume\":\"383 \",\"pages\":\"Article 126801\"},\"PeriodicalIF\":7.3000,\"publicationDate\":\"2025-07-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Environmental Pollution\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0269749125011741\",\"RegionNum\":2,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENVIRONMENTAL SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental Pollution","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0269749125011741","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
Prediction of water quality parameters and pollution exceedance analysis in typical rivers of semi-arid regions based on interpretable deep learning models
Deep learning models that integrate environmental characteristics provide a powerful means for high-precision water quality prediction; however, their black-box nature can limit interpretability and reliability. We proposed an interpretable Attention-Gated Recurrent Unit (AT-GRU) model that integrates water quality, meteorological, and hydrological data from the semi-arid Dahei River Basin, to improve prediction accuracy and transparency of results. The model achieved superior daily-scale prediction accuracy (average R2 = 0.907) over traditional machine learning and deep learning approaches. To enhance interpretability, SHapley Additive exPlanations (SHAP) analysis was conducted to identify key drivers behind the predictions. Results indicated that ammonia nitrogen (NH3N), population count, and river flow were the dominant predictors of total nitrogen (TN) and total phosphorus (TP), while meteorological factors had limited influence under high-pollution conditions. Extreme precipitation events were found to temporarily elevate nutrient concentrations. Analysis of exceedances and extremes further highlighted specific periods of most effective regulatory interventions. Overall, our study contributes a data- and mechanism-informed modeling framework that supports targeted pollution control, early warning, and adaptive water quality management strategies in semi-arid regions.
期刊介绍:
Environmental Pollution is an international peer-reviewed journal that publishes high-quality research papers and review articles covering all aspects of environmental pollution and its impacts on ecosystems and human health.
Subject areas include, but are not limited to:
• Sources and occurrences of pollutants that are clearly defined and measured in environmental compartments, food and food-related items, and human bodies;
• Interlinks between contaminant exposure and biological, ecological, and human health effects, including those of climate change;
• Contaminants of emerging concerns (including but not limited to antibiotic resistant microorganisms or genes, microplastics/nanoplastics, electronic wastes, light, and noise) and/or their biological, ecological, or human health effects;
• Laboratory and field studies on the remediation/mitigation of environmental pollution via new techniques and with clear links to biological, ecological, or human health effects;
• Modeling of pollution processes, patterns, or trends that is of clear environmental and/or human health interest;
• New techniques that measure and examine environmental occurrences, transport, behavior, and effects of pollutants within the environment or the laboratory, provided that they can be clearly used to address problems within regional or global environmental compartments.