基于机器学习的流域尺度长期水质模拟与驱动因素识别

IF 3.5 3区 环境科学与生态学 Q2 ENVIRONMENTAL SCIENCES
Mingxuan Zhao , Chunzi Ma , Hanxiao Zhang , Haisheng Li , Shouliang Huo
{"title":"基于机器学习的流域尺度长期水质模拟与驱动因素识别","authors":"Mingxuan Zhao ,&nbsp;Chunzi Ma ,&nbsp;Hanxiao Zhang ,&nbsp;Haisheng Li ,&nbsp;Shouliang Huo","doi":"10.1016/j.jconhyd.2025.104604","DOIUrl":null,"url":null,"abstract":"<div><div>Understanding long-term trends and analyzing their driving factors are essential to effectively enhance water quality in watersheds. In China, although the overall quality of surface water continues to improve, significant issues remain in certain regions. The Liao River Basin, a critical industrial hub and key agricultural grain base in northeast China, continues to face unstable water quality conditions, despite over 20 years of management efforts. This study compared several data-driven models (random forest (RF), support vector machine regression (SVR), K-nearest neighbors (KNN), stacking, long short-term memory (LSTM), convolutional-long short-term memory (CNN-LSTM)), to accurately fill the water quality data gaps (i.e., total nitrogen (TN), ammonia nitrogen (NH<sub>3</sub>-N), total phosphorus (TP), chemical oxygen demand (COD<sub>Cr</sub>), permanganate index (COD<sub>Mn</sub>), electroconductibility (E)) from 1980 to 2022 in Liao River Basin. In addition, the SHapley Additive exPlanations (SHAP) model was employed to quantitatively assess the driving factors of water quality. The results showed that the RF model exhibited robust predictive capabilities. TN showed a steady increase of approximately 20 % from 1980 to 2022, while the other parameters were effectively controlled. Anthropogenic activities, especially in agriculture and urban areas, were found to significantly contribute to water quality deterioration. Additionally, climatic factors such as extreme rainfall, annual average precipitation, and extreme temperatures-along with geographical factors like soil properties and slope, were found to play crucial roles in influencing water quality.</div></div>","PeriodicalId":15530,"journal":{"name":"Journal of contaminant hydrology","volume":"273 ","pages":"Article 104604"},"PeriodicalIF":3.5000,"publicationDate":"2025-05-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Long-term water quality simulation and driving factors identification within the watershed scale using machine learning\",\"authors\":\"Mingxuan Zhao ,&nbsp;Chunzi Ma ,&nbsp;Hanxiao Zhang ,&nbsp;Haisheng Li ,&nbsp;Shouliang Huo\",\"doi\":\"10.1016/j.jconhyd.2025.104604\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Understanding long-term trends and analyzing their driving factors are essential to effectively enhance water quality in watersheds. In China, although the overall quality of surface water continues to improve, significant issues remain in certain regions. The Liao River Basin, a critical industrial hub and key agricultural grain base in northeast China, continues to face unstable water quality conditions, despite over 20 years of management efforts. This study compared several data-driven models (random forest (RF), support vector machine regression (SVR), K-nearest neighbors (KNN), stacking, long short-term memory (LSTM), convolutional-long short-term memory (CNN-LSTM)), to accurately fill the water quality data gaps (i.e., total nitrogen (TN), ammonia nitrogen (NH<sub>3</sub>-N), total phosphorus (TP), chemical oxygen demand (COD<sub>Cr</sub>), permanganate index (COD<sub>Mn</sub>), electroconductibility (E)) from 1980 to 2022 in Liao River Basin. In addition, the SHapley Additive exPlanations (SHAP) model was employed to quantitatively assess the driving factors of water quality. The results showed that the RF model exhibited robust predictive capabilities. TN showed a steady increase of approximately 20 % from 1980 to 2022, while the other parameters were effectively controlled. Anthropogenic activities, especially in agriculture and urban areas, were found to significantly contribute to water quality deterioration. Additionally, climatic factors such as extreme rainfall, annual average precipitation, and extreme temperatures-along with geographical factors like soil properties and slope, were found to play crucial roles in influencing water quality.</div></div>\",\"PeriodicalId\":15530,\"journal\":{\"name\":\"Journal of contaminant hydrology\",\"volume\":\"273 \",\"pages\":\"Article 104604\"},\"PeriodicalIF\":3.5000,\"publicationDate\":\"2025-05-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of contaminant hydrology\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0169772225001093\",\"RegionNum\":3,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENVIRONMENTAL SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of contaminant hydrology","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0169772225001093","RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0

摘要

了解长期趋势并分析其驱动因素对有效改善流域水质至关重要。在中国,虽然地表水的整体质量继续改善,但在某些地区仍然存在重大问题。辽河流域是中国东北重要的工业中心和重要的农业粮食基地,尽管经过20多年的管理努力,水质状况仍然不稳定。本研究比较了随机森林(RF)、支持向量机回归(SVR)、k近邻(KNN)、叠加、长短期记忆(LSTM)、卷积-长短期记忆(CNN-LSTM)等几种数据驱动模型,准确填补了辽河流域1980 - 2022年水质数据缺口(即总氮(TN)、氨氮(NH3-N)、总磷(TP)、化学需氧量(CODCr)、高锰酸盐指数(CODMn)、电导率(E))。此外,采用SHapley加性解释(SHAP)模型定量评价水质驱动因子。结果表明,该模型具有较强的预测能力。1980 - 2022年TN稳定增长约20%,其他参数得到有效控制。人类活动,特别是农业和城市地区的人类活动,对水质恶化起着重要作用。此外,极端降雨、年平均降水和极端温度等气候因素以及土壤性质和坡度等地理因素在影响水质方面发挥了关键作用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Long-term water quality simulation and driving factors identification within the watershed scale using machine learning
Understanding long-term trends and analyzing their driving factors are essential to effectively enhance water quality in watersheds. In China, although the overall quality of surface water continues to improve, significant issues remain in certain regions. The Liao River Basin, a critical industrial hub and key agricultural grain base in northeast China, continues to face unstable water quality conditions, despite over 20 years of management efforts. This study compared several data-driven models (random forest (RF), support vector machine regression (SVR), K-nearest neighbors (KNN), stacking, long short-term memory (LSTM), convolutional-long short-term memory (CNN-LSTM)), to accurately fill the water quality data gaps (i.e., total nitrogen (TN), ammonia nitrogen (NH3-N), total phosphorus (TP), chemical oxygen demand (CODCr), permanganate index (CODMn), electroconductibility (E)) from 1980 to 2022 in Liao River Basin. In addition, the SHapley Additive exPlanations (SHAP) model was employed to quantitatively assess the driving factors of water quality. The results showed that the RF model exhibited robust predictive capabilities. TN showed a steady increase of approximately 20 % from 1980 to 2022, while the other parameters were effectively controlled. Anthropogenic activities, especially in agriculture and urban areas, were found to significantly contribute to water quality deterioration. Additionally, climatic factors such as extreme rainfall, annual average precipitation, and extreme temperatures-along with geographical factors like soil properties and slope, were found to play crucial roles in influencing water quality.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Journal of contaminant hydrology
Journal of contaminant hydrology 环境科学-地球科学综合
CiteScore
6.80
自引率
2.80%
发文量
129
审稿时长
68 days
期刊介绍: The Journal of Contaminant Hydrology is an international journal publishing scientific articles pertaining to the contamination of subsurface water resources. Emphasis is placed on investigations of the physical, chemical, and biological processes influencing the behavior and fate of organic and inorganic contaminants in the unsaturated (vadose) and saturated (groundwater) zones, as well as at groundwater-surface water interfaces. The ecological impacts of contaminants transported both from and to aquifers are of interest. Articles on contamination of surface water only, without a link to groundwater, are out of the scope. Broad latitude is allowed in identifying contaminants of interest, and include legacy and emerging pollutants, nutrients, nanoparticles, pathogenic microorganisms (e.g., bacteria, viruses, protozoa), microplastics, and various constituents associated with energy production (e.g., methane, carbon dioxide, hydrogen sulfide). The journal''s scope embraces a wide range of topics including: experimental investigations of contaminant sorption, diffusion, transformation, volatilization and transport in the surface and subsurface; characterization of soil and aquifer properties only as they influence contaminant behavior; development and testing of mathematical models of contaminant behaviour; innovative techniques for restoration of contaminated sites; development of new tools or techniques for monitoring the extent of soil and groundwater contamination; transformation of contaminants in the hyporheic zone; effects of contaminants traversing the hyporheic zone on surface water and groundwater ecosystems; subsurface carbon sequestration and/or turnover; and migration of fluids associated with energy production into groundwater.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信