{"title":"基于组合机器学习模型的水质预测方法——以大陵江流域为例。","authors":"Yang Liu, Yingchun Wang","doi":"10.1016/j.jconhyd.2025.104725","DOIUrl":null,"url":null,"abstract":"<div><div>Total nitrogen (TN) is a key factor limiting river water quality. Identifying its spatial-temporal variation and influencing factors is essential for predicting pollution trends and mitigating water quality risks. However, challenges such as insufficient real-time monitoring, limited spatiotemporal modeling, and complex feature selection for multi-source data remain. This study proposes a machine learning method combining spatiotemporal weighted interpolation, relevant feature selection, and time-series decomposition to analyze TN concentration variations in the Daling River Basin (2023–2025) using data from seven monitoring stations. The Enhanced Long Short-Term Memory with Back Propagation Network (ELSTM-EBP) model is developed to assess TN spatial-temporal fluctuations and driving factors. Results show that: (1) TN concentrations are highest in January and February, with a “U”-shaped fluctuation over the year, and lowest in August; (2) Significant spatial heterogeneity is observed at the seven monitoring points; (3) Water temperature negatively correlates with TN, while dissolved oxygen positively correlates; other factors such as the permanganate index and turbidity also significantly influence TN levels; (4) The ELSTM-EBP model outperforms other models (ELSTM-EBP > QLSTM > LSTM > GRU-QIMAS > EQINN > BP) in TN prediction accuracy and generalization ability; (5) Multi-step prediction accuracy decreases slightly with step length, but remains within −0.4 to 0.4 mg/L for up to 7 steps, indicating robust performance for short-term predictions.</div></div>","PeriodicalId":15530,"journal":{"name":"Journal of contaminant hydrology","volume":"276 ","pages":"Article 104725"},"PeriodicalIF":4.4000,"publicationDate":"2025-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Water quality prediction method based on a combined machine learning model: A case study of the Daling River Basin\",\"authors\":\"Yang Liu, Yingchun Wang\",\"doi\":\"10.1016/j.jconhyd.2025.104725\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Total nitrogen (TN) is a key factor limiting river water quality. Identifying its spatial-temporal variation and influencing factors is essential for predicting pollution trends and mitigating water quality risks. However, challenges such as insufficient real-time monitoring, limited spatiotemporal modeling, and complex feature selection for multi-source data remain. This study proposes a machine learning method combining spatiotemporal weighted interpolation, relevant feature selection, and time-series decomposition to analyze TN concentration variations in the Daling River Basin (2023–2025) using data from seven monitoring stations. The Enhanced Long Short-Term Memory with Back Propagation Network (ELSTM-EBP) model is developed to assess TN spatial-temporal fluctuations and driving factors. Results show that: (1) TN concentrations are highest in January and February, with a “U”-shaped fluctuation over the year, and lowest in August; (2) Significant spatial heterogeneity is observed at the seven monitoring points; (3) Water temperature negatively correlates with TN, while dissolved oxygen positively correlates; other factors such as the permanganate index and turbidity also significantly influence TN levels; (4) The ELSTM-EBP model outperforms other models (ELSTM-EBP > QLSTM > LSTM > GRU-QIMAS > EQINN > BP) in TN prediction accuracy and generalization ability; (5) Multi-step prediction accuracy decreases slightly with step length, but remains within −0.4 to 0.4 mg/L for up to 7 steps, indicating robust performance for short-term predictions.</div></div>\",\"PeriodicalId\":15530,\"journal\":{\"name\":\"Journal of contaminant hydrology\",\"volume\":\"276 \",\"pages\":\"Article 104725\"},\"PeriodicalIF\":4.4000,\"publicationDate\":\"2025-09-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of contaminant hydrology\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S016977222500230X\",\"RegionNum\":3,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENVIRONMENTAL SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of contaminant hydrology","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S016977222500230X","RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
Water quality prediction method based on a combined machine learning model: A case study of the Daling River Basin
Total nitrogen (TN) is a key factor limiting river water quality. Identifying its spatial-temporal variation and influencing factors is essential for predicting pollution trends and mitigating water quality risks. However, challenges such as insufficient real-time monitoring, limited spatiotemporal modeling, and complex feature selection for multi-source data remain. This study proposes a machine learning method combining spatiotemporal weighted interpolation, relevant feature selection, and time-series decomposition to analyze TN concentration variations in the Daling River Basin (2023–2025) using data from seven monitoring stations. The Enhanced Long Short-Term Memory with Back Propagation Network (ELSTM-EBP) model is developed to assess TN spatial-temporal fluctuations and driving factors. Results show that: (1) TN concentrations are highest in January and February, with a “U”-shaped fluctuation over the year, and lowest in August; (2) Significant spatial heterogeneity is observed at the seven monitoring points; (3) Water temperature negatively correlates with TN, while dissolved oxygen positively correlates; other factors such as the permanganate index and turbidity also significantly influence TN levels; (4) The ELSTM-EBP model outperforms other models (ELSTM-EBP > QLSTM > LSTM > GRU-QIMAS > EQINN > BP) in TN prediction accuracy and generalization ability; (5) Multi-step prediction accuracy decreases slightly with step length, but remains within −0.4 to 0.4 mg/L for up to 7 steps, indicating robust performance for short-term predictions.
期刊介绍:
The Journal of Contaminant Hydrology is an international journal publishing scientific articles pertaining to the contamination of subsurface water resources. Emphasis is placed on investigations of the physical, chemical, and biological processes influencing the behavior and fate of organic and inorganic contaminants in the unsaturated (vadose) and saturated (groundwater) zones, as well as at groundwater-surface water interfaces. The ecological impacts of contaminants transported both from and to aquifers are of interest. Articles on contamination of surface water only, without a link to groundwater, are out of the scope. Broad latitude is allowed in identifying contaminants of interest, and include legacy and emerging pollutants, nutrients, nanoparticles, pathogenic microorganisms (e.g., bacteria, viruses, protozoa), microplastics, and various constituents associated with energy production (e.g., methane, carbon dioxide, hydrogen sulfide).
The journal''s scope embraces a wide range of topics including: experimental investigations of contaminant sorption, diffusion, transformation, volatilization and transport in the surface and subsurface; characterization of soil and aquifer properties only as they influence contaminant behavior; development and testing of mathematical models of contaminant behaviour; innovative techniques for restoration of contaminated sites; development of new tools or techniques for monitoring the extent of soil and groundwater contamination; transformation of contaminants in the hyporheic zone; effects of contaminants traversing the hyporheic zone on surface water and groundwater ecosystems; subsurface carbon sequestration and/or turnover; and migration of fluids associated with energy production into groundwater.