Machine learning-based evolution of water quality prediction model: An integrated robust framework for comparative application on periodic return and jitter data

IF 7.3 2区 环境科学与生态学 Q1 ENVIRONMENTAL SCIENCES
Xizhi Nong , Yi He , Lihua Chen , Jiahua Wei
{"title":"Machine learning-based evolution of water quality prediction model: An integrated robust framework for comparative application on periodic return and jitter data","authors":"Xizhi Nong ,&nbsp;Yi He ,&nbsp;Lihua Chen ,&nbsp;Jiahua Wei","doi":"10.1016/j.envpol.2025.125834","DOIUrl":null,"url":null,"abstract":"<div><div>Accurate water quality prediction is paramount for the sustainable management of surface water resources. Current deep learning models face challenges in reliably forecasting water quality due to the non-stationarity of environmental conditions and the intricate interactions among various environmental factors. This study introduces a novel, multi-level coupled machine learning framework that integrates data denoising, feature selection, and Long Short-Term Memory (LSTM) networks to enhance predictive accuracy. The findings demonstrate that the LSTM model incorporates data denoising pre-processing, capturing non-stationary water quality patterns more effectively than the baseline model, enhancing prediction performance (R<sup>2</sup> increased by 1.01%). The most adept model with wavelet transform exhibited superior adaptability and predictability, achieving a modest but statistically significant increase in R<sup>2</sup> values of 0.81% and 0.51% relative to incorporate moving average and complete ensemble empirical mode decomposition with adaptive noise techniques, respectively. The integrated models varied in their suitability for time series characterized by different patterns of variability (stability vs. instability, periodicity vs. non-periodicity). We conducted multi-step ahead predictions (t+1 and t+3 days) and employed two training configurations (80-20% and 70-30% splits) for dissolved oxygen and the permanganate index across four monitoring stations within the world's largest long-distance inter-basin water diversion project, to assess the reliability and robustness of the proposed water quality prediction models under varying conditions. The integration of data denoising techniques with LSTM networks substantially improves the prediction of dynamic water quality indices in complex environmental settings. Future research should explore the scalability of this framework across different geographical and climatic conditions to further validate its effectiveness and utility in global water resource management.</div></div>","PeriodicalId":311,"journal":{"name":"Environmental Pollution","volume":"369 ","pages":"Article 125834"},"PeriodicalIF":7.3000,"publicationDate":"2025-02-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental Pollution","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0269749125002076","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

Accurate water quality prediction is paramount for the sustainable management of surface water resources. Current deep learning models face challenges in reliably forecasting water quality due to the non-stationarity of environmental conditions and the intricate interactions among various environmental factors. This study introduces a novel, multi-level coupled machine learning framework that integrates data denoising, feature selection, and Long Short-Term Memory (LSTM) networks to enhance predictive accuracy. The findings demonstrate that the LSTM model incorporates data denoising pre-processing, capturing non-stationary water quality patterns more effectively than the baseline model, enhancing prediction performance (R2 increased by 1.01%). The most adept model with wavelet transform exhibited superior adaptability and predictability, achieving a modest but statistically significant increase in R2 values of 0.81% and 0.51% relative to incorporate moving average and complete ensemble empirical mode decomposition with adaptive noise techniques, respectively. The integrated models varied in their suitability for time series characterized by different patterns of variability (stability vs. instability, periodicity vs. non-periodicity). We conducted multi-step ahead predictions (t+1 and t+3 days) and employed two training configurations (80-20% and 70-30% splits) for dissolved oxygen and the permanganate index across four monitoring stations within the world's largest long-distance inter-basin water diversion project, to assess the reliability and robustness of the proposed water quality prediction models under varying conditions. The integration of data denoising techniques with LSTM networks substantially improves the prediction of dynamic water quality indices in complex environmental settings. Future research should explore the scalability of this framework across different geographical and climatic conditions to further validate its effectiveness and utility in global water resource management.

Abstract Image

Abstract Image

基于机器学习的水质预测模型演化:周期回归和抖动数据比较应用的集成鲁棒框架
准确的水质预测对地表水资源的可持续管理至关重要。由于环境条件的非平稳性和各种环境因素之间复杂的相互作用,目前的深度学习模型在可靠地预测水质方面面临挑战。本研究引入了一种新颖的多层次耦合机器学习框架,该框架集成了数据去噪、特征选择和长短期记忆(LSTM)网络,以提高预测准确性。结果表明,LSTM模型采用数据去噪预处理,比基线模型更有效地捕获非平稳水质模式,提高了预测性能(R2提高了1.01%)。最熟练的小波变换模型表现出更强的适应性和可预测性,相对于采用自适应噪声技术的移动平均和完全集合经验模态分解,R2值分别提高了0.81%和0.51%,增幅不大,但具有统计学意义。综合模型对具有不同变率模式(稳定性与不稳定性、周期性与非周期性)的时间序列的适用性各不相同。我们对世界上最大的长距离跨流域调水工程中4个监测站的溶解氧和高锰酸盐指数进行了多步提前预测(t+1天和t+3天),并采用80-20%和70-30%分割两种训练配置,以评估所提出的水质预测模型在不同条件下的可靠性和稳健性。数据去噪技术与LSTM网络的集成大大改善了复杂环境下动态水质指标的预测。未来的研究应探索该框架在不同地理和气候条件下的可扩展性,以进一步验证其在全球水资源管理中的有效性和实用性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Environmental Pollution
Environmental Pollution 环境科学-环境科学
CiteScore
16.00
自引率
6.70%
发文量
2082
审稿时长
2.9 months
期刊介绍: Environmental Pollution is an international peer-reviewed journal that publishes high-quality research papers and review articles covering all aspects of environmental pollution and its impacts on ecosystems and human health. Subject areas include, but are not limited to: • Sources and occurrences of pollutants that are clearly defined and measured in environmental compartments, food and food-related items, and human bodies; • Interlinks between contaminant exposure and biological, ecological, and human health effects, including those of climate change; • Contaminants of emerging concerns (including but not limited to antibiotic resistant microorganisms or genes, microplastics/nanoplastics, electronic wastes, light, and noise) and/or their biological, ecological, or human health effects; • Laboratory and field studies on the remediation/mitigation of environmental pollution via new techniques and with clear links to biological, ecological, or human health effects; • Modeling of pollution processes, patterns, or trends that is of clear environmental and/or human health interest; • New techniques that measure and examine environmental occurrences, transport, behavior, and effects of pollutants within the environment or the laboratory, provided that they can be clearly used to address problems within regional or global environmental compartments.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信