Advanced deep learning model for predicting water pollutants using spectral data and augmentation techniques: A case study of the Middle and Lower Yangtze River, China

IF 6.9 2区 环境科学与生态学 Q1 ENGINEERING, CHEMICAL
Guohao Zhang , Cailing Wang , Hongwei Wang , YU Tao
{"title":"Advanced deep learning model for predicting water pollutants using spectral data and augmentation techniques: A case study of the Middle and Lower Yangtze River, China","authors":"Guohao Zhang ,&nbsp;Cailing Wang ,&nbsp;Hongwei Wang ,&nbsp;YU Tao","doi":"10.1016/j.psep.2025.107058","DOIUrl":null,"url":null,"abstract":"<div><div>Deep learning has demonstrated significant advantages in managing nonlinear relationships within high-dimensional spectral data, making it widely applicable in water quality monitoring. However, the variety of model selection and construction strategies has resulted in substantial fluctuations in predictive performance, particularly with high-dimensional data. This study constructs an integrated deep learning framework for predicting water pollutant concentrations, incorporating several key modules including data preprocessing, frequency decomposition, feature enhancement, sample augmentation, and decoder regression prediction. In the established model, an improved wavelet transform algorithm is first employed to address the issue of original data being unable to effectively distinguish detailed features, thereby accurately extracting the periodicity and volatility characteristics of the data. Secondly, an encoder module based on the Informer architecture enhances various frequency domain features and further improves the quality of features and their correlation with labels through distillation techniques. Subsequently, an improved generative adversarial network is introduced to tackle the problem of small sample data by effectively augmenting the limited dataset, thereby enhancing the overall quality of the dataset. Finally, a decoder module combining an optimization algorithm and an improved convolutional neural network (IMCPSO-RCNN) effectively addresses the shortcomings of traditional models in hyperparameter optimization and predictive performance, achieving efficient and accurate regression prediction of pollutant concentrations. A case study in the middle and lower reaches of the Yangtze River shows that this model outperforms others in prediction accuracy, achieving coefficients of determination (R²) of 0.9785, 0.9733, and 0.9741 for TN, COD, and TP, respectively. The root mean square error (RMSE) values are 0.0601, 0.6248, and 0.0023, while the mean absolute error (MAE) scores are 0.0252, 0.2810, and 0.0006, respectively. The necessity and effectiveness of each model component are validated through ablation experiments. This research offers an efficient and unified deep learning solution for monitoring water pollutants.</div></div><div><h3>Synopsis</h3><div>This deep learning framework enhances water quality monitoring by accurately predicting pollutant concentrations, informing environmental policy and water system management.</div></div>","PeriodicalId":20743,"journal":{"name":"Process Safety and Environmental Protection","volume":"197 ","pages":"Article 107058"},"PeriodicalIF":6.9000,"publicationDate":"2025-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Process Safety and Environmental Protection","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0957582025003258","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, CHEMICAL","Score":null,"Total":0}
引用次数: 0

Abstract

Deep learning has demonstrated significant advantages in managing nonlinear relationships within high-dimensional spectral data, making it widely applicable in water quality monitoring. However, the variety of model selection and construction strategies has resulted in substantial fluctuations in predictive performance, particularly with high-dimensional data. This study constructs an integrated deep learning framework for predicting water pollutant concentrations, incorporating several key modules including data preprocessing, frequency decomposition, feature enhancement, sample augmentation, and decoder regression prediction. In the established model, an improved wavelet transform algorithm is first employed to address the issue of original data being unable to effectively distinguish detailed features, thereby accurately extracting the periodicity and volatility characteristics of the data. Secondly, an encoder module based on the Informer architecture enhances various frequency domain features and further improves the quality of features and their correlation with labels through distillation techniques. Subsequently, an improved generative adversarial network is introduced to tackle the problem of small sample data by effectively augmenting the limited dataset, thereby enhancing the overall quality of the dataset. Finally, a decoder module combining an optimization algorithm and an improved convolutional neural network (IMCPSO-RCNN) effectively addresses the shortcomings of traditional models in hyperparameter optimization and predictive performance, achieving efficient and accurate regression prediction of pollutant concentrations. A case study in the middle and lower reaches of the Yangtze River shows that this model outperforms others in prediction accuracy, achieving coefficients of determination (R²) of 0.9785, 0.9733, and 0.9741 for TN, COD, and TP, respectively. The root mean square error (RMSE) values are 0.0601, 0.6248, and 0.0023, while the mean absolute error (MAE) scores are 0.0252, 0.2810, and 0.0006, respectively. The necessity and effectiveness of each model component are validated through ablation experiments. This research offers an efficient and unified deep learning solution for monitoring water pollutants.

Synopsis

This deep learning framework enhances water quality monitoring by accurately predicting pollutant concentrations, informing environmental policy and water system management.
求助全文
约1分钟内获得全文 求助全文
来源期刊
Process Safety and Environmental Protection
Process Safety and Environmental Protection 环境科学-工程:化工
CiteScore
11.40
自引率
15.40%
发文量
929
审稿时长
8.0 months
期刊介绍: The Process Safety and Environmental Protection (PSEP) journal is a leading international publication that focuses on the publication of high-quality, original research papers in the field of engineering, specifically those related to the safety of industrial processes and environmental protection. The journal encourages submissions that present new developments in safety and environmental aspects, particularly those that show how research findings can be applied in process engineering design and practice. PSEP is particularly interested in research that brings fresh perspectives to established engineering principles, identifies unsolved problems, or suggests directions for future research. The journal also values contributions that push the boundaries of traditional engineering and welcomes multidisciplinary papers. PSEP's articles are abstracted and indexed by a range of databases and services, which helps to ensure that the journal's research is accessible and recognized in the academic and professional communities. These databases include ANTE, Chemical Abstracts, Chemical Hazards in Industry, Current Contents, Elsevier Engineering Information database, Pascal Francis, Web of Science, Scopus, Engineering Information Database EnCompass LIT (Elsevier), and INSPEC. This wide coverage facilitates the dissemination of the journal's content to a global audience interested in process safety and environmental engineering.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信