Shitao Zhang, Jiafei Cao, Yang Gao, Fangfang Sun, Yong Yang
{"title":"A Deep Learning Algorithm for Multi-Source Data Fusion to Predict Effluent Quality of Wastewater Treatment Plant.","authors":"Shitao Zhang, Jiafei Cao, Yang Gao, Fangfang Sun, Yong Yang","doi":"10.3390/toxics13050349","DOIUrl":null,"url":null,"abstract":"<p><p>The operational complexity of wastewater treatment systems mainly stems from the diversity of influent characteristics and the nonlinear nature of the treatment process. Together, these factors make the control of effluent quality in wastewater treatment plants (WWTPs) difficult to manage effectively. To address this challenge, constructing accurate effluent quality models for WWTPs can not only mitigate these complexities, but also provide critical decision support for operational management. In this research, we introduce a deep learning method that fuses multi-source data. This method utilises various indicators to comprehensively analyse and predict the quality of effluent water: water quantity data, process data, energy consumption data, and water quality data. To assess the efficacy of this method, a case study was carried out at an industrial effluent treatment plant (IETP) in Anhui Province, China. Deep learning algorithms including long short-term memory (LSTM) and gated recurrent unit (GRU) were found to have a favourable prediction performance by comparing with traditional machine learning algorithms (random forest, RF) and multi-layer perceptron (MLP). The results show that the R<sup>2</sup> of LSTM and GRU is 1.36%~31.82% higher than that of MLP and 9.10%~47.75% higher than that of traditional machine learning algorithms. Finally, the RReliefF approach was used to identify the key parameters affecting the water quality behaviour of IETP effluent, and it was found that, by optimising the multi-source feature structure, not only the monitoring and management strategies can be optimised, but also the modelling efficiency of the model can be further improved.</p>","PeriodicalId":23195,"journal":{"name":"Toxics","volume":"13 5","pages":""},"PeriodicalIF":3.9000,"publicationDate":"2025-04-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12115653/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Toxics","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.3390/toxics13050349","RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
The operational complexity of wastewater treatment systems mainly stems from the diversity of influent characteristics and the nonlinear nature of the treatment process. Together, these factors make the control of effluent quality in wastewater treatment plants (WWTPs) difficult to manage effectively. To address this challenge, constructing accurate effluent quality models for WWTPs can not only mitigate these complexities, but also provide critical decision support for operational management. In this research, we introduce a deep learning method that fuses multi-source data. This method utilises various indicators to comprehensively analyse and predict the quality of effluent water: water quantity data, process data, energy consumption data, and water quality data. To assess the efficacy of this method, a case study was carried out at an industrial effluent treatment plant (IETP) in Anhui Province, China. Deep learning algorithms including long short-term memory (LSTM) and gated recurrent unit (GRU) were found to have a favourable prediction performance by comparing with traditional machine learning algorithms (random forest, RF) and multi-layer perceptron (MLP). The results show that the R2 of LSTM and GRU is 1.36%~31.82% higher than that of MLP and 9.10%~47.75% higher than that of traditional machine learning algorithms. Finally, the RReliefF approach was used to identify the key parameters affecting the water quality behaviour of IETP effluent, and it was found that, by optimising the multi-source feature structure, not only the monitoring and management strategies can be optimised, but also the modelling efficiency of the model can be further improved.
ToxicsChemical Engineering-Chemical Health and Safety
CiteScore
4.50
自引率
10.90%
发文量
681
审稿时长
6 weeks
期刊介绍:
Toxics (ISSN 2305-6304) is an international, peer-reviewed, open access journal which provides an advanced forum for studies related to all aspects of toxic chemicals and materials. It publishes reviews, regular research papers, and short communications. Our aim is to encourage scientists to publish their experimental and theoretical results in detail. There is, therefore, no restriction on the maximum length of the papers, although authors should write their papers in a clear and concise way. The full experimental details must be provided so that the results can be reproduced. Electronic files or software regarding the full details of calculations and experimental procedure can be deposited as supplementary material, if it is not possible to publish them along with the text.