Jing Yang , Ke Tian , Huayu Zhao , Zheng Feng , Sami Bourouis , Sami Dhahbi , Abdullah Ayub Khan , Mouhebeddine Berrima , Lip Yee Por
{"title":"Wastewater treatment monitoring: Fault detection in sensors using transductive learning and improved reinforcement learning","authors":"Jing Yang , Ke Tian , Huayu Zhao , Zheng Feng , Sami Bourouis , Sami Dhahbi , Abdullah Ayub Khan , Mouhebeddine Berrima , Lip Yee Por","doi":"10.1016/j.eswa.2024.125805","DOIUrl":null,"url":null,"abstract":"<div><div>Wastewater treatment plants (WWTPs) increasingly utilize sensors to optimize operations and ensure treated water quality. These sensors’ rich datasets are well-suited for automated monitoring and fault detection. This study introduces a deep learning method for fault detection in sensors designed to tackle significant challenges, including a class imbalance in datasets where normal operational data significantly outnumber anomalies and sensitivity to hyperparameters. We employ a novel spatial attention-based transductive long short-term memory (TLSTM) network designed to detect subtle temporal variations in time-series data, facilitating the binary classification of faults in key processes like oxidation and nitrification. To address the challenge of data imbalance prevalent in WWTP monitoring, our model integrates the off-policy proximal policy optimization (Off-Policy PPO) framework. This adaptation enhances the traditional PPO algorithm for off-policy learning environments, improving data utilization and algorithm stability. In this system, data points are treated as a sequence of decisions, with the neural network functioning as an intelligent agent. The Off-Policy PPO approach employs a reward mechanism that prioritizes the correct prediction of minority-class instances over majority-class ones by assigning higher rewards. Moreover, the model incorporates the differential evolution (DE) algorithm for autonomous hyperparameter optimization, thereby minimizing reliance on manual tuning. Our rigorous testing on the Valdobbiadene dataset shows that our approach outperforms existing methods. Additionally, we apply transfer learning (TL) to the BSM1 dataset to further validate the model’s effectiveness. Achieving F-measures of 87.24% on the Valdobbiadene dataset and 82.48% on the BSM1 dataset demonstrates the model’s capability to promptly identify faults, significantly enhancing the reliability and efficiency of WWTP monitoring systems.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"264 ","pages":"Article 125805"},"PeriodicalIF":7.5000,"publicationDate":"2024-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0957417424026721","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Wastewater treatment plants (WWTPs) increasingly utilize sensors to optimize operations and ensure treated water quality. These sensors’ rich datasets are well-suited for automated monitoring and fault detection. This study introduces a deep learning method for fault detection in sensors designed to tackle significant challenges, including a class imbalance in datasets where normal operational data significantly outnumber anomalies and sensitivity to hyperparameters. We employ a novel spatial attention-based transductive long short-term memory (TLSTM) network designed to detect subtle temporal variations in time-series data, facilitating the binary classification of faults in key processes like oxidation and nitrification. To address the challenge of data imbalance prevalent in WWTP monitoring, our model integrates the off-policy proximal policy optimization (Off-Policy PPO) framework. This adaptation enhances the traditional PPO algorithm for off-policy learning environments, improving data utilization and algorithm stability. In this system, data points are treated as a sequence of decisions, with the neural network functioning as an intelligent agent. The Off-Policy PPO approach employs a reward mechanism that prioritizes the correct prediction of minority-class instances over majority-class ones by assigning higher rewards. Moreover, the model incorporates the differential evolution (DE) algorithm for autonomous hyperparameter optimization, thereby minimizing reliance on manual tuning. Our rigorous testing on the Valdobbiadene dataset shows that our approach outperforms existing methods. Additionally, we apply transfer learning (TL) to the BSM1 dataset to further validate the model’s effectiveness. Achieving F-measures of 87.24% on the Valdobbiadene dataset and 82.48% on the BSM1 dataset demonstrates the model’s capability to promptly identify faults, significantly enhancing the reliability and efficiency of WWTP monitoring systems.
期刊介绍:
Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.