Development of a Soft Sensor Using Machine Learning Algorithms for Predicting the Water Quality of an Onsite Wastewater Treatment System

IF 6.7 Q1 ENGINEERING, ENVIRONMENTAL
Hsiang-Yang Shyu, Cynthia J. Castro, Robert A. Bair, Qing Lu and Daniel H. Yeh*, 
{"title":"Development of a Soft Sensor Using Machine Learning Algorithms for Predicting the Water Quality of an Onsite Wastewater Treatment System","authors":"Hsiang-Yang Shyu,&nbsp;Cynthia J. Castro,&nbsp;Robert A. Bair,&nbsp;Qing Lu and Daniel H. Yeh*,&nbsp;","doi":"10.1021/acsenvironau.2c00072","DOIUrl":null,"url":null,"abstract":"<p >Developing advanced onsite wastewater treatment systems (OWTS) requires accurate and consistent water quality monitoring to evaluate treatment efficiency and ensure regulatory compliance. However, off-line parameters such as chemical oxygen demand (COD), total suspended solids (TSS), and <i>Escherichia coli</i> (<i>E. coli</i>) require sample collection and time-consuming laboratory analyses that do not provide real-time information of system performance or component failure. While real-time COD analyzers have emerged in recent years, they are not economically viable for onsite systems due to cost and chemical consumables. This study aimed to design and implement a real-time remote monitoring system for OWTS by developing several multi-input and single-output soft sensors. The soft sensor integrates data that can be obtained from well-established in-line sensors to accurately predict key water quality parameters, including COD, TSS, and <i>E. coli</i> concentrations. The temporal and spatial water quality data of an existing field-tested OWTS operated for almost two years (<i>n</i> = 56 data points) were used to evaluate the prediction performance of four machine learning algorithms. These algorithms, namely, partial least square regression (PLS), support vector regression (SVR), cubist regression (CUB), and quantile regression neural network (QRNN), were chosen as candidate algorithms for their prior application and effectiveness in wastewater treatment predictions. Water quality parameters that can be measured in-line, including turbidity, color, pH, NH<sub>4</sub><sup>+</sup>, NO<sub>3</sub><sup>–</sup>, and electrical conductivity, were selected as model inputs for predicting COD, TSS, and <i>E. coli</i>. The results revealed that the trained SVR model provided a statistically significant prediction for COD with a mean absolute percentage error (MAPE) of 14.5% and <i>R</i><sup>2</sup> of 0.96. The CUB model provided the optimal predictive performance for TSS, with a MAPE of 24.8% and <i>R</i><sup>2</sup> of 0.99. None of the models were able to achieve optimal prediction results for <i>E. coli</i>; however, the CUB model performed the best with a MAPE of 71.4% and <i>R</i><sup>2</sup> of 0.22. Given the large fluctuation in the concentrations of COD, TSS, and <i>E. coli</i> within the OWTS wastewater dataset, the proposed soft sensor models adequately predicted COD and TSS, while <i>E. coli</i> prediction was comparatively less accurate and requires further improvement. These results indicate that although water quality datasets for the OWTS are relatively small, machine learning-based soft sensors can provide useful predictive estimates of off-line parameters and provide real-time monitoring capabilities that can be used to make adjustments to OWTS operations.</p>","PeriodicalId":29801,"journal":{"name":"ACS Environmental Au","volume":"3 5","pages":"308–318"},"PeriodicalIF":6.7000,"publicationDate":"2023-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.acs.org/doi/epdf/10.1021/acsenvironau.2c00072","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Environmental Au","FirstCategoryId":"1085","ListUrlMain":"https://pubs.acs.org/doi/10.1021/acsenvironau.2c00072","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ENVIRONMENTAL","Score":null,"Total":0}
引用次数: 1

Abstract

Developing advanced onsite wastewater treatment systems (OWTS) requires accurate and consistent water quality monitoring to evaluate treatment efficiency and ensure regulatory compliance. However, off-line parameters such as chemical oxygen demand (COD), total suspended solids (TSS), and Escherichia coli (E. coli) require sample collection and time-consuming laboratory analyses that do not provide real-time information of system performance or component failure. While real-time COD analyzers have emerged in recent years, they are not economically viable for onsite systems due to cost and chemical consumables. This study aimed to design and implement a real-time remote monitoring system for OWTS by developing several multi-input and single-output soft sensors. The soft sensor integrates data that can be obtained from well-established in-line sensors to accurately predict key water quality parameters, including COD, TSS, and E. coli concentrations. The temporal and spatial water quality data of an existing field-tested OWTS operated for almost two years (n = 56 data points) were used to evaluate the prediction performance of four machine learning algorithms. These algorithms, namely, partial least square regression (PLS), support vector regression (SVR), cubist regression (CUB), and quantile regression neural network (QRNN), were chosen as candidate algorithms for their prior application and effectiveness in wastewater treatment predictions. Water quality parameters that can be measured in-line, including turbidity, color, pH, NH4+, NO3, and electrical conductivity, were selected as model inputs for predicting COD, TSS, and E. coli. The results revealed that the trained SVR model provided a statistically significant prediction for COD with a mean absolute percentage error (MAPE) of 14.5% and R2 of 0.96. The CUB model provided the optimal predictive performance for TSS, with a MAPE of 24.8% and R2 of 0.99. None of the models were able to achieve optimal prediction results for E. coli; however, the CUB model performed the best with a MAPE of 71.4% and R2 of 0.22. Given the large fluctuation in the concentrations of COD, TSS, and E. coli within the OWTS wastewater dataset, the proposed soft sensor models adequately predicted COD and TSS, while E. coli prediction was comparatively less accurate and requires further improvement. These results indicate that although water quality datasets for the OWTS are relatively small, machine learning-based soft sensors can provide useful predictive estimates of off-line parameters and provide real-time monitoring capabilities that can be used to make adjustments to OWTS operations.

Abstract Image

利用机器学习算法开发用于预测现场污水处理系统水质的软传感器。
开发先进的现场废水处理系统(OWTS)需要准确和一致的水质监测,以评估处理效率并确保遵守法规。然而,离线参数,如化学需氧量(COD)、总悬浮固体(TSS)和大肠杆菌(E.coli),需要收集样本和耗时的实验室分析,无法提供系统性能或组件故障的实时信息。虽然近年来出现了实时COD分析仪,但由于成本和化学耗材的原因,它们在经济上不适用于现场系统。本研究旨在通过开发多个多输入和单输出软传感器来设计和实现OWTS的实时远程监控系统。该软传感器集成了可以从成熟的在线传感器获得的数据,以准确预测关键的水质参数,包括COD、TSS和大肠杆菌浓度。使用运行了近两年的现有现场测试OWTS的时间和空间水质数据(n=56个数据点)来评估四种机器学习算法的预测性能。这些算法,即偏最小二乘回归(PLS)、支持向量回归(SVR)、立体主义回归(CUB)和分位数回归神经网络(QRNN),因其在废水处理预测中的先前应用和有效性而被选为候选算法。选择可以在线测量的水质参数,包括浊度、颜色、pH、NH4+、NO3-和电导率,作为预测COD、TSS和大肠杆菌的模型输入。结果表明,训练的SVR模型对COD提供了具有统计学意义的预测,平均绝对百分比误差(MAPE)为14.5%,R2为0.96。CUB模型为TSS提供了最佳的预测性能,MAPE为24.8%,R2为0.99。没有一个模型能够实现对大肠杆菌的最佳预测结果;CUB模型表现最好,MAPE为71.4%,R2为0.22。考虑到OWTS废水数据集中COD、TSS和大肠杆菌的浓度波动较大,所提出的软传感器模型充分预测了COD和TSS,而大肠杆菌的预测相对不那么准确,需要进一步改进。这些结果表明,尽管OWTS的水质数据集相对较小,但基于机器学习的软传感器可以提供离线参数的有用预测估计,并提供可用于调整OWTS操作的实时监测能力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
ACS Environmental Au
ACS Environmental Au 环境科学-
CiteScore
7.10
自引率
0.00%
发文量
0
期刊介绍: ACS Environmental Au is an open access journal which publishes experimental research and theoretical results in all aspects of environmental science and technology both pure and applied. Short letters comprehensive articles reviews and perspectives are welcome in the following areas:Alternative EnergyAnthropogenic Impacts on Atmosphere Soil or WaterBiogeochemical CyclingBiomass or Wastes as ResourcesContaminants in Aquatic and Terrestrial EnvironmentsEnvironmental Data ScienceEcotoxicology and Public HealthEnergy and ClimateEnvironmental Modeling Processes and Measurement Methods and TechnologiesEnvironmental Nanotechnology and BiotechnologyGreen ChemistryGreen Manufacturing and EngineeringRisk assessment Regulatory Frameworks and Life-Cycle AssessmentsTreatment and Resource Recovery and Waste Management
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信