Comparing the Performance of Machine Learning and Deep Learning Algorithms in Wastewater Treatment Process

Jaeil Kim, H. Lee, J. Jang, Yongtae Ahn, S. Ki, Hyun-Geoun Park
{"title":"Comparing the Performance of Machine Learning and Deep Learning Algorithms in Wastewater Treatment Process","authors":"Jaeil Kim, H. Lee, J. Jang, Yongtae Ahn, S. Ki, Hyun-Geoun Park","doi":"10.4491/ksee.2023.45.12.587","DOIUrl":null,"url":null,"abstract":"This study assessed the performance of single and modified algorithms based on machine learning and deep learning for wastewater treatment process. More specifically, this study adopted support vector machine (SVM), random forest (RF), and artificial neural network (ANN) for machine learning as well as long short-term memory (LSTM) for deep learning. The performance of these (single) algorithms were compared with that of modified ones processed through hyperparameter tuning, ensemble learning (only for machine learning), and multi-layer stacking (i.e., two layers of LSTM units). The daily effluent of wastewater treatment process observed between 2017 and 2022 in the Cheong-Ju National Industrial Complex was used as input to all tested algorithms, which was evaluated with respect to mean squared error. For the model performance evaluation, discharge and biochemical oxygen demand are selected as dependent variables out of nine measured parameters. Results showed that the performance of any machine learning algorithms was superior to their competitor LSTM. This is mainly attributed to a small amount of input data provided to the LSTM algorithm and unstable effluent wastewater characteristics. Meanwhile, hyperparameter tuning improved the performance of all tested algorithms. However, ensemble learning for machine learning and two-layer stacking for LSTM generally resulted in performance degradation as compared to that of single algorithms, regardless of dependent variables. Therefore, this calls for a careful design and evaluation of modified algorithms, specifically for model architecture and performance improvement processes.","PeriodicalId":16127,"journal":{"name":"Journal of Korean Society of Environmental Engineers","volume":" 1116","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Korean Society of Environmental Engineers","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4491/ksee.2023.45.12.587","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

This study assessed the performance of single and modified algorithms based on machine learning and deep learning for wastewater treatment process. More specifically, this study adopted support vector machine (SVM), random forest (RF), and artificial neural network (ANN) for machine learning as well as long short-term memory (LSTM) for deep learning. The performance of these (single) algorithms were compared with that of modified ones processed through hyperparameter tuning, ensemble learning (only for machine learning), and multi-layer stacking (i.e., two layers of LSTM units). The daily effluent of wastewater treatment process observed between 2017 and 2022 in the Cheong-Ju National Industrial Complex was used as input to all tested algorithms, which was evaluated with respect to mean squared error. For the model performance evaluation, discharge and biochemical oxygen demand are selected as dependent variables out of nine measured parameters. Results showed that the performance of any machine learning algorithms was superior to their competitor LSTM. This is mainly attributed to a small amount of input data provided to the LSTM algorithm and unstable effluent wastewater characteristics. Meanwhile, hyperparameter tuning improved the performance of all tested algorithms. However, ensemble learning for machine learning and two-layer stacking for LSTM generally resulted in performance degradation as compared to that of single algorithms, regardless of dependent variables. Therefore, this calls for a careful design and evaluation of modified algorithms, specifically for model architecture and performance improvement processes.
比较机器学习算法和深度学习算法在污水处理过程中的表现
本研究评估了基于机器学习和深度学习的单一算法和改进算法在污水处理过程中的性能。具体而言,本研究在机器学习方面采用了支持向量机(SVM)、随机森林(RF)和人工神经网络(ANN),在深度学习方面采用了长短期记忆(LSTM)。将这些(单一)算法的性能与通过超参数调整、集合学习(仅用于机器学习)和多层堆叠(即两层 LSTM 单元)处理的改进算法的性能进行了比较。所有测试算法均使用 2017 年至 2022 年期间在清州国家工业园区观测到的废水处理过程的日排放量作为输入,并根据均方误差对其进行评估。在模型性能评估中,从九个测量参数中选择了排放量和生化需氧量作为因变量。结果表明,任何机器学习算法的性能都优于其竞争对手 LSTM。这主要归因于提供给 LSTM 算法的输入数据量较小,以及出水废水特性不稳定。同时,超参数调整提高了所有测试算法的性能。然而,与单一算法相比,机器学习的集合学习和 LSTM 的双层堆叠算法(无论因变量如何)通常会导致性能下降。因此,这就要求对修改后的算法进行仔细的设计和评估,特别是在模型架构和性能改进过程中。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信