Comparing the Performance of Machine Learning and Deep Learning Algorithms in Wastewater Treatment Process

Journal of Korean Society of Environmental Engineers Pub Date : 2023-12-31 DOI:10.4491/ksee.2023.45.12.587

Jaeil Kim, H. Lee, J. Jang, Yongtae Ahn, S. Ki, Hyun-Geoun Park

{"title":"Comparing the Performance of Machine Learning and Deep Learning Algorithms in Wastewater Treatment Process","authors":"Jaeil Kim, H. Lee, J. Jang, Yongtae Ahn, S. Ki, Hyun-Geoun Park","doi":"10.4491/ksee.2023.45.12.587","DOIUrl":null,"url":null,"abstract":"This study assessed the performance of single and modified algorithms based on machine learning and deep learning for wastewater treatment process. More specifically, this study adopted support vector machine (SVM), random forest (RF), and artificial neural network (ANN) for machine learning as well as long short-term memory (LSTM) for deep learning. The performance of these (single) algorithms were compared with that of modified ones processed through hyperparameter tuning, ensemble learning (only for machine learning), and multi-layer stacking (i.e., two layers of LSTM units). The daily effluent of wastewater treatment process observed between 2017 and 2022 in the Cheong-Ju National Industrial Complex was used as input to all tested algorithms, which was evaluated with respect to mean squared error. For the model performance evaluation, discharge and biochemical oxygen demand are selected as dependent variables out of nine measured parameters. Results showed that the performance of any machine learning algorithms was superior to their competitor LSTM. This is mainly attributed to a small amount of input data provided to the LSTM algorithm and unstable effluent wastewater characteristics. Meanwhile, hyperparameter tuning improved the performance of all tested algorithms. However, ensemble learning for machine learning and two-layer stacking for LSTM generally resulted in performance degradation as compared to that of single algorithms, regardless of dependent variables. Therefore, this calls for a careful design and evaluation of modified algorithms, specifically for model architecture and performance improvement processes.","PeriodicalId":16127,"journal":{"name":"Journal of Korean Society of Environmental Engineers","volume":" 1116","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Korean Society of Environmental Engineers","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4491/ksee.2023.45.12.587","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

This study assessed the performance of single and modified algorithms based on machine learning and deep learning for wastewater treatment process. More specifically, this study adopted support vector machine (SVM), random forest (RF), and artificial neural network (ANN) for machine learning as well as long short-term memory (LSTM) for deep learning. The performance of these (single) algorithms were compared with that of modified ones processed through hyperparameter tuning, ensemble learning (only for machine learning), and multi-layer stacking (i.e., two layers of LSTM units). The daily effluent of wastewater treatment process observed between 2017 and 2022 in the Cheong-Ju National Industrial Complex was used as input to all tested algorithms, which was evaluated with respect to mean squared error. For the model performance evaluation, discharge and biochemical oxygen demand are selected as dependent variables out of nine measured parameters. Results showed that the performance of any machine learning algorithms was superior to their competitor LSTM. This is mainly attributed to a small amount of input data provided to the LSTM algorithm and unstable effluent wastewater characteristics. Meanwhile, hyperparameter tuning improved the performance of all tested algorithms. However, ensemble learning for machine learning and two-layer stacking for LSTM generally resulted in performance degradation as compared to that of single algorithms, regardless of dependent variables. Therefore, this calls for a careful design and evaluation of modified algorithms, specifically for model architecture and performance improvement processes.

查看原文本刊更多论文

比较机器学习算法和深度学习算法在污水处理过程中的表现

本研究评估了基于机器学习和深度学习的单一算法和改进算法在污水处理过程中的性能。具体而言，本研究在机器学习方面采用了支持向量机（SVM）、随机森林（RF）和人工神经网络（ANN），在深度学习方面采用了长短期记忆（LSTM）。将这些（单一）算法的性能与通过超参数调整、集合学习（仅用于机器学习）和多层堆叠（即两层 LSTM 单元）处理的改进算法的性能进行了比较。所有测试算法均使用 2017 年至 2022 年期间在清州国家工业园区观测到的废水处理过程的日排放量作为输入，并根据均方误差对其进行评估。在模型性能评估中，从九个测量参数中选择了排放量和生化需氧量作为因变量。结果表明，任何机器学习算法的性能都优于其竞争对手 LSTM。这主要归因于提供给 LSTM 算法的输入数据量较小，以及出水废水特性不稳定。同时，超参数调整提高了所有测试算法的性能。然而，与单一算法相比，机器学习的集合学习和 LSTM 的双层堆叠算法（无论因变量如何）通常会导致性能下降。因此，这就要求对修改后的算法进行仔细的设计和评估，特别是在模型架构和性能改进过程中。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Korean Society of Environmental Engineers

自引率

0.00%

发文量