{"title":"A comparative analysis of hyperparameter optimization using LSTM-based deep learning models for urban air quality predictions","authors":"Beytullah Eren , Caner Erden , Ayşegül Atalı , Serkan Ozdemir","doi":"10.1016/j.asej.2025.103786","DOIUrl":null,"url":null,"abstract":"<div><div>Air pollution poses significant threats to human health and the environment, necessitating accurate prediction models for effective management and mitigation strategies. This study presents a comprehensive analysis of hyperparameter optimization techniques for Long Short-Term Memory (LSTM) based deep learning models in urban air quality forecasting. We focus on predicting concentrations of four key pollutants: carbon monoxide (CO), nitrogen oxides (NO<sub>X</sub>), nitrogen dioxide (NO<sub>2</sub>), and particulate matter (PM<sub>10</sub>). The study employs and compares three prominent hyperparameter optimization methods: Random Search, Bayesian Optimization, and Hyperband. Using air quality data from Sakarya, Turkey, collected between January 2020 and September 2022, we first addressed missing data through comparative analysis of mean imputation and k-Nearest Neighbors (kNN) imputation methods. Our results demonstrate that kNN imputation generally outperforms mean imputation, except for NO<sub>X</sub> predictions. The hyperparameter-optimized LSTM models consistently outperformed baseline models across all pollutants. Notably, the Hyperband Search algorithm excelled in NO<sub>X</sub> prediction, while Bayesian Optimization showed superior performance for other pollutants. Our analysis also revealed temporal trends in pollutant concentrations during the COVID-19 pandemic, including significant reductions in PM<sub>10</sub> and CO levels. This study contributes to AI-driven environmental monitoring by comparing hyperparameter optimization techniques in urban air quality modeling. The improved prediction accuracy offered by our optimized models has significant implications for public health protection, environmental policymaking, and smart city initiatives. Our findings underscore the importance of tailored optimization approaches for different pollutants and highlight the potential of advanced machine learning techniques in addressing environmental challenges.</div></div>","PeriodicalId":48648,"journal":{"name":"Ain Shams Engineering Journal","volume":"16 12","pages":"Article 103786"},"PeriodicalIF":5.9000,"publicationDate":"2025-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ain Shams Engineering Journal","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2090447925005271","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
Air pollution poses significant threats to human health and the environment, necessitating accurate prediction models for effective management and mitigation strategies. This study presents a comprehensive analysis of hyperparameter optimization techniques for Long Short-Term Memory (LSTM) based deep learning models in urban air quality forecasting. We focus on predicting concentrations of four key pollutants: carbon monoxide (CO), nitrogen oxides (NOX), nitrogen dioxide (NO2), and particulate matter (PM10). The study employs and compares three prominent hyperparameter optimization methods: Random Search, Bayesian Optimization, and Hyperband. Using air quality data from Sakarya, Turkey, collected between January 2020 and September 2022, we first addressed missing data through comparative analysis of mean imputation and k-Nearest Neighbors (kNN) imputation methods. Our results demonstrate that kNN imputation generally outperforms mean imputation, except for NOX predictions. The hyperparameter-optimized LSTM models consistently outperformed baseline models across all pollutants. Notably, the Hyperband Search algorithm excelled in NOX prediction, while Bayesian Optimization showed superior performance for other pollutants. Our analysis also revealed temporal trends in pollutant concentrations during the COVID-19 pandemic, including significant reductions in PM10 and CO levels. This study contributes to AI-driven environmental monitoring by comparing hyperparameter optimization techniques in urban air quality modeling. The improved prediction accuracy offered by our optimized models has significant implications for public health protection, environmental policymaking, and smart city initiatives. Our findings underscore the importance of tailored optimization approaches for different pollutants and highlight the potential of advanced machine learning techniques in addressing environmental challenges.
期刊介绍:
in Shams Engineering Journal is an international journal devoted to publication of peer reviewed original high-quality research papers and review papers in both traditional topics and those of emerging science and technology. Areas of both theoretical and fundamental interest as well as those concerning industrial applications, emerging instrumental techniques and those which have some practical application to an aspect of human endeavor, such as the preservation of the environment, health, waste disposal are welcome. The overall focus is on original and rigorous scientific research results which have generic significance.
Ain Shams Engineering Journal focuses upon aspects of mechanical engineering, electrical engineering, civil engineering, chemical engineering, petroleum engineering, environmental engineering, architectural and urban planning engineering. Papers in which knowledge from other disciplines is integrated with engineering are especially welcome like nanotechnology, material sciences, and computational methods as well as applied basic sciences: engineering mathematics, physics and chemistry.