Qing Wei , Huijin Zhang , Ju Yang , Bin Niu , Zuxin Xu
{"title":"PM2.5 concentration prediction using a whale optimization algorithm based hybrid deep learning model in Beijing, China","authors":"Qing Wei , Huijin Zhang , Ju Yang , Bin Niu , Zuxin Xu","doi":"10.1016/j.envpol.2025.125953","DOIUrl":null,"url":null,"abstract":"<div><div>PM<sub>2.5</sub> is a significant global atmospheric pollutant impacting visibility, climate, and public health. Accurate prediction of PM<sub>2.5</sub> concentrations is critical for assessing air pollution risks and providing early warnings for effective management. This study proposes a novel hybrid machine learning model that combines the whale optimization algorithm (WOA) with a convolutional neural network (CNN), long short-term memory (LSTM), and an attention mechanism (AM) to predict daily PM<sub>2.5</sub> concentrations. Tested with meteorological and air pollution daily data from 2014 to 2018, the WOA-CNN-LSTM-AM model demonstrates substantial improvements. It achieves MAE, RMSE, MBE, and R<sup>2</sup> values of 14.29, 21.96, −0.23, and 0.93, respectively, showing a reduction in prediction errors by 39% compared to CNN and 34% compared to LSTM models. In the medium-term forecast, the accuracy of the hybrid model is 30%–54% over WOA-CNN-LSTM and 26%–39% over CNN-LSTM-AM. The R<sup>2</sup> value decreases by 2.5% from the 1-day to 5-day forecast, maintaining high accuracy. SHAP analysis reveals that NO<sub>2</sub> and CO are the primary drivers for PM<sub>2.5</sub> predictions. This study provides a reliable tool for short and medium-term PM<sub>2.5</sub> prediction and air pollution control.</div></div>","PeriodicalId":311,"journal":{"name":"Environmental Pollution","volume":"371 ","pages":"Article 125953"},"PeriodicalIF":7.6000,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental Pollution","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0269749125003264","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
PM2.5 concentration prediction using a whale optimization algorithm based hybrid deep learning model in Beijing, China
PM2.5 is a significant global atmospheric pollutant impacting visibility, climate, and public health. Accurate prediction of PM2.5 concentrations is critical for assessing air pollution risks and providing early warnings for effective management. This study proposes a novel hybrid machine learning model that combines the whale optimization algorithm (WOA) with a convolutional neural network (CNN), long short-term memory (LSTM), and an attention mechanism (AM) to predict daily PM2.5 concentrations. Tested with meteorological and air pollution daily data from 2014 to 2018, the WOA-CNN-LSTM-AM model demonstrates substantial improvements. It achieves MAE, RMSE, MBE, and R2 values of 14.29, 21.96, −0.23, and 0.93, respectively, showing a reduction in prediction errors by 39% compared to CNN and 34% compared to LSTM models. In the medium-term forecast, the accuracy of the hybrid model is 30%–54% over WOA-CNN-LSTM and 26%–39% over CNN-LSTM-AM. The R2 value decreases by 2.5% from the 1-day to 5-day forecast, maintaining high accuracy. SHAP analysis reveals that NO2 and CO are the primary drivers for PM2.5 predictions. This study provides a reliable tool for short and medium-term PM2.5 prediction and air pollution control.
期刊介绍:
Environmental Pollution is an international peer-reviewed journal that publishes high-quality research papers and review articles covering all aspects of environmental pollution and its impacts on ecosystems and human health.
Subject areas include, but are not limited to:
• Sources and occurrences of pollutants that are clearly defined and measured in environmental compartments, food and food-related items, and human bodies;
• Interlinks between contaminant exposure and biological, ecological, and human health effects, including those of climate change;
• Contaminants of emerging concerns (including but not limited to antibiotic resistant microorganisms or genes, microplastics/nanoplastics, electronic wastes, light, and noise) and/or their biological, ecological, or human health effects;
• Laboratory and field studies on the remediation/mitigation of environmental pollution via new techniques and with clear links to biological, ecological, or human health effects;
• Modeling of pollution processes, patterns, or trends that is of clear environmental and/or human health interest;
• New techniques that measure and examine environmental occurrences, transport, behavior, and effects of pollutants within the environment or the laboratory, provided that they can be clearly used to address problems within regional or global environmental compartments.