Xingtian Chen , Yuhang Zhang , Kai Cao , Dongxing Li , Qizhong Wu
{"title":"Improving PM2.5 simulations using LSTM: a study on spatiotemporal generalization","authors":"Xingtian Chen , Yuhang Zhang , Kai Cao , Dongxing Li , Qizhong Wu","doi":"10.1016/j.apr.2025.102647","DOIUrl":null,"url":null,"abstract":"<div><div>Deep learning models, particularly long short-term memory (LSTM) networks, have shown strong potential for bias correction in air quality simulations. Although LSTM-based models have demonstrated success in improving PM<sub>2.5</sub> simulations during specific periods such as winter months or heavy pollution events, their ability to generalize across varying temporal intervals and geographical locations remains underexplored. This study systematically investigates the spatiotemporal generalization capabilities of LSTM-based (LSTM<sub>Local</sub>, LSTM<sub>Regional</sub>, LSTM<sub>Regional-idx</sub>, LSTM<sub>Regional-sub</sub>) models across 12 monthly intervals and 34 monitoring sites in Beijing. Benchmark comparisons with alternative deep learning models (RNN, GRU, BPNN, CNN) demonstrate the superiority of LSTM-based model in improving predictive performance. Seasonal analysis reveals that the LSTM<sub>Local</sub> and LSTM<sub>Regional-sub</sub> model achieved modest gains during summer half-year (May to October) but achieved significant improvements during winter half-year (November to April), with average RMSE reductions of −5.89 % (November), −17.40 % (December), −6.37 % (January), 0.36 % (February), −5.87 % (March) and −1.57 % (April). Spatially, urban sites show moderate gains, but suburban sites exhibit greater improvement, with average RMSE of −5.14 % (Center), −2.26 % (South-East), −7.05 % (North-East), −8.39 % (South-West) and −12.37 % (North-West). These findings highlight the robust spatiotemporal generalization of LSTM-based models and support their applicability for real-time bias correction, long-term forecasting, and air quality dataset enhancement at fine spatial and temporal resolutions.</div></div>","PeriodicalId":8604,"journal":{"name":"Atmospheric Pollution Research","volume":"16 11","pages":"Article 102647"},"PeriodicalIF":3.9000,"publicationDate":"2025-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Atmospheric Pollution Research","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1309104225002491","RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Deep learning models, particularly long short-term memory (LSTM) networks, have shown strong potential for bias correction in air quality simulations. Although LSTM-based models have demonstrated success in improving PM2.5 simulations during specific periods such as winter months or heavy pollution events, their ability to generalize across varying temporal intervals and geographical locations remains underexplored. This study systematically investigates the spatiotemporal generalization capabilities of LSTM-based (LSTMLocal, LSTMRegional, LSTMRegional-idx, LSTMRegional-sub) models across 12 monthly intervals and 34 monitoring sites in Beijing. Benchmark comparisons with alternative deep learning models (RNN, GRU, BPNN, CNN) demonstrate the superiority of LSTM-based model in improving predictive performance. Seasonal analysis reveals that the LSTMLocal and LSTMRegional-sub model achieved modest gains during summer half-year (May to October) but achieved significant improvements during winter half-year (November to April), with average RMSE reductions of −5.89 % (November), −17.40 % (December), −6.37 % (January), 0.36 % (February), −5.87 % (March) and −1.57 % (April). Spatially, urban sites show moderate gains, but suburban sites exhibit greater improvement, with average RMSE of −5.14 % (Center), −2.26 % (South-East), −7.05 % (North-East), −8.39 % (South-West) and −12.37 % (North-West). These findings highlight the robust spatiotemporal generalization of LSTM-based models and support their applicability for real-time bias correction, long-term forecasting, and air quality dataset enhancement at fine spatial and temporal resolutions.
期刊介绍:
Atmospheric Pollution Research (APR) is an international journal designed for the publication of articles on air pollution. Papers should present novel experimental results, theory and modeling of air pollution on local, regional, or global scales. Areas covered are research on inorganic, organic, and persistent organic air pollutants, air quality monitoring, air quality management, atmospheric dispersion and transport, air-surface (soil, water, and vegetation) exchange of pollutants, dry and wet deposition, indoor air quality, exposure assessment, health effects, satellite measurements, natural emissions, atmospheric chemistry, greenhouse gases, and effects on climate change.