{"title":"Hybrid graph convolutional LSTM model for spatio-temporal air quality transfer learning","authors":"Sooraj Raj, Jim Smith, Enda Hayes","doi":"10.1007/s11869-025-01713-8","DOIUrl":null,"url":null,"abstract":"<div><p>The short-term air quality forecasting models serve as an early warning system for local agencies, aiding in preparing mitigation strategies against severe pollution episodes. This paper explores the application of Transfer Learning to enhance short-term air quality forecasting model accuracy when labelled data is limited or missing, as often occurs with newly installed monitoring stations or due to sensor malfunctions. These monitoring stations are typically installed in areas of high exposure, like roads or urban/industrial areas, due to recurrent peak episodes or to monitor background pollutant levels generally. Forecasts with greater reliability, even when there is limited historical data available due to the recent installation of the monitoring station for example, are expected to enable the swift implementation of proactive measures to prevent significant pollution episodes from happening. The proposed method leverages knowledge from spatially neighbouring air quality monitoring stations to achieve the multi-modal spatial-temporal transfer learning to the target station, exploring multivariate time series data available from neighbouring monitoring stations. This study employed historical air quality data from spatially adjacent monitoring stations identified in South Wales, UK. The study evaluates the predictive capabilities of four base models and their corresponding transfer learning variants for estimating NO<sub>2</sub> and PM<sub>10</sub> pollutant levels, which are the most difficult pollutants to meet objectives and limit values in the UK’s air quality strategy. The paper highlights the importance of capturing spatial patterns from different monitoring stations along with temporal trends when it comes to air quality prediction. Our experiments demonstrate that transfer learning models outperform models trained from scratch on air quality multivariate time series prediction problems in a low data environment. The proposed hybrid Graph Convolutional-LSTM model, making use of a novel Granger causality-based adjacency matrix for the new site, has significantly outperformed other baseline models in predicting pollutants, achieving notable improvements in prediction accuracy of approximately 8% for PM<sub>10</sub> and 7% for NO<sub>2</sub> values, as reflected in the RMSE values. It has also demonstrated the potential for data-efficient approaches in spatial transfer learning by reducing the need for large datasets by incorporating prior causal information.</p></div>","PeriodicalId":49109,"journal":{"name":"Air Quality Atmosphere and Health","volume":"18 5","pages":"1425 - 1445"},"PeriodicalIF":2.9000,"publicationDate":"2025-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s11869-025-01713-8.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Air Quality Atmosphere and Health","FirstCategoryId":"93","ListUrlMain":"https://link.springer.com/article/10.1007/s11869-025-01713-8","RegionNum":4,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
The short-term air quality forecasting models serve as an early warning system for local agencies, aiding in preparing mitigation strategies against severe pollution episodes. This paper explores the application of Transfer Learning to enhance short-term air quality forecasting model accuracy when labelled data is limited or missing, as often occurs with newly installed monitoring stations or due to sensor malfunctions. These monitoring stations are typically installed in areas of high exposure, like roads or urban/industrial areas, due to recurrent peak episodes or to monitor background pollutant levels generally. Forecasts with greater reliability, even when there is limited historical data available due to the recent installation of the monitoring station for example, are expected to enable the swift implementation of proactive measures to prevent significant pollution episodes from happening. The proposed method leverages knowledge from spatially neighbouring air quality monitoring stations to achieve the multi-modal spatial-temporal transfer learning to the target station, exploring multivariate time series data available from neighbouring monitoring stations. This study employed historical air quality data from spatially adjacent monitoring stations identified in South Wales, UK. The study evaluates the predictive capabilities of four base models and their corresponding transfer learning variants for estimating NO2 and PM10 pollutant levels, which are the most difficult pollutants to meet objectives and limit values in the UK’s air quality strategy. The paper highlights the importance of capturing spatial patterns from different monitoring stations along with temporal trends when it comes to air quality prediction. Our experiments demonstrate that transfer learning models outperform models trained from scratch on air quality multivariate time series prediction problems in a low data environment. The proposed hybrid Graph Convolutional-LSTM model, making use of a novel Granger causality-based adjacency matrix for the new site, has significantly outperformed other baseline models in predicting pollutants, achieving notable improvements in prediction accuracy of approximately 8% for PM10 and 7% for NO2 values, as reflected in the RMSE values. It has also demonstrated the potential for data-efficient approaches in spatial transfer learning by reducing the need for large datasets by incorporating prior causal information.
期刊介绍:
Air Quality, Atmosphere, and Health is a multidisciplinary journal which, by its very name, illustrates the broad range of work it publishes and which focuses on atmospheric consequences of human activities and their implications for human and ecological health.
It offers research papers, critical literature reviews and commentaries, as well as special issues devoted to topical subjects or themes.
International in scope, the journal presents papers that inform and stimulate a global readership, as the topic addressed are global in their import. Consequently, we do not encourage submission of papers involving local data that relate to local problems. Unless they demonstrate wide applicability, these are better submitted to national or regional journals.
Air Quality, Atmosphere & Health addresses such topics as acid precipitation; airborne particulate matter; air quality monitoring and management; exposure assessment; risk assessment; indoor air quality; atmospheric chemistry; atmospheric modeling and prediction; air pollution climatology; climate change and air quality; air pollution measurement; atmospheric impact assessment; forest-fire emissions; atmospheric science; greenhouse gases; health and ecological effects; clean air technology; regional and global change and satellite measurements.
This journal benefits a diverse audience of researchers, public health officials and policy makers addressing problems that call for solutions based in evidence from atmospheric and exposure assessment scientists, epidemiologists, and risk assessors. Publication in the journal affords the opportunity to reach beyond defined disciplinary niches to this broader readership.