{"title":"Integrating multi-dimensional graph attention networks and transformer architecture for predicting air pollution in subway stations","authors":"Dingya Chen, Hui Liu","doi":"10.1016/j.asoc.2025.113033","DOIUrl":null,"url":null,"abstract":"<div><div>Accurate prediction of PM<sub>2.5</sub> concentrations in subway stations is crucial for developing effective air pollution control strategies. However, existing methods struggle to accurately predict PM<sub>2.5</sub> concentrations due to the challenges of multi-step ahead forecasting, modeling long time series, capturing complex spatiotemporal correlations, and handling data quality issues such as missing values. This study proposes MAGICFormer, a novel hybrid end-to-end model for predicting PM<sub>2.5</sub> concentrations. The model comprises key components such as data preprocessing, a multi-dimensional graph attention network (md-GAT) module, as well as an Informer encoder and a Cross Decoder based on the Transformer architecture. The data preprocessing method improves data quality by addressing missing values and correcting anomalies. MAGICFormer integrates spatiotemporal correlations to predict PM<sub>2.5</sub> concentrations. The md-GAT module adaptively captures complex spatial relationships among subway stations across different dimensions, with its output serving as input to the Spatio Decoder. The Informer encoder processes long sequences and extracts temporal features, which are then passed to the Spatio Decoder and Temporal Decoder within the Cross Decoder for information fusion. The Cross Decoder aggregates the outputs of the Spatio and Temporal Decoders using a cross-attention mechanism, leveraging the interdependencies between graph-structured and time-series data to enhance prediction accuracy and improve model performance by effectively fusing spatial and temporal information. Experiments on Seoul subway stations show that MAGICFormer improves prediction accuracy by over 20 % compared to existing methods, demonstrating its effectiveness in long-term PM<sub>2.5</sub> forecasting. The proposed model offers a practical decision support tool for enhancing air quality management strategies in subway systems, particularly for long-term monitoring and control.</div></div>","PeriodicalId":50737,"journal":{"name":"Applied Soft Computing","volume":"178 ","pages":"Article 113033"},"PeriodicalIF":7.2000,"publicationDate":"2025-05-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Soft Computing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1568494625003448","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Accurate prediction of PM2.5 concentrations in subway stations is crucial for developing effective air pollution control strategies. However, existing methods struggle to accurately predict PM2.5 concentrations due to the challenges of multi-step ahead forecasting, modeling long time series, capturing complex spatiotemporal correlations, and handling data quality issues such as missing values. This study proposes MAGICFormer, a novel hybrid end-to-end model for predicting PM2.5 concentrations. The model comprises key components such as data preprocessing, a multi-dimensional graph attention network (md-GAT) module, as well as an Informer encoder and a Cross Decoder based on the Transformer architecture. The data preprocessing method improves data quality by addressing missing values and correcting anomalies. MAGICFormer integrates spatiotemporal correlations to predict PM2.5 concentrations. The md-GAT module adaptively captures complex spatial relationships among subway stations across different dimensions, with its output serving as input to the Spatio Decoder. The Informer encoder processes long sequences and extracts temporal features, which are then passed to the Spatio Decoder and Temporal Decoder within the Cross Decoder for information fusion. The Cross Decoder aggregates the outputs of the Spatio and Temporal Decoders using a cross-attention mechanism, leveraging the interdependencies between graph-structured and time-series data to enhance prediction accuracy and improve model performance by effectively fusing spatial and temporal information. Experiments on Seoul subway stations show that MAGICFormer improves prediction accuracy by over 20 % compared to existing methods, demonstrating its effectiveness in long-term PM2.5 forecasting. The proposed model offers a practical decision support tool for enhancing air quality management strategies in subway systems, particularly for long-term monitoring and control.
期刊介绍:
Applied Soft Computing is an international journal promoting an integrated view of soft computing to solve real life problems.The focus is to publish the highest quality research in application and convergence of the areas of Fuzzy Logic, Neural Networks, Evolutionary Computing, Rough Sets and other similar techniques to address real world complexities.
Applied Soft Computing is a rolling publication: articles are published as soon as the editor-in-chief has accepted them. Therefore, the web site will continuously be updated with new articles and the publication time will be short.