{"title":"AirGPT: Spatio-temporal large language model for air quality prediction","authors":"Zhonghang Li , Tianyi Chen , Yong Xu","doi":"10.1016/j.inffus.2025.103730","DOIUrl":null,"url":null,"abstract":"<div><div>Air pollution poses a critical threat to public health, ecosystems, and climate stability worldwide. Accurate air quality prediction is essential for informed policy-making, health risk mitigation, and environmental management, enabling proactive responses to pollution events and long-term planning for sustainable urban development. Despite advances, deep learning models for air quality prediction still face three critical challenges: heavy reliance on abundant historical data, difficulty in effectively fusing diverse information sources, and a lack of interpretability. To address these issues, we propose AirGPT, a large language model framework for air quality prediction. AirGPT integrates a specialized spatio-temporal encoder with a novel spatio-temporal instruction-tuning paradigm, enabling it to efficiently model complex spatio-temporal dependencies and perform sophisticated data fusion. Furthermore, our Chain-of-Thought distillation mechanism allows the model to externalize its predictive reasoning in a transparent, human-readable format, thereby enhancing interpretability. Experimental results demonstrate that AirGPT achieves state-of-the-art accuracy on air quality prediction tasks, particularly in data-scarce and zero-shot scenarios. By integrating interpretable reasoning with transparent predictive outputs, AirGPT provides a robust and reliable framework to support informed environmental decision-making. Our source code is available at: <span><span>https://anonymous.4open.science/r/AirGPT-6ACC</span><svg><path></path></svg></span></div></div>","PeriodicalId":50367,"journal":{"name":"Information Fusion","volume":"127 ","pages":"Article 103730"},"PeriodicalIF":15.5000,"publicationDate":"2025-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Fusion","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1566253525007924","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Air pollution poses a critical threat to public health, ecosystems, and climate stability worldwide. Accurate air quality prediction is essential for informed policy-making, health risk mitigation, and environmental management, enabling proactive responses to pollution events and long-term planning for sustainable urban development. Despite advances, deep learning models for air quality prediction still face three critical challenges: heavy reliance on abundant historical data, difficulty in effectively fusing diverse information sources, and a lack of interpretability. To address these issues, we propose AirGPT, a large language model framework for air quality prediction. AirGPT integrates a specialized spatio-temporal encoder with a novel spatio-temporal instruction-tuning paradigm, enabling it to efficiently model complex spatio-temporal dependencies and perform sophisticated data fusion. Furthermore, our Chain-of-Thought distillation mechanism allows the model to externalize its predictive reasoning in a transparent, human-readable format, thereby enhancing interpretability. Experimental results demonstrate that AirGPT achieves state-of-the-art accuracy on air quality prediction tasks, particularly in data-scarce and zero-shot scenarios. By integrating interpretable reasoning with transparent predictive outputs, AirGPT provides a robust and reliable framework to support informed environmental decision-making. Our source code is available at: https://anonymous.4open.science/r/AirGPT-6ACC
期刊介绍:
Information Fusion serves as a central platform for showcasing advancements in multi-sensor, multi-source, multi-process information fusion, fostering collaboration among diverse disciplines driving its progress. It is the leading outlet for sharing research and development in this field, focusing on architectures, algorithms, and applications. Papers dealing with fundamental theoretical analyses as well as those demonstrating their application to real-world problems will be welcome.