{"title":"通过基于深度学习的异构城市数据整合预测长期空气污染物浓度","authors":"Chao Chen, Hui Liu, Chengming Yu","doi":"10.1016/j.apr.2024.102282","DOIUrl":null,"url":null,"abstract":"<div><p>Accurate prediction of air pollutant concentrations, specifically concerning inhalable particulate matter such as PM<sub>2.5</sub>, is crucial for proactive measures to safeguard the well-being of urban residents. This paper focuses on addressing the perceptible latency effect for long-term PM<sub>2.5</sub> predictions produced by existing statistical models. We emphasize the importance of numerical computations in capturing substantial changes, and enhance prediction accuracy by integrating them with high-dimensional, diverse urban data. Specifically, our approach collects data from a global-to-meso-scale atmospheric dispersion model named System for Integrated modeLling of Atmospheric coMposition (SILAM), along with numerical weather forecasts, traffic congestion measurement, meteorological factors and static sources (road network and points of interest). We find that existing deep learning models are prone to overfitting when applied to complex datasets, primarily due to their uniform treatment of diverse data types as time series without adapting to the specific characteristics of each data type. To counter this, we propose a simple yet transferable deep learning architecture, focusing on the proper use of various data types. Additionally, our comparative analysis, through a case study in Shenzhen, China, shows our model not only enhances SILAM dispersion accuracy for 24h-ahead PM<sub>2.5</sub> forecasts by a significant 30.3%, but also mitigates the noticeable latency effect of existing models by 19.5%. Finally, an ablation study further validates the importance of each data source and module of our approach.</p></div>","PeriodicalId":8604,"journal":{"name":"Atmospheric Pollution Research","volume":"15 11","pages":"Article 102282"},"PeriodicalIF":3.9000,"publicationDate":"2024-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Predicting long-term air pollutant concentrations through deep learning-based integration of heterogeneous urban data\",\"authors\":\"Chao Chen, Hui Liu, Chengming Yu\",\"doi\":\"10.1016/j.apr.2024.102282\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Accurate prediction of air pollutant concentrations, specifically concerning inhalable particulate matter such as PM<sub>2.5</sub>, is crucial for proactive measures to safeguard the well-being of urban residents. This paper focuses on addressing the perceptible latency effect for long-term PM<sub>2.5</sub> predictions produced by existing statistical models. We emphasize the importance of numerical computations in capturing substantial changes, and enhance prediction accuracy by integrating them with high-dimensional, diverse urban data. Specifically, our approach collects data from a global-to-meso-scale atmospheric dispersion model named System for Integrated modeLling of Atmospheric coMposition (SILAM), along with numerical weather forecasts, traffic congestion measurement, meteorological factors and static sources (road network and points of interest). We find that existing deep learning models are prone to overfitting when applied to complex datasets, primarily due to their uniform treatment of diverse data types as time series without adapting to the specific characteristics of each data type. To counter this, we propose a simple yet transferable deep learning architecture, focusing on the proper use of various data types. Additionally, our comparative analysis, through a case study in Shenzhen, China, shows our model not only enhances SILAM dispersion accuracy for 24h-ahead PM<sub>2.5</sub> forecasts by a significant 30.3%, but also mitigates the noticeable latency effect of existing models by 19.5%. Finally, an ablation study further validates the importance of each data source and module of our approach.</p></div>\",\"PeriodicalId\":8604,\"journal\":{\"name\":\"Atmospheric Pollution Research\",\"volume\":\"15 11\",\"pages\":\"Article 102282\"},\"PeriodicalIF\":3.9000,\"publicationDate\":\"2024-08-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Atmospheric Pollution Research\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1309104224002472\",\"RegionNum\":3,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENVIRONMENTAL SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Atmospheric Pollution Research","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1309104224002472","RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
Predicting long-term air pollutant concentrations through deep learning-based integration of heterogeneous urban data
Accurate prediction of air pollutant concentrations, specifically concerning inhalable particulate matter such as PM2.5, is crucial for proactive measures to safeguard the well-being of urban residents. This paper focuses on addressing the perceptible latency effect for long-term PM2.5 predictions produced by existing statistical models. We emphasize the importance of numerical computations in capturing substantial changes, and enhance prediction accuracy by integrating them with high-dimensional, diverse urban data. Specifically, our approach collects data from a global-to-meso-scale atmospheric dispersion model named System for Integrated modeLling of Atmospheric coMposition (SILAM), along with numerical weather forecasts, traffic congestion measurement, meteorological factors and static sources (road network and points of interest). We find that existing deep learning models are prone to overfitting when applied to complex datasets, primarily due to their uniform treatment of diverse data types as time series without adapting to the specific characteristics of each data type. To counter this, we propose a simple yet transferable deep learning architecture, focusing on the proper use of various data types. Additionally, our comparative analysis, through a case study in Shenzhen, China, shows our model not only enhances SILAM dispersion accuracy for 24h-ahead PM2.5 forecasts by a significant 30.3%, but also mitigates the noticeable latency effect of existing models by 19.5%. Finally, an ablation study further validates the importance of each data source and module of our approach.
期刊介绍:
Atmospheric Pollution Research (APR) is an international journal designed for the publication of articles on air pollution. Papers should present novel experimental results, theory and modeling of air pollution on local, regional, or global scales. Areas covered are research on inorganic, organic, and persistent organic air pollutants, air quality monitoring, air quality management, atmospheric dispersion and transport, air-surface (soil, water, and vegetation) exchange of pollutants, dry and wet deposition, indoor air quality, exposure assessment, health effects, satellite measurements, natural emissions, atmospheric chemistry, greenhouse gases, and effects on climate change.