Qunlan Wei , Weiwei Song , Bolan Dai , Hongling Wu , Xiaoqing Zuo , Jinxia Wang , Jianglong Chen , Jiahao Li , Siyuan Li , Zhiyu Chen
{"title":"Spatiotemporal estimation of surface NO2 concentrations in the Pearl River Delta region based on TROPOMI data and machine learning","authors":"Qunlan Wei , Weiwei Song , Bolan Dai , Hongling Wu , Xiaoqing Zuo , Jinxia Wang , Jianglong Chen , Jiahao Li , Siyuan Li , Zhiyu Chen","doi":"10.1016/j.apr.2024.102353","DOIUrl":null,"url":null,"abstract":"<div><div>Nitrogen dioxide (<span><math><msub><mrow><mtext>NO</mtext></mrow><mrow><mn>2</mn></mrow></msub></math></span>) is a major air pollutant, and its concentration data are crucial for the study of air pollution and its impact on the environment. Although satellite data provide an effective method for estimating surface concentrations on a large scale through integrated modeling, the estimation of surface <span><math><msub><mrow><mtext>NO</mtext></mrow><mrow><mn>2</mn></mrow></msub></math></span> concentrations is hampered by the substantial amount of missing satellite data. This restricts in-depth studies of surface <span><math><msub><mrow><mtext>NO</mtext></mrow><mrow><mn>2</mn></mrow></msub></math></span> pollution. This study aims to reconstruct the missing data on tropospheric <span><math><msub><mrow><mtext>NO</mtext></mrow><mrow><mn>2</mn></mrow></msub></math></span> vertical column density from the TROPOspheric Monitoring Instrument (TROPOMI <span><math><msub><mrow><mtext>NO</mtext></mrow><mrow><mn>2</mn></mrow></msub></math></span>). Subsequently, the reconstructed TROPOMI <span><math><msub><mrow><mtext>NO</mtext></mrow><mrow><mn>2</mn></mrow></msub></math></span> data and other predictor variables were utilized to estimate the daily surface <span><math><msub><mrow><mtext>NO</mtext></mrow><mrow><mn>2</mn></mrow></msub></math></span> concentrations at a 1 km resolution for the Pearl River Delta (PRD) region. The TROPOMI <span><math><msub><mrow><mtext>NO</mtext></mrow><mrow><mn>2</mn></mrow></msub></math></span> reconstruction models and the surface <span><math><msub><mrow><mtext>NO</mtext></mrow><mrow><mn>2</mn></mrow></msub></math></span> estimation model were both developed using the Extreme Gradient Boosting (XGBoost) algorithm. Additionally, comparative experiments were conducted between the XGBoost model and other traditional machine learning models, and the performances of the XGBoost model were evaluated through 10-fold cross-validation (CV) sample-based and site-based evaluations. The results indicate that the sample-based and site-based CV R<span><math><msup><mrow></mrow><mrow><mn>2</mn></mrow></msup></math></span> values were 0.873 and 0.709, respectively. The feature importance scores indicate that TROPOMI <span><math><msub><mrow><mtext>NO</mtext></mrow><mrow><mn>2</mn></mrow></msub></math></span> was the most significant variable contributing to the estimation model. This indicates that the reconstruction of TROPOMI <span><math><msub><mrow><mtext>NO</mtext></mrow><mrow><mn>2</mn></mrow></msub></math></span> data and the development of an XGBoost model are suitable for the spatiotemporal estimation of surface <span><math><msub><mrow><mtext>NO</mtext></mrow><mrow><mn>2</mn></mrow></msub></math></span> concentrations in the PRD region, effectively reflecting the spatiotemporal distribution and evolution of surface <span><math><msub><mrow><mtext>NO</mtext></mrow><mrow><mn>2</mn></mrow></msub></math></span> concentrations in the area.</div></div>","PeriodicalId":8604,"journal":{"name":"Atmospheric Pollution Research","volume":"16 3","pages":"Article 102353"},"PeriodicalIF":3.9000,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Atmospheric Pollution Research","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1309104224003180","RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Nitrogen dioxide () is a major air pollutant, and its concentration data are crucial for the study of air pollution and its impact on the environment. Although satellite data provide an effective method for estimating surface concentrations on a large scale through integrated modeling, the estimation of surface concentrations is hampered by the substantial amount of missing satellite data. This restricts in-depth studies of surface pollution. This study aims to reconstruct the missing data on tropospheric vertical column density from the TROPOspheric Monitoring Instrument (TROPOMI ). Subsequently, the reconstructed TROPOMI data and other predictor variables were utilized to estimate the daily surface concentrations at a 1 km resolution for the Pearl River Delta (PRD) region. The TROPOMI reconstruction models and the surface estimation model were both developed using the Extreme Gradient Boosting (XGBoost) algorithm. Additionally, comparative experiments were conducted between the XGBoost model and other traditional machine learning models, and the performances of the XGBoost model were evaluated through 10-fold cross-validation (CV) sample-based and site-based evaluations. The results indicate that the sample-based and site-based CV R values were 0.873 and 0.709, respectively. The feature importance scores indicate that TROPOMI was the most significant variable contributing to the estimation model. This indicates that the reconstruction of TROPOMI data and the development of an XGBoost model are suitable for the spatiotemporal estimation of surface concentrations in the PRD region, effectively reflecting the spatiotemporal distribution and evolution of surface concentrations in the area.
期刊介绍:
Atmospheric Pollution Research (APR) is an international journal designed for the publication of articles on air pollution. Papers should present novel experimental results, theory and modeling of air pollution on local, regional, or global scales. Areas covered are research on inorganic, organic, and persistent organic air pollutants, air quality monitoring, air quality management, atmospheric dispersion and transport, air-surface (soil, water, and vegetation) exchange of pollutants, dry and wet deposition, indoor air quality, exposure assessment, health effects, satellite measurements, natural emissions, atmospheric chemistry, greenhouse gases, and effects on climate change.