Estimation of near-surface NO2 based on TROPOMI and provincial control stations' data using machine learning

F. Deng, Yijian Chen, Lanhui Li, Cong Wang, Luwei Cao, Chaofeng Peng
{"title":"Estimation of near-surface NO2 based on TROPOMI and provincial control stations' data using machine learning","authors":"F. Deng, Yijian Chen, Lanhui Li, Cong Wang, Luwei Cao, Chaofeng Peng","doi":"10.1109/WCMEIM56910.2022.10021549","DOIUrl":null,"url":null,"abstract":"Near-surface nitrogen dioxide NO2 concentration is an essential indicator for ambient air quality monitoring. In this study, the extreme gradient boosting (XGBoost) algorithm and random forest algorithm in machine learning are used to combine the TROPOspheric Monitoring Instrument(TROPOMI) high-resolution remote sensing images, meteorological and other auxiliary data with ground-level NO2 concentration monitoring data (including national and provincial control stations) to construct a dataset of estimation samples and conduct research on estimating the near-surface NO2 concentration on a grid with a spatial precision of 0.05° in Sichuan Province. According to the ten-fold cross-validation results of the test and training sets, the XGBoost model has better accuracy and generalization performance (R2=0.875, RMSE=4.774 ug/m3), In addition, SHapley Additive exPlanation(SHAP) was employed after its development, which showed that TROPOMI satellite data contributed the most to the near-surface NO2 estimation. By contrasting the results with the data set using only national control stations, we can see that after including the atmospheric monitoring data from provincial control stations, the estimated NO2 concentration near the ground is more consistent with the data distribution of ground monitoring stations. Moreover, the spatial distribution of concentration is more continuous and homogeneous, providing essential support for local governments to regulate the atmospheric environment precisely.","PeriodicalId":202270,"journal":{"name":"2022 5th World Conference on Mechanical Engineering and Intelligent Manufacturing (WCMEIM)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2022-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 5th World Conference on Mechanical Engineering and Intelligent Manufacturing (WCMEIM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WCMEIM56910.2022.10021549","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Near-surface nitrogen dioxide NO2 concentration is an essential indicator for ambient air quality monitoring. In this study, the extreme gradient boosting (XGBoost) algorithm and random forest algorithm in machine learning are used to combine the TROPOspheric Monitoring Instrument(TROPOMI) high-resolution remote sensing images, meteorological and other auxiliary data with ground-level NO2 concentration monitoring data (including national and provincial control stations) to construct a dataset of estimation samples and conduct research on estimating the near-surface NO2 concentration on a grid with a spatial precision of 0.05° in Sichuan Province. According to the ten-fold cross-validation results of the test and training sets, the XGBoost model has better accuracy and generalization performance (R2=0.875, RMSE=4.774 ug/m3), In addition, SHapley Additive exPlanation(SHAP) was employed after its development, which showed that TROPOMI satellite data contributed the most to the near-surface NO2 estimation. By contrasting the results with the data set using only national control stations, we can see that after including the atmospheric monitoring data from provincial control stations, the estimated NO2 concentration near the ground is more consistent with the data distribution of ground monitoring stations. Moreover, the spatial distribution of concentration is more continuous and homogeneous, providing essential support for local governments to regulate the atmospheric environment precisely.
基于TROPOMI和省级测控站数据的近地表NO2机器学习估算
近地表二氧化氮NO2浓度是环境空气质量监测的重要指标。本研究利用机器学习中的极限梯度增强(XGBoost)算法和随机森林算法,结合对流层监测仪器(TROPOMI)高分辨率遥感影像,利用气象等辅助资料,结合地面NO2浓度监测数据(包括国家级和省级控制站),构建估算样本数据集,开展空间精度为0.05°的网格化四川省近地表NO2浓度估算研究。测试集和训练集的十倍交叉验证结果表明,XGBoost模型具有更好的精度和泛化性能(R2=0.875, RMSE=4.774 ug/m3),并且在开发后采用SHapley Additive exPlanation(SHAP),表明TROPOMI卫星数据对近地表NO2估算贡献最大。通过与仅使用国家控制站的数据对比,我们可以看到,在纳入省级控制站的大气监测数据后,近地面NO2浓度估计值与地面监测站的数据分布更加一致。浓度的空间分布更具有连续性和均匀性,为地方政府对大气环境进行精准调控提供了必要的支撑。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信