{"title":"改进的地表NO2检索:中国双层机器学习模型构建及时空特征分析(2018-2023)","authors":"Wei Wang, Bingqian Li, Biyan Chen","doi":"10.1016/j.jenvman.2025.125439","DOIUrl":null,"url":null,"abstract":"<div><div>As an important atmospheric pollutant causing serious harm to human health and the natural environment, monitoring of surface NO<sub>2</sub> (SNO<sub>2</sub>) level is of critical importance. However, the current SNO<sub>2</sub> retrieval models neglect to consider the influence of NO<sub>2</sub> vertical hierarchical structure on the process of converting the tropospheric NO<sub>2</sub> columns (XNO<sub>2</sub>) to the SNO<sub>2</sub> levels. Meanwhile, conventional machine learning models struggle to capture complex spatiotemporal relationships between SNO<sub>2</sub> and XNO<sub>2</sub>, which lead to the large differences between the current model results and the site-based measurements. To enhance the accuracy of SNO<sub>2</sub> level inversion, this study incorporated the NO<sub>2</sub> vertical stratification characteristics and its spatial-temporal variation mechanisms over a long time series. By leveraging the advanced Light Gradient Boosting Machine (LGBM) and Extremely Randomized Forests (ERF) models, a Double-Layer Machine Learning (DLML) framework was developed to estimate SNO<sub>2</sub> levels across China from 2018 to 2023. Based on the results of this study, the temporal and spatial variation patterns of SNO<sub>2</sub> levels across China, including key regions, were comprehensively analyzed. The results showed that: (1) Compared with the traditional model, the DLML model proposed in this study showed better performance, in which the R<sup>2</sup> of spatio-temporal cross-validation reached 0.87. This represented an improvement of about 10 % over previous models. At the same time, MAE and RMSE were reduced to about 4.24 μg/m<sup>3</sup> and 5.79 μg/m<sup>3</sup> respectively. (2) The retrieved SNO<sub>2</sub> levels in China mainly showed a decreasing trend from the central and eastern coastal areas to the surrounding areas, and the annual average concentration had reached the level of the World Health Organization (WHO) air quality guidelines. In terms of time, the retrieved SNO<sub>2</sub> levels showed a U-shaped variation, with the highest in winter, followed by autumn, spring, and summer, reaching the peak in January and December, and then reaching the valley in June–August. (3) The two abnormal events occurred in winter, indicating that the meteorological conditions in winter were the main reason for the change of SNO<sub>2</sub> in the air. Among them, the factors that cause the peak values of Wuhan and Yangtze River Delta may also be due to the high level of economic development, dense population activities, and frequent industrial activities in the two regions, resulting in their own SNO<sub>2</sub> emissions.</div></div>","PeriodicalId":356,"journal":{"name":"Journal of Environmental Management","volume":"384 ","pages":"Article 125439"},"PeriodicalIF":8.0000,"publicationDate":"2025-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Improved surface NO2 Retrieval: Double-layer machine learning model construction and spatio-temporal characterization analysis in China (2018–2023)\",\"authors\":\"Wei Wang, Bingqian Li, Biyan Chen\",\"doi\":\"10.1016/j.jenvman.2025.125439\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>As an important atmospheric pollutant causing serious harm to human health and the natural environment, monitoring of surface NO<sub>2</sub> (SNO<sub>2</sub>) level is of critical importance. However, the current SNO<sub>2</sub> retrieval models neglect to consider the influence of NO<sub>2</sub> vertical hierarchical structure on the process of converting the tropospheric NO<sub>2</sub> columns (XNO<sub>2</sub>) to the SNO<sub>2</sub> levels. Meanwhile, conventional machine learning models struggle to capture complex spatiotemporal relationships between SNO<sub>2</sub> and XNO<sub>2</sub>, which lead to the large differences between the current model results and the site-based measurements. To enhance the accuracy of SNO<sub>2</sub> level inversion, this study incorporated the NO<sub>2</sub> vertical stratification characteristics and its spatial-temporal variation mechanisms over a long time series. By leveraging the advanced Light Gradient Boosting Machine (LGBM) and Extremely Randomized Forests (ERF) models, a Double-Layer Machine Learning (DLML) framework was developed to estimate SNO<sub>2</sub> levels across China from 2018 to 2023. Based on the results of this study, the temporal and spatial variation patterns of SNO<sub>2</sub> levels across China, including key regions, were comprehensively analyzed. The results showed that: (1) Compared with the traditional model, the DLML model proposed in this study showed better performance, in which the R<sup>2</sup> of spatio-temporal cross-validation reached 0.87. This represented an improvement of about 10 % over previous models. At the same time, MAE and RMSE were reduced to about 4.24 μg/m<sup>3</sup> and 5.79 μg/m<sup>3</sup> respectively. (2) The retrieved SNO<sub>2</sub> levels in China mainly showed a decreasing trend from the central and eastern coastal areas to the surrounding areas, and the annual average concentration had reached the level of the World Health Organization (WHO) air quality guidelines. In terms of time, the retrieved SNO<sub>2</sub> levels showed a U-shaped variation, with the highest in winter, followed by autumn, spring, and summer, reaching the peak in January and December, and then reaching the valley in June–August. (3) The two abnormal events occurred in winter, indicating that the meteorological conditions in winter were the main reason for the change of SNO<sub>2</sub> in the air. Among them, the factors that cause the peak values of Wuhan and Yangtze River Delta may also be due to the high level of economic development, dense population activities, and frequent industrial activities in the two regions, resulting in their own SNO<sub>2</sub> emissions.</div></div>\",\"PeriodicalId\":356,\"journal\":{\"name\":\"Journal of Environmental Management\",\"volume\":\"384 \",\"pages\":\"Article 125439\"},\"PeriodicalIF\":8.0000,\"publicationDate\":\"2025-04-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Environmental Management\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S030147972501415X\",\"RegionNum\":2,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENVIRONMENTAL SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Environmental Management","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S030147972501415X","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
Improved surface NO2 Retrieval: Double-layer machine learning model construction and spatio-temporal characterization analysis in China (2018–2023)
As an important atmospheric pollutant causing serious harm to human health and the natural environment, monitoring of surface NO2 (SNO2) level is of critical importance. However, the current SNO2 retrieval models neglect to consider the influence of NO2 vertical hierarchical structure on the process of converting the tropospheric NO2 columns (XNO2) to the SNO2 levels. Meanwhile, conventional machine learning models struggle to capture complex spatiotemporal relationships between SNO2 and XNO2, which lead to the large differences between the current model results and the site-based measurements. To enhance the accuracy of SNO2 level inversion, this study incorporated the NO2 vertical stratification characteristics and its spatial-temporal variation mechanisms over a long time series. By leveraging the advanced Light Gradient Boosting Machine (LGBM) and Extremely Randomized Forests (ERF) models, a Double-Layer Machine Learning (DLML) framework was developed to estimate SNO2 levels across China from 2018 to 2023. Based on the results of this study, the temporal and spatial variation patterns of SNO2 levels across China, including key regions, were comprehensively analyzed. The results showed that: (1) Compared with the traditional model, the DLML model proposed in this study showed better performance, in which the R2 of spatio-temporal cross-validation reached 0.87. This represented an improvement of about 10 % over previous models. At the same time, MAE and RMSE were reduced to about 4.24 μg/m3 and 5.79 μg/m3 respectively. (2) The retrieved SNO2 levels in China mainly showed a decreasing trend from the central and eastern coastal areas to the surrounding areas, and the annual average concentration had reached the level of the World Health Organization (WHO) air quality guidelines. In terms of time, the retrieved SNO2 levels showed a U-shaped variation, with the highest in winter, followed by autumn, spring, and summer, reaching the peak in January and December, and then reaching the valley in June–August. (3) The two abnormal events occurred in winter, indicating that the meteorological conditions in winter were the main reason for the change of SNO2 in the air. Among them, the factors that cause the peak values of Wuhan and Yangtze River Delta may also be due to the high level of economic development, dense population activities, and frequent industrial activities in the two regions, resulting in their own SNO2 emissions.
期刊介绍:
The Journal of Environmental Management is a journal for the publication of peer reviewed, original research for all aspects of management and the managed use of the environment, both natural and man-made.Critical review articles are also welcome; submission of these is strongly encouraged.