{"title":"[Daily NO<sub>2</sub> Simulation Research Based on Automatic Machine Learning Ensemble Models].","authors":"Kai-Kai Lu, Jing Li, De-Ren Liu, Fa-Zhao Xu, Yu-Na Zhang, Shi-Xing Zhu","doi":"10.13227/j.hjkx.202311087","DOIUrl":null,"url":null,"abstract":"<p><p>To understand the spatial distribution of NO<sub>2</sub> near the surface, we utilized measured data from NO<sub>2</sub> monitoring stations and combined it with column concentration data from the Tropospheric Monitoring Instrument (TROPOMI), taking the Yangtze River Delta region as the study area. We considered the impact of factors such as population, elevation, and meteorological conditions on NO<sub>2</sub> levels. We used automated machine learning to select five machine-learning algorithms with high simulation accuracy, namely ET, RF, XGBoost, LightGBM, and Catboost, and then integrated these five algorithms using the Stacking model to simulate the daily NO<sub>2</sub> concentration in the Yangtze River Delta region from March 2020 to February 2021. The results indicated that the RMAE and MAE values of the Stacking ensemble model were 7.078 and 5.270, respectively, which outperformed the single algorithms of ET, RF, XGBoost, LightGBM, and Catboost. The spatial distribution of high NO<sub>2</sub> concentrations in the Yangtze River Delta region, consisting of three provinces and one municipality, exhibited a U-shaped pattern with the convergence point located at the intersection of the three provinces, extending towards the southwest. Notably, urban pollution was particularly significant in the urban agglomerations centered around Shanghai, Hangzhou, Nanjing, and Hefei. There were 27 cities that exceeded the national standard daily limit. Changzhou was the city with the most serious NO<sub>2</sub> pollution, with the NO<sub>2</sub> concentration exceeding the standard for 14 d, followed by Shanghai, with 13 d. In terms of seasonal variation, the order of severity was as follows: winter, autumn, spring, and summer, with the least NO<sub>2</sub> pollution occurring on July 9th during the summer, and the most severe NO<sub>2</sub> pollution was observed on December 23rd during the winter.</p>","PeriodicalId":35937,"journal":{"name":"Huanjing Kexue/Environmental Science","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Huanjing Kexue/Environmental Science","FirstCategoryId":"1087","ListUrlMain":"https://doi.org/10.13227/j.hjkx.202311087","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Environmental Science","Score":null,"Total":0}
引用次数: 0
Abstract
To understand the spatial distribution of NO2 near the surface, we utilized measured data from NO2 monitoring stations and combined it with column concentration data from the Tropospheric Monitoring Instrument (TROPOMI), taking the Yangtze River Delta region as the study area. We considered the impact of factors such as population, elevation, and meteorological conditions on NO2 levels. We used automated machine learning to select five machine-learning algorithms with high simulation accuracy, namely ET, RF, XGBoost, LightGBM, and Catboost, and then integrated these five algorithms using the Stacking model to simulate the daily NO2 concentration in the Yangtze River Delta region from March 2020 to February 2021. The results indicated that the RMAE and MAE values of the Stacking ensemble model were 7.078 and 5.270, respectively, which outperformed the single algorithms of ET, RF, XGBoost, LightGBM, and Catboost. The spatial distribution of high NO2 concentrations in the Yangtze River Delta region, consisting of three provinces and one municipality, exhibited a U-shaped pattern with the convergence point located at the intersection of the three provinces, extending towards the southwest. Notably, urban pollution was particularly significant in the urban agglomerations centered around Shanghai, Hangzhou, Nanjing, and Hefei. There were 27 cities that exceeded the national standard daily limit. Changzhou was the city with the most serious NO2 pollution, with the NO2 concentration exceeding the standard for 14 d, followed by Shanghai, with 13 d. In terms of seasonal variation, the order of severity was as follows: winter, autumn, spring, and summer, with the least NO2 pollution occurring on July 9th during the summer, and the most severe NO2 pollution was observed on December 23rd during the winter.