Meiru Chen , Jun Liu , Biwu Chu , Di Zhao , Ruiyu Li , Tianzeng Chen , Qingxin Ma , Yonghong Wang , Peng Zhang , Hao Li , Hong He
{"title":"基于机器学习算法的中国不同空间尺度PM2.5影响因素研究","authors":"Meiru Chen , Jun Liu , Biwu Chu , Di Zhao , Ruiyu Li , Tianzeng Chen , Qingxin Ma , Yonghong Wang , Peng Zhang , Hao Li , Hong He","doi":"10.1016/j.envint.2025.109536","DOIUrl":null,"url":null,"abstract":"<div><div>PM<sub>2.5</sub> pollution is one of the prominent environmental issues currently faced in China, influenced by various factors and showed significant spatial differences. In this study, the Light Gradient Boosting Machine (LightGBM) model was employed in combination with SHapley Additive exPlanation (SHAP) methods to explore the key impact factors (precursor emissions, meteorological conditions, geographical features and socioeconomic factors) on average annual PM<sub>2.5</sub> levels from 2015 to 2022 at both city and grid levels in China. The results show that incorporating pollutant concentration into the model enhances its performance, with R<sup>2</sup> improving significantly from 0.79 to 0.93, which underscores the importance of pollutant concentration and the outstanding predictive performance of the LightGBM algorithm. Further, after increasing the spatial resolution and applying a grid-level model, R<sup>2</sup> further improves to 0.96 ∼ 0.99. SHAP analysis revealed that PM<sub>2.5</sub> levels in urban areas are significantly influenced by pollutant concentration such as NO<sub>2</sub>, CO, and SO<sub>2</sub>, accounting for 49.3 % of the total impact. In contrast, the grid-based model highlights the dominant role of meteorological factors such as temperature and precipitation influencing PM<sub>2.5</sub> levels in non-urban areas. Moreover, the model results also suggested that the PM<sub>2.5</sub> pollution in Yangtze River Delta (YRD) and Pearl River Delta (PRD) are mainly controlled by primary emissions, while in Beijing-Tianjin-Hebei (BTH), Fenwei Plain (FWP) and Sichuan Basin (SCB), atmospheric oxidation capacity is a limiting factor. This study underscores the potential of machine learning in atmospheric pollution control and offers insights for developing more effective and region-specific PM<sub>2.5</sub> control policies.</div></div>","PeriodicalId":308,"journal":{"name":"Environment International","volume":"200 ","pages":"Article 109536"},"PeriodicalIF":10.3000,"publicationDate":"2025-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Research on the influencing factors of PM2.5 in China at different spatial scales based on machine learning algorithm\",\"authors\":\"Meiru Chen , Jun Liu , Biwu Chu , Di Zhao , Ruiyu Li , Tianzeng Chen , Qingxin Ma , Yonghong Wang , Peng Zhang , Hao Li , Hong He\",\"doi\":\"10.1016/j.envint.2025.109536\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>PM<sub>2.5</sub> pollution is one of the prominent environmental issues currently faced in China, influenced by various factors and showed significant spatial differences. In this study, the Light Gradient Boosting Machine (LightGBM) model was employed in combination with SHapley Additive exPlanation (SHAP) methods to explore the key impact factors (precursor emissions, meteorological conditions, geographical features and socioeconomic factors) on average annual PM<sub>2.5</sub> levels from 2015 to 2022 at both city and grid levels in China. The results show that incorporating pollutant concentration into the model enhances its performance, with R<sup>2</sup> improving significantly from 0.79 to 0.93, which underscores the importance of pollutant concentration and the outstanding predictive performance of the LightGBM algorithm. Further, after increasing the spatial resolution and applying a grid-level model, R<sup>2</sup> further improves to 0.96 ∼ 0.99. SHAP analysis revealed that PM<sub>2.5</sub> levels in urban areas are significantly influenced by pollutant concentration such as NO<sub>2</sub>, CO, and SO<sub>2</sub>, accounting for 49.3 % of the total impact. In contrast, the grid-based model highlights the dominant role of meteorological factors such as temperature and precipitation influencing PM<sub>2.5</sub> levels in non-urban areas. Moreover, the model results also suggested that the PM<sub>2.5</sub> pollution in Yangtze River Delta (YRD) and Pearl River Delta (PRD) are mainly controlled by primary emissions, while in Beijing-Tianjin-Hebei (BTH), Fenwei Plain (FWP) and Sichuan Basin (SCB), atmospheric oxidation capacity is a limiting factor. This study underscores the potential of machine learning in atmospheric pollution control and offers insights for developing more effective and region-specific PM<sub>2.5</sub> control policies.</div></div>\",\"PeriodicalId\":308,\"journal\":{\"name\":\"Environment International\",\"volume\":\"200 \",\"pages\":\"Article 109536\"},\"PeriodicalIF\":10.3000,\"publicationDate\":\"2025-05-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Environment International\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0160412025002879\",\"RegionNum\":1,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENVIRONMENTAL SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environment International","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0160412025002879","RegionNum":1,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
Research on the influencing factors of PM2.5 in China at different spatial scales based on machine learning algorithm
PM2.5 pollution is one of the prominent environmental issues currently faced in China, influenced by various factors and showed significant spatial differences. In this study, the Light Gradient Boosting Machine (LightGBM) model was employed in combination with SHapley Additive exPlanation (SHAP) methods to explore the key impact factors (precursor emissions, meteorological conditions, geographical features and socioeconomic factors) on average annual PM2.5 levels from 2015 to 2022 at both city and grid levels in China. The results show that incorporating pollutant concentration into the model enhances its performance, with R2 improving significantly from 0.79 to 0.93, which underscores the importance of pollutant concentration and the outstanding predictive performance of the LightGBM algorithm. Further, after increasing the spatial resolution and applying a grid-level model, R2 further improves to 0.96 ∼ 0.99. SHAP analysis revealed that PM2.5 levels in urban areas are significantly influenced by pollutant concentration such as NO2, CO, and SO2, accounting for 49.3 % of the total impact. In contrast, the grid-based model highlights the dominant role of meteorological factors such as temperature and precipitation influencing PM2.5 levels in non-urban areas. Moreover, the model results also suggested that the PM2.5 pollution in Yangtze River Delta (YRD) and Pearl River Delta (PRD) are mainly controlled by primary emissions, while in Beijing-Tianjin-Hebei (BTH), Fenwei Plain (FWP) and Sichuan Basin (SCB), atmospheric oxidation capacity is a limiting factor. This study underscores the potential of machine learning in atmospheric pollution control and offers insights for developing more effective and region-specific PM2.5 control policies.
期刊介绍:
Environmental Health publishes manuscripts focusing on critical aspects of environmental and occupational medicine, including studies in toxicology and epidemiology, to illuminate the human health implications of exposure to environmental hazards. The journal adopts an open-access model and practices open peer review.
It caters to scientists and practitioners across all environmental science domains, directly or indirectly impacting human health and well-being. With a commitment to enhancing the prevention of environmentally-related health risks, Environmental Health serves as a public health journal for the community and scientists engaged in matters of public health significance concerning the environment.