Research on the influencing factors of PM2.5 in China at different spatial scales based on machine learning algorithm

IF 10.3 1区 环境科学与生态学 Q1 ENVIRONMENTAL SCIENCES
Meiru Chen , Jun Liu , Biwu Chu , Di Zhao , Ruiyu Li , Tianzeng Chen , Qingxin Ma , Yonghong Wang , Peng Zhang , Hao Li , Hong He
{"title":"Research on the influencing factors of PM2.5 in China at different spatial scales based on machine learning algorithm","authors":"Meiru Chen ,&nbsp;Jun Liu ,&nbsp;Biwu Chu ,&nbsp;Di Zhao ,&nbsp;Ruiyu Li ,&nbsp;Tianzeng Chen ,&nbsp;Qingxin Ma ,&nbsp;Yonghong Wang ,&nbsp;Peng Zhang ,&nbsp;Hao Li ,&nbsp;Hong He","doi":"10.1016/j.envint.2025.109536","DOIUrl":null,"url":null,"abstract":"<div><div>PM<sub>2.5</sub> pollution is one of the prominent environmental issues currently faced in China, influenced by various factors and showed significant spatial differences. In this study, the Light Gradient Boosting Machine (LightGBM) model was employed in combination with SHapley Additive exPlanation (SHAP) methods to explore the key impact factors (precursor emissions, meteorological conditions, geographical features and socioeconomic factors) on average annual PM<sub>2.5</sub> levels from 2015 to 2022 at both city and grid levels in China. The results show that incorporating pollutant concentration into the model enhances its performance, with R<sup>2</sup> improving significantly from 0.79 to 0.93, which underscores the importance of pollutant concentration and the outstanding predictive performance of the LightGBM algorithm. Further, after increasing the spatial resolution and applying a grid-level model, R<sup>2</sup> further improves to 0.96 ∼ 0.99. SHAP analysis revealed that PM<sub>2.5</sub> levels in urban areas are significantly influenced by pollutant concentration such as NO<sub>2</sub>, CO, and SO<sub>2</sub>, accounting for 49.3 % of the total impact. In contrast, the grid-based model highlights the dominant role of meteorological factors such as temperature and precipitation influencing PM<sub>2.5</sub> levels in non-urban areas. Moreover, the model results also suggested that the PM<sub>2.5</sub> pollution in Yangtze River Delta (YRD) and Pearl River Delta (PRD) are mainly controlled by primary emissions, while in Beijing-Tianjin-Hebei (BTH), Fenwei Plain (FWP) and Sichuan Basin (SCB), atmospheric oxidation capacity is a limiting factor. This study underscores the potential of machine learning in atmospheric pollution control and offers insights for developing more effective and region-specific PM<sub>2.5</sub> control policies.</div></div>","PeriodicalId":308,"journal":{"name":"Environment International","volume":"200 ","pages":"Article 109536"},"PeriodicalIF":10.3000,"publicationDate":"2025-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environment International","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0160412025002879","RegionNum":1,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

PM2.5 pollution is one of the prominent environmental issues currently faced in China, influenced by various factors and showed significant spatial differences. In this study, the Light Gradient Boosting Machine (LightGBM) model was employed in combination with SHapley Additive exPlanation (SHAP) methods to explore the key impact factors (precursor emissions, meteorological conditions, geographical features and socioeconomic factors) on average annual PM2.5 levels from 2015 to 2022 at both city and grid levels in China. The results show that incorporating pollutant concentration into the model enhances its performance, with R2 improving significantly from 0.79 to 0.93, which underscores the importance of pollutant concentration and the outstanding predictive performance of the LightGBM algorithm. Further, after increasing the spatial resolution and applying a grid-level model, R2 further improves to 0.96 ∼ 0.99. SHAP analysis revealed that PM2.5 levels in urban areas are significantly influenced by pollutant concentration such as NO2, CO, and SO2, accounting for 49.3 % of the total impact. In contrast, the grid-based model highlights the dominant role of meteorological factors such as temperature and precipitation influencing PM2.5 levels in non-urban areas. Moreover, the model results also suggested that the PM2.5 pollution in Yangtze River Delta (YRD) and Pearl River Delta (PRD) are mainly controlled by primary emissions, while in Beijing-Tianjin-Hebei (BTH), Fenwei Plain (FWP) and Sichuan Basin (SCB), atmospheric oxidation capacity is a limiting factor. This study underscores the potential of machine learning in atmospheric pollution control and offers insights for developing more effective and region-specific PM2.5 control policies.

Abstract Image

基于机器学习算法的中国不同空间尺度PM2.5影响因素研究
PM2.5污染是中国当前面临的突出环境问题之一,受多种因素的影响,呈现出显著的空间差异。本研究采用光梯度增强机(Light Gradient Boosting Machine, LightGBM)模型,结合SHapley Additive exPlanation (SHAP)方法,探讨2015 - 2022年中国城市和电网PM2.5年均水平的关键影响因素(前体排放、气象条件、地理特征和社会经济因素)。结果表明,将污染物浓度加入到模型中可以增强模型的性能,R2从0.79显著提高到0.93,这凸显了污染物浓度的重要性以及LightGBM算法出色的预测性能。此外,在提高空间分辨率并应用网格级模型后,R2进一步提高到0.96 ~ 0.99。SHAP分析显示,城市地区的PM2.5水平受到NO2、CO和SO2等污染物浓度的显著影响,占总影响的49.3% %。相比之下,基于网格的模式强调了温度和降水等气象因素对非城市地区PM2.5水平的主导作用。此外,模型结果还表明,长三角和珠三角的PM2.5污染主要受一次排放控制,而京津冀、汾渭平原和四川盆地的PM2.5污染主要受大气氧化能力的限制。这项研究强调了机器学习在大气污染控制中的潜力,并为制定更有效和特定区域的PM2.5控制政策提供了见解。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Environment International
Environment International 环境科学-环境科学
CiteScore
21.90
自引率
3.40%
发文量
734
审稿时长
2.8 months
期刊介绍: Environmental Health publishes manuscripts focusing on critical aspects of environmental and occupational medicine, including studies in toxicology and epidemiology, to illuminate the human health implications of exposure to environmental hazards. The journal adopts an open-access model and practices open peer review. It caters to scientists and practitioners across all environmental science domains, directly or indirectly impacting human health and well-being. With a commitment to enhancing the prevention of environmentally-related health risks, Environmental Health serves as a public health journal for the community and scientists engaged in matters of public health significance concerning the environment.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信