Zhiyuan Fu , Xiao Yang , Yike Ma , Yuhang Sun , Tianlian Wang
{"title":"Integrating explainable AI and causal inference to unveil regional air quality drivers in China","authors":"Zhiyuan Fu , Xiao Yang , Yike Ma , Yuhang Sun , Tianlian Wang","doi":"10.1016/j.jenvman.2025.126270","DOIUrl":null,"url":null,"abstract":"<div><div>Air pollution poses a pressing global public health challenge, demanding a comprehensive understanding of its causes and evolving dynamics to inform effective control strategies. In China, significant spatial heterogeneity complicates the national air quality improvement process. Different regions exhibit varying pollution drivers, which makes uniform governance approaches less effective. Addressing this complexity requires a framework capable of capturing localized causal mechanisms. This study introduces the CADEPT (Causal Analysis–Detection–Explanation–Prediction–Threshold) multi-scale causal inference framework. Using nationwide air quality monitoring data from 2014 to 2022, the study integrates urban, socio-economic, and climatic datasets. It systematically investigates the driving forces and future evolution of the Air Quality Index (AQI) through four major stages: spatial heterogeneity detection, causal and interpretable inference, threshold identification, and scenario-based prediction. The results reveal the following key findings: (1) Local Moran's I detects significant spatial clustering of AQI, while spatial heterogeneity analysis uncovers region-specific influences from climate, emissions, and industrial structures. (2) Interpretability analysis, based on SHAP and TabPFN, uncovers nonlinear and region-specific contributions of key variables to AQI, highlighting climate's mitigating role and industry's aggravating effect. (3) Causal inference quantifies the true impacts of dominant factors, confirming that increases in temperature and precipitation improve air quality, while SO<sub>2</sub> emissions and industrial expansion worsen it. (4) Threshold analysis identifies critical response intervals, highlighting a synergistic amplification effect between meteorological conditions and emission factors. (5) Scenario simulations suggest that promoting low-carbon transitions and coordinated emission reductions are essential for achieving sustained nationwide air quality improvements.</div></div>","PeriodicalId":356,"journal":{"name":"Journal of Environmental Management","volume":"390 ","pages":"Article 126270"},"PeriodicalIF":8.4000,"publicationDate":"2025-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Environmental Management","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0301479725022467","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Air pollution poses a pressing global public health challenge, demanding a comprehensive understanding of its causes and evolving dynamics to inform effective control strategies. In China, significant spatial heterogeneity complicates the national air quality improvement process. Different regions exhibit varying pollution drivers, which makes uniform governance approaches less effective. Addressing this complexity requires a framework capable of capturing localized causal mechanisms. This study introduces the CADEPT (Causal Analysis–Detection–Explanation–Prediction–Threshold) multi-scale causal inference framework. Using nationwide air quality monitoring data from 2014 to 2022, the study integrates urban, socio-economic, and climatic datasets. It systematically investigates the driving forces and future evolution of the Air Quality Index (AQI) through four major stages: spatial heterogeneity detection, causal and interpretable inference, threshold identification, and scenario-based prediction. The results reveal the following key findings: (1) Local Moran's I detects significant spatial clustering of AQI, while spatial heterogeneity analysis uncovers region-specific influences from climate, emissions, and industrial structures. (2) Interpretability analysis, based on SHAP and TabPFN, uncovers nonlinear and region-specific contributions of key variables to AQI, highlighting climate's mitigating role and industry's aggravating effect. (3) Causal inference quantifies the true impacts of dominant factors, confirming that increases in temperature and precipitation improve air quality, while SO2 emissions and industrial expansion worsen it. (4) Threshold analysis identifies critical response intervals, highlighting a synergistic amplification effect between meteorological conditions and emission factors. (5) Scenario simulations suggest that promoting low-carbon transitions and coordinated emission reductions are essential for achieving sustained nationwide air quality improvements.
期刊介绍:
The Journal of Environmental Management is a journal for the publication of peer reviewed, original research for all aspects of management and the managed use of the environment, both natural and man-made.Critical review articles are also welcome; submission of these is strongly encouraged.