{"title":"Assessment and projection of carbon density in China: An integrated approach combining Boruta-SHAP-machine learning and structural equation modeling","authors":"Ya Wen, Xue Han, Zizhao Ma, Ruirui Zhang","doi":"10.1016/j.rsase.2025.101694","DOIUrl":null,"url":null,"abstract":"<div><div>Carbon density is a key indicator of the carbon sequestration capacity of terrestrial ecosystem. As one of the world's largest carbon emitters, gaining a clear understanding of China's carbon density is essential for achieving the strategic goals of “carbon peak” and “carbon neutrality.” Therefore, this study used machine learning to estimate the current carbon density of vegetation in China. However, machine learning often suffer from the “black box” problem. To address this, the Boruta-SHAP variable selection method was employed to enhance the interpretability of feature importance. Based on the current estimation of China's vegetation carbon density, this study used a structural equation model to explore the driving factors influencing carbon density. The model further predicts China's vegetation carbon density for 2030 and 2060 based on the identified key driving factors. The results of the study indicate the following: (1) Boruta-SHAP effectively visualizes the relative importance of feature variables. Among various vegetation indices, the Leaf Area Index (LAI) is the most important variable for carbon density modeling, with a mean SHAP value of 4.86. (2) In estimating China's carbon density, Boruta-SHAP-Random Forest (R<sup>2</sup> = 0.636) and Boruta-SHAP-XGBoost (R<sup>2</sup> = 0.629) outperform Multiple Linear Regression (R<sup>2</sup> = 0.559). (3) The average vegetation carbon density in China is 48.27 Mg/ha. The Moderately Low Carbon Density Zone accounts for the largest proportion of the total area, primarily located in the northeast region (e.g., Jilin and Heilongjiang), the southern fringe, and the eastern hilly areas. (4) Climate is identified as the main driving factor influencing China's vegetation carbon density, with a path coefficient of 0.72. Among climate variables, maximum temperature exerts the strongest influence, with a path coefficient of 0.85. (5) The average vegetation carbon density in China is projected to be 52.34 Mg C/ha in 2030 and 53.08 Mg C/ha in 2060. The findings of this study are expected to provide theoretical support for optimizing forest management policies and addressing climate change risks, including carbon sink fluctuations caused by extreme events.</div></div>","PeriodicalId":53227,"journal":{"name":"Remote Sensing Applications-Society and Environment","volume":"39 ","pages":"Article 101694"},"PeriodicalIF":4.5000,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Remote Sensing Applications-Society and Environment","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2352938525002472","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Carbon density is a key indicator of the carbon sequestration capacity of terrestrial ecosystem. As one of the world's largest carbon emitters, gaining a clear understanding of China's carbon density is essential for achieving the strategic goals of “carbon peak” and “carbon neutrality.” Therefore, this study used machine learning to estimate the current carbon density of vegetation in China. However, machine learning often suffer from the “black box” problem. To address this, the Boruta-SHAP variable selection method was employed to enhance the interpretability of feature importance. Based on the current estimation of China's vegetation carbon density, this study used a structural equation model to explore the driving factors influencing carbon density. The model further predicts China's vegetation carbon density for 2030 and 2060 based on the identified key driving factors. The results of the study indicate the following: (1) Boruta-SHAP effectively visualizes the relative importance of feature variables. Among various vegetation indices, the Leaf Area Index (LAI) is the most important variable for carbon density modeling, with a mean SHAP value of 4.86. (2) In estimating China's carbon density, Boruta-SHAP-Random Forest (R2 = 0.636) and Boruta-SHAP-XGBoost (R2 = 0.629) outperform Multiple Linear Regression (R2 = 0.559). (3) The average vegetation carbon density in China is 48.27 Mg/ha. The Moderately Low Carbon Density Zone accounts for the largest proportion of the total area, primarily located in the northeast region (e.g., Jilin and Heilongjiang), the southern fringe, and the eastern hilly areas. (4) Climate is identified as the main driving factor influencing China's vegetation carbon density, with a path coefficient of 0.72. Among climate variables, maximum temperature exerts the strongest influence, with a path coefficient of 0.85. (5) The average vegetation carbon density in China is projected to be 52.34 Mg C/ha in 2030 and 53.08 Mg C/ha in 2060. The findings of this study are expected to provide theoretical support for optimizing forest management policies and addressing climate change risks, including carbon sink fluctuations caused by extreme events.
期刊介绍:
The journal ''Remote Sensing Applications: Society and Environment'' (RSASE) focuses on remote sensing studies that address specific topics with an emphasis on environmental and societal issues - regional / local studies with global significance. Subjects are encouraged to have an interdisciplinary approach and include, but are not limited by: " -Global and climate change studies addressing the impact of increasing concentrations of greenhouse gases, CO2 emission, carbon balance and carbon mitigation, energy system on social and environmental systems -Ecological and environmental issues including biodiversity, ecosystem dynamics, land degradation, atmospheric and water pollution, urban footprint, ecosystem management and natural hazards (e.g. earthquakes, typhoons, floods, landslides) -Natural resource studies including land-use in general, biomass estimation, forests, agricultural land, plantation, soils, coral reefs, wetland and water resources -Agriculture, food production systems and food security outcomes -Socio-economic issues including urban systems, urban growth, public health, epidemics, land-use transition and land use conflicts -Oceanography and coastal zone studies, including sea level rise projections, coastlines changes and the ocean-land interface -Regional challenges for remote sensing application techniques, monitoring and analysis, such as cloud screening and atmospheric correction for tropical regions -Interdisciplinary studies combining remote sensing, household survey data, field measurements and models to address environmental, societal and sustainability issues -Quantitative and qualitative analysis that documents the impact of using remote sensing studies in social, political, environmental or economic systems