Bifeng Hu, Yibo Geng, Yi Lin, Hanjie Ni, Modian Xie, Nan Wang, Jie Hu, Qian Zou, Songchao Chen, Yin Zhou, Hongyi Li, Zhou Shi
{"title":"机器学习与地统计学方法在区域农田土壤有机碳密度制图及可解释模型中位置特异支配因子可视化中的比较","authors":"Bifeng Hu, Yibo Geng, Yi Lin, Hanjie Ni, Modian Xie, Nan Wang, Jie Hu, Qian Zou, Songchao Chen, Yin Zhou, Hongyi Li, Zhou Shi","doi":"10.1002/ldr.5573","DOIUrl":null,"url":null,"abstract":"High-precision soil organic carbon density (SOCD) map is significant for understanding ecosystem carbon cycles and estimating soil organic carbon storage. However, the current mapping methods are difficult to balance accuracy and interpretability, which brings great challenges to the mapping of SOCD. In the present research, a total of 6223 soil samples were collected, along with data pertaining to 30 environmental covariates, from agricultural land located in the Poyang Lake Plain of Jiangxi Province, southern China. Furthermore, ordinary kriging (OK), geographically weighted regression (GWR), random forest (RF), and empirical Bayesian kriging (EBK), along with three hybrid models (RF-OK, RF-EBK, RF-GWR), were constructed. These models were used to map the SOCD (soil organic carbon density) in the study region with a high resolution of 30 m. After that, shapley additive explanations (SHAP) were used to quantify the global contribution and spatially identify the dominant factors that influence SOCD variation. The study outcomes suggested that compared to the single geostatistics model and hybrid model, the RF method emerged as the most effective predictive model, showcasing superior performance (coefficient of determination (<i>R</i><sup>2</sup>) = 0.44, root mean squared error (RMSE) = 0.61 kg m<sup>−2</sup>, Lin's concordance coefficient (LCCC) = 0.58). Using the SHAP, we found that soil properties contributed the most to the prediction of global SOCD (81.67%). At the pixel level, total nitrogen dominated 50.33% of the farmland, followed by parent material (8.11%), available silicon (8.00%), and mean annual precipitation (5.71%), and the remaining variables accounted for less than 5.50%. In summary, our study offered valuable enlightenment toward achieving a balance between accuracy and interpretability of digital soil mapping, and deepened our understanding of the spatial variation of farmland SOCD.","PeriodicalId":203,"journal":{"name":"Land Degradation & Development","volume":"69 1","pages":""},"PeriodicalIF":3.6000,"publicationDate":"2025-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Comparison of Machine Learning and Geostatistical Methods on Mapping Soil Organic Carbon Density in Regional Croplands and Visualizing Its Location-Specific Dominators via Interpretable Model\",\"authors\":\"Bifeng Hu, Yibo Geng, Yi Lin, Hanjie Ni, Modian Xie, Nan Wang, Jie Hu, Qian Zou, Songchao Chen, Yin Zhou, Hongyi Li, Zhou Shi\",\"doi\":\"10.1002/ldr.5573\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"High-precision soil organic carbon density (SOCD) map is significant for understanding ecosystem carbon cycles and estimating soil organic carbon storage. However, the current mapping methods are difficult to balance accuracy and interpretability, which brings great challenges to the mapping of SOCD. In the present research, a total of 6223 soil samples were collected, along with data pertaining to 30 environmental covariates, from agricultural land located in the Poyang Lake Plain of Jiangxi Province, southern China. Furthermore, ordinary kriging (OK), geographically weighted regression (GWR), random forest (RF), and empirical Bayesian kriging (EBK), along with three hybrid models (RF-OK, RF-EBK, RF-GWR), were constructed. These models were used to map the SOCD (soil organic carbon density) in the study region with a high resolution of 30 m. After that, shapley additive explanations (SHAP) were used to quantify the global contribution and spatially identify the dominant factors that influence SOCD variation. The study outcomes suggested that compared to the single geostatistics model and hybrid model, the RF method emerged as the most effective predictive model, showcasing superior performance (coefficient of determination (<i>R</i><sup>2</sup>) = 0.44, root mean squared error (RMSE) = 0.61 kg m<sup>−2</sup>, Lin's concordance coefficient (LCCC) = 0.58). Using the SHAP, we found that soil properties contributed the most to the prediction of global SOCD (81.67%). At the pixel level, total nitrogen dominated 50.33% of the farmland, followed by parent material (8.11%), available silicon (8.00%), and mean annual precipitation (5.71%), and the remaining variables accounted for less than 5.50%. In summary, our study offered valuable enlightenment toward achieving a balance between accuracy and interpretability of digital soil mapping, and deepened our understanding of the spatial variation of farmland SOCD.\",\"PeriodicalId\":203,\"journal\":{\"name\":\"Land Degradation & Development\",\"volume\":\"69 1\",\"pages\":\"\"},\"PeriodicalIF\":3.6000,\"publicationDate\":\"2025-03-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Land Degradation & Development\",\"FirstCategoryId\":\"97\",\"ListUrlMain\":\"https://doi.org/10.1002/ldr.5573\",\"RegionNum\":2,\"RegionCategory\":\"农林科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENVIRONMENTAL SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Land Degradation & Development","FirstCategoryId":"97","ListUrlMain":"https://doi.org/10.1002/ldr.5573","RegionNum":2,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
Comparison of Machine Learning and Geostatistical Methods on Mapping Soil Organic Carbon Density in Regional Croplands and Visualizing Its Location-Specific Dominators via Interpretable Model
High-precision soil organic carbon density (SOCD) map is significant for understanding ecosystem carbon cycles and estimating soil organic carbon storage. However, the current mapping methods are difficult to balance accuracy and interpretability, which brings great challenges to the mapping of SOCD. In the present research, a total of 6223 soil samples were collected, along with data pertaining to 30 environmental covariates, from agricultural land located in the Poyang Lake Plain of Jiangxi Province, southern China. Furthermore, ordinary kriging (OK), geographically weighted regression (GWR), random forest (RF), and empirical Bayesian kriging (EBK), along with three hybrid models (RF-OK, RF-EBK, RF-GWR), were constructed. These models were used to map the SOCD (soil organic carbon density) in the study region with a high resolution of 30 m. After that, shapley additive explanations (SHAP) were used to quantify the global contribution and spatially identify the dominant factors that influence SOCD variation. The study outcomes suggested that compared to the single geostatistics model and hybrid model, the RF method emerged as the most effective predictive model, showcasing superior performance (coefficient of determination (R2) = 0.44, root mean squared error (RMSE) = 0.61 kg m−2, Lin's concordance coefficient (LCCC) = 0.58). Using the SHAP, we found that soil properties contributed the most to the prediction of global SOCD (81.67%). At the pixel level, total nitrogen dominated 50.33% of the farmland, followed by parent material (8.11%), available silicon (8.00%), and mean annual precipitation (5.71%), and the remaining variables accounted for less than 5.50%. In summary, our study offered valuable enlightenment toward achieving a balance between accuracy and interpretability of digital soil mapping, and deepened our understanding of the spatial variation of farmland SOCD.
期刊介绍:
Land Degradation & Development is an international journal which seeks to promote rational study of the recognition, monitoring, control and rehabilitation of degradation in terrestrial environments. The journal focuses on:
- what land degradation is;
- what causes land degradation;
- the impacts of land degradation
- the scale of land degradation;
- the history, current status or future trends of land degradation;
- avoidance, mitigation and control of land degradation;
- remedial actions to rehabilitate or restore degraded land;
- sustainable land management.