Environmental variables controlling soil thickness across elevation zones in the eastern Himalayas

IF 5.7 1区 农林科学 Q1 GEOSCIENCES, MULTIDISCIPLINARY
Xin Zhang , Jianrong Fan , Hongjin Chen
{"title":"Environmental variables controlling soil thickness across elevation zones in the eastern Himalayas","authors":"Xin Zhang ,&nbsp;Jianrong Fan ,&nbsp;Hongjin Chen","doi":"10.1016/j.catena.2025.109526","DOIUrl":null,"url":null,"abstract":"<div><div>Soil thickness (ST) is a critical indicator of ecological processes such as water regulation, nutrient cycling, carbon storage, and vegetation restoration. However, digital mapping of ST is often hindered by prediction bias and difficulties in identifying driving factors, largely due to the prevalence of right-censored data. Here, we investigate the eastern Himalayas (500–7000 m a.s.l.) using 130 field profiles and a suite of environmental variables spanning lithology, vegetation, climate, topography, spectral indices, and soil properties. We propose a composite framework that integrates the inverse probability of censoring weighted random forest (IPCW-RF) with Shapley additive explanations (SHAP) analysis. The IPCW component corrects censoring bias, the RF model provides high-precision spatial prediction, and SHAP yields quantitative insights into the mechanisms shaping ST at global, local, and spatial scales. The IPCW-RF model, trained with climate, topographic, and Sentinel-2 spectral variables, achieved strong predictive performance (R<sup>2</sup> = 0.55; MAE = 18.09 cm; RMSE = 21.59 cm) under 5-fold nearest-neighbor distance matching cross-validation (CV). Areas with high ST values were concentrated in the Yarlung Tsangpo River valley and on gently sloping plateau surfaces. The relationship between ST and elevation followed a nonlinear but structured pattern: ST decreased significantly between 500 and 2500 m, fluctuated between positive and negative correlations from 2500 to 5000 m, and declined again above 5000 m. SHAP analysis revealed an elevation-dependent contribution of environmental factors, with Band 8 emerging as the dominant predictor overall. At low to mid elevations (500–2500 m), vegetation played the primary role; at mid to high elevations (2500–4500 m), both vegetation and topography were influential; and at high elevations (&gt;4500 m), topographic controls predominated. This study demonstrates the integration of an interpretable machine-learning framework with censored data, offering new insights into soil formation processes and improving spatial prediction of ST in complex plateau terrains.</div></div>","PeriodicalId":9801,"journal":{"name":"Catena","volume":"261 ","pages":"Article 109526"},"PeriodicalIF":5.7000,"publicationDate":"2025-10-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Catena","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0341816225008288","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GEOSCIENCES, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

Abstract

Soil thickness (ST) is a critical indicator of ecological processes such as water regulation, nutrient cycling, carbon storage, and vegetation restoration. However, digital mapping of ST is often hindered by prediction bias and difficulties in identifying driving factors, largely due to the prevalence of right-censored data. Here, we investigate the eastern Himalayas (500–7000 m a.s.l.) using 130 field profiles and a suite of environmental variables spanning lithology, vegetation, climate, topography, spectral indices, and soil properties. We propose a composite framework that integrates the inverse probability of censoring weighted random forest (IPCW-RF) with Shapley additive explanations (SHAP) analysis. The IPCW component corrects censoring bias, the RF model provides high-precision spatial prediction, and SHAP yields quantitative insights into the mechanisms shaping ST at global, local, and spatial scales. The IPCW-RF model, trained with climate, topographic, and Sentinel-2 spectral variables, achieved strong predictive performance (R2 = 0.55; MAE = 18.09 cm; RMSE = 21.59 cm) under 5-fold nearest-neighbor distance matching cross-validation (CV). Areas with high ST values were concentrated in the Yarlung Tsangpo River valley and on gently sloping plateau surfaces. The relationship between ST and elevation followed a nonlinear but structured pattern: ST decreased significantly between 500 and 2500 m, fluctuated between positive and negative correlations from 2500 to 5000 m, and declined again above 5000 m. SHAP analysis revealed an elevation-dependent contribution of environmental factors, with Band 8 emerging as the dominant predictor overall. At low to mid elevations (500–2500 m), vegetation played the primary role; at mid to high elevations (2500–4500 m), both vegetation and topography were influential; and at high elevations (>4500 m), topographic controls predominated. This study demonstrates the integration of an interpretable machine-learning framework with censored data, offering new insights into soil formation processes and improving spatial prediction of ST in complex plateau terrains.
控制喜马拉雅东部高程带土壤厚度的环境变量
土壤厚度是土壤水分调节、养分循环、碳储量和植被恢复等生态过程的重要指标。然而,ST的数字制图经常受到预测偏差和识别驱动因素的困难的阻碍,这主要是由于右审查数据的普遍存在。在这里,我们利用130个野外剖面和一系列环境变量,包括岩性、植被、气候、地形、光谱指数和土壤性质,研究了喜马拉雅东部(500-7000米a.s.l.)。本文提出了一个将加权随机森林(IPCW-RF)逆概率与Shapley加性解释(SHAP)分析相结合的复合框架。IPCW组件纠正了审查偏差,RF模型提供了高精度的空间预测,而SHAP可以在全球、局部和空间尺度上定量地了解形成ST的机制。使用气候、地形和Sentinel-2光谱变量训练的IPCW-RF模型在5倍最近邻距离匹配交叉验证(CV)下获得了较强的预测性能(R2 = 0.55; MAE = 18.09 cm; RMSE = 21.59 cm)。温度值高的地区主要集中在雅鲁藏布江流域和缓坡高原表面。温度与高程之间呈非线性结构关系:温度在500 ~ 2500 m之间显著降低,在2500 ~ 5000 m之间在正相关和负相关之间波动,在5000 m以上再次下降。SHAP分析揭示了环境因素的海拔依赖性贡献,8级成为总体上的主要预测因子。在中低海拔地区(500 ~ 2500 m),植被起主要作用;在中高海拔地区(2500 ~ 4500 m),植被和地形均有影响;在高海拔地区(4500米),地形控制占主导地位。该研究展示了可解释的机器学习框架与审查数据的集成,为土壤形成过程提供了新的见解,并改善了复杂高原地形中温度的空间预测。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Catena
Catena 环境科学-地球科学综合
CiteScore
10.50
自引率
9.70%
发文量
816
审稿时长
54 days
期刊介绍: Catena publishes papers describing original field and laboratory investigations and reviews on geoecology and landscape evolution with emphasis on interdisciplinary aspects of soil science, hydrology and geomorphology. It aims to disseminate new knowledge and foster better understanding of the physical environment, of evolutionary sequences that have resulted in past and current landscapes, and of the natural processes that are likely to determine the fate of our terrestrial environment. Papers within any one of the above topics are welcome provided they are of sufficiently wide interest and relevance.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信