Azamat Suleymanov , Yakov Kuzyakov , Ilgiz Asylbaev , Igor Rusakov , Ruslan Suleymanov , Iren Tuktarova , Larisa Belan
{"title":"Mechanisms and drivers of soil pH assessed by Shapley additive explanation","authors":"Azamat Suleymanov , Yakov Kuzyakov , Ilgiz Asylbaev , Igor Rusakov , Ruslan Suleymanov , Iren Tuktarova , Larisa Belan","doi":"10.1016/j.catena.2025.109301","DOIUrl":null,"url":null,"abstract":"<div><div>Soil pH is a critical property influencing soil health and functions, nutrient availability and microbial activities, and agricultural productivity. Interpreting machine learning models in soil science is a challenge, despite their increasing application. We used a dataset of 2651 soil samples up to 60 cm depth to understand the drivers controlling spatial and depth distribution of soil pH and related them to soil-forming factors in foothills mountains and semi-arid steppes of southern Ural (Russia). Machine learning approach allowed to analyse the effects of key soil-forming factors (climate, topography, vegetation, soil and parent materials) for the predictions and the role of covariates utilizing Shapley values, a game theory-based method to quantify the average marginal contribution of a predictor. The developed models explained 62 %, 56 % and 54 % of the pH variation in 0–20, 20–40 and 40–60 cm, respectively. Climate (precipitation, cloud cover and surface temperature), soil type and elevation were the most important factors of soil pH across all depths. When precipitation in December exceeds 30–35 mm, cloud cover 58–60 % and elevation 400–450 m, the model predicted a lower pH compared with a mean level across all depths. The generated pH maps also revealed a change in soil pH from mountainous forested ecosystems to semi-arid steppe landscapes. These findings are mainly explained by the difference in precipitation-driven leaching and evapotranspiration-induced salt accumulation in soils in the area (9,500 km<sup>2</sup>). Our study underscores the complexity and non-linearity of the relationships between pH and the environmental variables, providing valuable insights into their variations across both horizontal and vertical spatial dimensions.</div></div>","PeriodicalId":9801,"journal":{"name":"Catena","volume":"259 ","pages":"Article 109301"},"PeriodicalIF":5.7000,"publicationDate":"2025-07-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Catena","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0341816225006034","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GEOSCIENCES, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
Soil pH is a critical property influencing soil health and functions, nutrient availability and microbial activities, and agricultural productivity. Interpreting machine learning models in soil science is a challenge, despite their increasing application. We used a dataset of 2651 soil samples up to 60 cm depth to understand the drivers controlling spatial and depth distribution of soil pH and related them to soil-forming factors in foothills mountains and semi-arid steppes of southern Ural (Russia). Machine learning approach allowed to analyse the effects of key soil-forming factors (climate, topography, vegetation, soil and parent materials) for the predictions and the role of covariates utilizing Shapley values, a game theory-based method to quantify the average marginal contribution of a predictor. The developed models explained 62 %, 56 % and 54 % of the pH variation in 0–20, 20–40 and 40–60 cm, respectively. Climate (precipitation, cloud cover and surface temperature), soil type and elevation were the most important factors of soil pH across all depths. When precipitation in December exceeds 30–35 mm, cloud cover 58–60 % and elevation 400–450 m, the model predicted a lower pH compared with a mean level across all depths. The generated pH maps also revealed a change in soil pH from mountainous forested ecosystems to semi-arid steppe landscapes. These findings are mainly explained by the difference in precipitation-driven leaching and evapotranspiration-induced salt accumulation in soils in the area (9,500 km2). Our study underscores the complexity and non-linearity of the relationships between pH and the environmental variables, providing valuable insights into their variations across both horizontal and vertical spatial dimensions.
期刊介绍:
Catena publishes papers describing original field and laboratory investigations and reviews on geoecology and landscape evolution with emphasis on interdisciplinary aspects of soil science, hydrology and geomorphology. It aims to disseminate new knowledge and foster better understanding of the physical environment, of evolutionary sequences that have resulted in past and current landscapes, and of the natural processes that are likely to determine the fate of our terrestrial environment.
Papers within any one of the above topics are welcome provided they are of sufficiently wide interest and relevance.