Azamat Suleymanov , Evgeny Abakumov , Vyacheslav Polyakov , Alexander Kozlov , Nicolas P.A. Saby , Petr Kuzmenko , Salavat Telyagissov , João Augusto Coblinski
{"title":"Estimation and mapping of soil pH in urban landscapes","authors":"Azamat Suleymanov , Evgeny Abakumov , Vyacheslav Polyakov , Alexander Kozlov , Nicolas P.A. Saby , Petr Kuzmenko , Salavat Telyagissov , João Augusto Coblinski","doi":"10.1016/j.geodrs.2025.e00919","DOIUrl":null,"url":null,"abstract":"<div><div>Despite their significance in understanding soil ecology and health, there is a scarcity of studies on soil modelling in urbanized landscapes. In this study, we evaluated the performance of machine learning (ML) and hybrid techniques in predicting topsoil pH (0–20 cm) in the city of St. Petersburg (Russia). We used a dataset of 84 soil pH measurements and environmental covariates, including remote sensing data, relief and anthropogenic maps. We applied Random Forest (RF) and RF plus Residual Kriging (RFRK) approaches for digital mapping of pH values. The predictive performance of the models was assessed using several metrics, mean absolute error (MAE), including root mean squared error (RMSE), coefficient of determination (R<sup>2</sup>) and Nash–Sutcliffe model efficiency coefficient (NSE). We also evaluated the prediction uncertainty with the prediction interval coverage probability (PICP) and “Area of applicability” (AOA) approach. Our results showed the pH levels varied between 4.4 and 8.6 and were characterized by moderate spatial dependence. Both models demonstrated similar performance, whereas the RFRK model slightly outperformed the RF approach with prediction performance MAE = 0.50, RMSE = 0.58, R<sup>2</sup> = 0.63 and NSE = 0.47. The PICP suggested that the uncertainty associated with pH was underestimated, whereas almost all predicted areas were within the AOA. We found that remote sensing covariates (vegetation indices) were the most important predictors of soil pH. According to the generated maps, alkaline soils were mostly located in urbanized areas with dense buildings, whereas low pH values were observed in parks and open relatively undisturbed areas. Our findings highlight the potential of remote sensing data for digital mapping of soil pH in urban environments, typically characterized by higher complexity and heterogeneity.</div></div>","PeriodicalId":56001,"journal":{"name":"Geoderma Regional","volume":"40 ","pages":"Article e00919"},"PeriodicalIF":3.1000,"publicationDate":"2025-01-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Geoderma Regional","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2352009425000045","RegionNum":2,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"SOIL SCIENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Despite their significance in understanding soil ecology and health, there is a scarcity of studies on soil modelling in urbanized landscapes. In this study, we evaluated the performance of machine learning (ML) and hybrid techniques in predicting topsoil pH (0–20 cm) in the city of St. Petersburg (Russia). We used a dataset of 84 soil pH measurements and environmental covariates, including remote sensing data, relief and anthropogenic maps. We applied Random Forest (RF) and RF plus Residual Kriging (RFRK) approaches for digital mapping of pH values. The predictive performance of the models was assessed using several metrics, mean absolute error (MAE), including root mean squared error (RMSE), coefficient of determination (R2) and Nash–Sutcliffe model efficiency coefficient (NSE). We also evaluated the prediction uncertainty with the prediction interval coverage probability (PICP) and “Area of applicability” (AOA) approach. Our results showed the pH levels varied between 4.4 and 8.6 and were characterized by moderate spatial dependence. Both models demonstrated similar performance, whereas the RFRK model slightly outperformed the RF approach with prediction performance MAE = 0.50, RMSE = 0.58, R2 = 0.63 and NSE = 0.47. The PICP suggested that the uncertainty associated with pH was underestimated, whereas almost all predicted areas were within the AOA. We found that remote sensing covariates (vegetation indices) were the most important predictors of soil pH. According to the generated maps, alkaline soils were mostly located in urbanized areas with dense buildings, whereas low pH values were observed in parks and open relatively undisturbed areas. Our findings highlight the potential of remote sensing data for digital mapping of soil pH in urban environments, typically characterized by higher complexity and heterogeneity.
期刊介绍:
Global issues require studies and solutions on national and regional levels. Geoderma Regional focuses on studies that increase understanding and advance our scientific knowledge of soils in all regions of the world. The journal embraces every aspect of soil science and welcomes reviews of regional progress.