Evaluating soil erosion zones in the Kangsabati River basin using a stacking framework and SHAP model: a comparative study of machine learning approaches
{"title":"Evaluating soil erosion zones in the Kangsabati River basin using a stacking framework and SHAP model: a comparative study of machine learning approaches","authors":"Javed Mallick, Saeed Alqadhi, Swapan Talukdar, Md Nawaj Sarif, Tania Nasrin, Hazem Ghassan Abdo","doi":"10.1186/s12302-025-01079-9","DOIUrl":null,"url":null,"abstract":"<div><p>Soil erosion is a major concern in the Kangsabati River basin, necessitating a comprehensive scientific approach for effective soil erosion management. This study aimed to predict soil erosion susceptibility zones in the basin using integrated soil erosion and deep learning (DL) based stacking framework. Additionally, the SHAP (SHapley Additive exPlanations) model was utilized to augment the interpretability of the DL model. The study employed the RUSLE model to estimate the soil loss. Through ArcGIS, 2000 erosion sites and 2000 non-erosion sites were randomly selected to generate an inventory map. The study considered 17 factors in four primary categories: topographic, climatic, soil, and land use/land cover (LULC). The Boruta algorithm assessed the importance of these variables. Random Forest (RF), (Deep Neural Networks) DNN, Convolution Neural Network (CNN), and stacking (Meta model) models were used to map soil erosion susceptibility based on the inventory map and controlling features. The RUSLE model revealed five erosion zones with soil loss rates ranging from very low (less than 9 t/ha/year) to very high (above 43 t/ha/year). The results demonstrated that 24.93% of the study area fell within the very high erosion susceptibility zone as predicted by DNN, while RF identified 34.32%, Meta model identified 24.84%, and CNN indicated 10.47% of the study area falling into the very high erosion susceptibility category. In terms of RMSE (value) and MSE, the Meta model demonstrates superior performance, whereas the DNN model excels in terms of MAE. The SHAP values output highlights the substantial impact of Land Use and Land Cover (LULC), the K factor, soil moisture, and elevation on the DNN model. These findings provide a scientific basis for developing strategies and policies to combat soil erosion in the Kangsabati River basin, aiding targeted interventions and sustainable land management decisions.</p></div>","PeriodicalId":546,"journal":{"name":"Environmental Sciences Europe","volume":"37 1","pages":""},"PeriodicalIF":6.0000,"publicationDate":"2025-03-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1186/s12302-025-01079-9.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental Sciences Europe","FirstCategoryId":"93","ListUrlMain":"https://link.springer.com/article/10.1186/s12302-025-01079-9","RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Soil erosion is a major concern in the Kangsabati River basin, necessitating a comprehensive scientific approach for effective soil erosion management. This study aimed to predict soil erosion susceptibility zones in the basin using integrated soil erosion and deep learning (DL) based stacking framework. Additionally, the SHAP (SHapley Additive exPlanations) model was utilized to augment the interpretability of the DL model. The study employed the RUSLE model to estimate the soil loss. Through ArcGIS, 2000 erosion sites and 2000 non-erosion sites were randomly selected to generate an inventory map. The study considered 17 factors in four primary categories: topographic, climatic, soil, and land use/land cover (LULC). The Boruta algorithm assessed the importance of these variables. Random Forest (RF), (Deep Neural Networks) DNN, Convolution Neural Network (CNN), and stacking (Meta model) models were used to map soil erosion susceptibility based on the inventory map and controlling features. The RUSLE model revealed five erosion zones with soil loss rates ranging from very low (less than 9 t/ha/year) to very high (above 43 t/ha/year). The results demonstrated that 24.93% of the study area fell within the very high erosion susceptibility zone as predicted by DNN, while RF identified 34.32%, Meta model identified 24.84%, and CNN indicated 10.47% of the study area falling into the very high erosion susceptibility category. In terms of RMSE (value) and MSE, the Meta model demonstrates superior performance, whereas the DNN model excels in terms of MAE. The SHAP values output highlights the substantial impact of Land Use and Land Cover (LULC), the K factor, soil moisture, and elevation on the DNN model. These findings provide a scientific basis for developing strategies and policies to combat soil erosion in the Kangsabati River basin, aiding targeted interventions and sustainable land management decisions.
期刊介绍:
ESEU is an international journal, focusing primarily on Europe, with a broad scope covering all aspects of environmental sciences, including the main topic regulation.
ESEU will discuss the entanglement between environmental sciences and regulation because, in recent years, there have been misunderstandings and even disagreement between stakeholders in these two areas. ESEU will help to improve the comprehension of issues between environmental sciences and regulation.
ESEU will be an outlet from the German-speaking (DACH) countries to Europe and an inlet from Europe to the DACH countries regarding environmental sciences and regulation.
Moreover, ESEU will facilitate the exchange of ideas and interaction between Europe and the DACH countries regarding environmental regulatory issues.
Although Europe is at the center of ESEU, the journal will not exclude the rest of the world, because regulatory issues pertaining to environmental sciences can be fully seen only from a global perspective.