{"title":"Predictive performance and uncertainty analysis of ensemble models in gully erosion susceptibility assessment","authors":"Congtan Liu , Haoming Fan , Yixuan Wang","doi":"10.1016/j.iswcr.2025.01.004","DOIUrl":null,"url":null,"abstract":"<div><div>Gully erosion, as a significant natural process in geomorphological evolution, poses serious threats to natural environments and socio-economic stability. In response, Gully Erosion Susceptibility Maps (GESMs) have become essential references for effective watershed management. This study aims to identify the optimal feature datasets and to quantify the uncertainty associated with gully erosion prediction models by developing a novel methodological framework based on ensembles of the three machine learning models: Random Forest (RF), Convolutional Neural Network (CNN), and Transformer models. This study area is the Tuquan watershed in Inner Mongolia, China. A total of 25 Geo-Environmental Factors (GEFs) were selected to build datasets, supplemented by a gully inventory map comprising 823 gullies, resulting in 12,946 samples of both gully and non-gully occurrences. 3 ensemble methods including probability mean (PM), Probability Weighted Mean (PWM), and Probability Empirical Weighted Mean (PEWM) were used. Subsequently, the datasets underwent multi-collinearity testing before model computations. The optimal feature datasets S<sub>7</sub> included factors such as the Convergence Index (CI), Topographic Wetness Index (TWI), Terrain Ruggedness Index (TRI), distance from river, annual rainfall, distance from road, drainage density, elevation, Normalized Difference Vegetation Index NDVI, slope, and Slope Length (LS). The ensemble model Transformer-RF-CNN employing PEWM demonstrated superior performance, validated by 10-fold cross-validation and 8 metrics: Efficiency (E), True Positive Rate (TPR), False Positive Rate (FPR), True Skill Statistics (TSS), Kappa coefficient (K), Area Under the receiver operating characteristic Curve (AUC), Root Mean Square Error (RMSE), and Mean Absolute Error (MAE). The uncertainty associated with GESMs was quantified using the Coefficient of Variation (CV) map, resulting in a confidence map that classified 20 zones, with 75.976% of gullies located in high-susceptibility and low-uncertainty areas. This study provides critical insights for regulators and decision-makers, facilitating more informed planning for gully erosion prevention and control.</div></div>","PeriodicalId":48622,"journal":{"name":"International Soil and Water Conservation Research","volume":"13 2","pages":"Pages 319-333"},"PeriodicalIF":7.3000,"publicationDate":"2025-01-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Soil and Water Conservation Research","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S209563392500005X","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Gully erosion, as a significant natural process in geomorphological evolution, poses serious threats to natural environments and socio-economic stability. In response, Gully Erosion Susceptibility Maps (GESMs) have become essential references for effective watershed management. This study aims to identify the optimal feature datasets and to quantify the uncertainty associated with gully erosion prediction models by developing a novel methodological framework based on ensembles of the three machine learning models: Random Forest (RF), Convolutional Neural Network (CNN), and Transformer models. This study area is the Tuquan watershed in Inner Mongolia, China. A total of 25 Geo-Environmental Factors (GEFs) were selected to build datasets, supplemented by a gully inventory map comprising 823 gullies, resulting in 12,946 samples of both gully and non-gully occurrences. 3 ensemble methods including probability mean (PM), Probability Weighted Mean (PWM), and Probability Empirical Weighted Mean (PEWM) were used. Subsequently, the datasets underwent multi-collinearity testing before model computations. The optimal feature datasets S7 included factors such as the Convergence Index (CI), Topographic Wetness Index (TWI), Terrain Ruggedness Index (TRI), distance from river, annual rainfall, distance from road, drainage density, elevation, Normalized Difference Vegetation Index NDVI, slope, and Slope Length (LS). The ensemble model Transformer-RF-CNN employing PEWM demonstrated superior performance, validated by 10-fold cross-validation and 8 metrics: Efficiency (E), True Positive Rate (TPR), False Positive Rate (FPR), True Skill Statistics (TSS), Kappa coefficient (K), Area Under the receiver operating characteristic Curve (AUC), Root Mean Square Error (RMSE), and Mean Absolute Error (MAE). The uncertainty associated with GESMs was quantified using the Coefficient of Variation (CV) map, resulting in a confidence map that classified 20 zones, with 75.976% of gullies located in high-susceptibility and low-uncertainty areas. This study provides critical insights for regulators and decision-makers, facilitating more informed planning for gully erosion prevention and control.
期刊介绍:
The International Soil and Water Conservation Research (ISWCR), the official journal of World Association of Soil and Water Conservation (WASWAC) http://www.waswac.org, is a multidisciplinary journal of soil and water conservation research, practice, policy, and perspectives. It aims to disseminate new knowledge and promote the practice of soil and water conservation.
The scope of International Soil and Water Conservation Research includes research, strategies, and technologies for prediction, prevention, and protection of soil and water resources. It deals with identification, characterization, and modeling; dynamic monitoring and evaluation; assessment and management of conservation practice and creation and implementation of quality standards.
Examples of appropriate topical areas include (but are not limited to):
• Conservation models, tools, and technologies
• Conservation agricultural
• Soil health resources, indicators, assessment, and management
• Land degradation
• Sustainable development
• Soil erosion and its control
• Soil erosion processes
• Water resources assessment and management
• Watershed management
• Soil erosion models
• Literature review on topics related soil and water conservation research