Assessment of resampling methods on performance of landslide susceptibility predictions using machine learning in Kendari City, Indonesia
S. Aldiansyah, Farida Wardani
Water Practice & Technology, published 2024-01-08
DOI: 10.2166/wpt.2024.002 (https://doi.org/10.2166/wpt.2024.002)
Abstract
Landslide susceptibility projections that rely on standalone models tend to produce biased results, and the problem is aggravated by class imbalance when the sample population is small. This study proposes a landslide susceptibility prediction approach that integrates three resampling methods, cross-validation (CV), bootstrap (BT), and random subsampling (RS), with ten machine learning models: generalized linear model, support vector machine, random forest (RF), boosted regression trees, classification and regression tree, multivariate adaptive regression splines, mixture discriminant analysis, flexible discriminant analysis, maximum entropy, and maximum likelihood. The methodology was applied in Kendari City, an urban area that has faced destructive erosion. Area under the ROC curve (AUC), true skill statistic (TSS), correlation coefficient (COR), normalized mutual information (NMI), and correct classification rate (CCR) were used to evaluate the predictive accuracy of the proposed models. The results show that resampling improves the performance of the standalone models, and that the models perform better with BT than with CV or RS. The BT-RF model excels across the statistical measures (AUC = 0.97, TSS = 0.97, COR = 0.99, NMI = 0.50, and CCR = 0.91). Given the strong performance of the proposed models in a moderate-scale area, promising results can be expected when they are applied to other regions.
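To make the three resampling schemes and two of the evaluation measures concrete, the following is a minimal sketch in plain Python. It is not the authors' implementation: the function names, the fold count, the number of repeats, and the test fraction are all hypothetical, since the abstract does not state them. The sketch only shows how CV, BT, and RS differ in the way they produce training and test indices, and how TSS (sensitivity + specificity - 1) and CCR (share of correct predictions) follow from a binary confusion matrix.

```python
import random


def kfold_splits(n, k, rng):
    """Cross-validation (CV): every sample is held out exactly once."""
    idx = list(range(n))
    rng.shuffle(idx)
    folds = [idx[i::k] for i in range(k)]  # k roughly equal folds
    for i in range(k):
        test = folds[i]
        train = [j for f in folds[:i] + folds[i + 1:] for j in f]
        yield train, test


def bootstrap_splits(n, repeats, rng):
    """Bootstrap (BT): train on n draws with replacement,
    test on the rows that were never drawn (out-of-bag)."""
    for _ in range(repeats):
        train = [rng.randrange(n) for _ in range(n)]
        drawn = set(train)
        test = [j for j in range(n) if j not in drawn]
        yield train, test


def subsample_splits(n, repeats, test_fraction, rng):
    """Random subsampling (RS): repeated random hold-out without replacement."""
    n_test = max(1, round(n * test_fraction))
    for _ in range(repeats):
        idx = list(range(n))
        rng.shuffle(idx)
        yield idx[n_test:], idx[:n_test]


def tss_and_ccr(y_true, y_pred):
    """Return (TSS, CCR) for binary labels, where 1 = landslide, 0 = no landslide."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0
    specificity = tn / (tn + fp) if (tn + fp) else 0.0
    tss = sensitivity + specificity - 1
    ccr = (tp + tn) / len(pairs)
    return tss, ccr


# Illustrative settings only (assumed, not taken from the study).
rng = random.Random(0)
cv = list(kfold_splits(20, 5, rng))
bt = list(bootstrap_splits(20, 10, rng))
rs = list(subsample_splits(20, 10, 0.3, rng))
```

In a full workflow, each `(train, test)` pair would be used to fit one of the listed models on the training rows and score it on the test rows, with the scores averaged over all splits. The practical difference between the schemes is visible in the indices: CV tests every row once, BT repeats rows in training and tests on whatever was left out, and RS draws a fresh hold-out set each time.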