Assessment of resampling methods on performance of landslide susceptibility predictions using machine learning in Kendari City, Indonesia

S. Aldiansyah, Farida Wardani
{"title":"Assessment of resampling methods on performance of landslide susceptibility predictions using machine learning in Kendari City, Indonesia","authors":"S. Aldiansyah, Farida Wardani","doi":"10.2166/wpt.2024.002","DOIUrl":null,"url":null,"abstract":"\n \n Landslide susceptibility projections that rely on independent models produce biased results. This situation will worsen class balance if working with a small population. This study proposes a landslide susceptibility prediction model based on resampling, cross-validation, bootstrap, and random subsampling approaches, which is integrated with the machine learning model, generalized linear model, support vector machine, random forest, boosted regression trees, classification and regression tree, multivariate adaptive regression splines, mixture discriminate analysis, flexible discriminant analysis, maximum entropy, and maximum likelihood. This methodology was applied in Kendari City, an urban area which faced destructive erosion. Area under the ROC curve (AUC), true skill statistics (TSS), correlation coefficient (COR), normalized mutual information (NMI), and correct classification rate (CCR) were used to evaluate the predictive accuracy of the proposed model. The results show that the resampling algorithm improves the performance of the standalone model. Results also revealed that standalone models had better performance with the BT algorithm compared to the CV and RS algorithms. The Bt-RF model excels in statistical measures (AUC = 0.97, TSS = 0.97, COR = 0.99, NMI = 0.50, and CCR = 0.91). Given the admirable performance of the proposed models in a moderate scale area, promising results can be expected from these models for other regions.","PeriodicalId":510255,"journal":{"name":"Water Practice & Technology","volume":"15 11","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Water Practice & Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2166/wpt.2024.002","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Landslide susceptibility projections that rely on independent models produce biased results. This situation will worsen class balance if working with a small population. This study proposes a landslide susceptibility prediction model based on resampling, cross-validation, bootstrap, and random subsampling approaches, which is integrated with the machine learning model, generalized linear model, support vector machine, random forest, boosted regression trees, classification and regression tree, multivariate adaptive regression splines, mixture discriminate analysis, flexible discriminant analysis, maximum entropy, and maximum likelihood. This methodology was applied in Kendari City, an urban area which faced destructive erosion. Area under the ROC curve (AUC), true skill statistics (TSS), correlation coefficient (COR), normalized mutual information (NMI), and correct classification rate (CCR) were used to evaluate the predictive accuracy of the proposed model. The results show that the resampling algorithm improves the performance of the standalone model. Results also revealed that standalone models had better performance with the BT algorithm compared to the CV and RS algorithms. The Bt-RF model excels in statistical measures (AUC = 0.97, TSS = 0.97, COR = 0.99, NMI = 0.50, and CCR = 0.91). Given the admirable performance of the proposed models in a moderate scale area, promising results can be expected from these models for other regions.
在印度尼西亚肯达里市利用机器学习评估重采样方法对滑坡易发性预测的影响
依靠独立模型进行的滑坡易发性预测会产生有偏差的结果。如果处理的是小规模人口,这种情况会加剧类平衡。本研究提出了一种基于重采样、交叉验证、引导和随机子采样方法的滑坡易发性预测模型,该模型与机器学习模型、广义线性模型、支持向量机、随机森林、助推回归树、分类和回归树、多元自适应回归样条、混合判别分析、灵活判别分析、最大熵和最大似然等方法相结合。该方法被应用于面临破坏性侵蚀的肯达里市。ROC 曲线下面积 (AUC)、真实技能统计量 (TSS)、相关系数 (COR)、归一化互信息 (NMI) 和正确分类率 (CCR) 被用来评估所提出模型的预测准确性。结果表明,重采样算法提高了独立模型的性能。结果还显示,与 CV 算法和 RS 算法相比,使用 BT 算法的独立模型具有更好的性能。Bt-RF 模型在统计测量方面表现出色(AUC = 0.97、TSS = 0.97、COR = 0.99、NMI = 0.50 和 CCR = 0.91)。鉴于所提出的模型在中等规模地区的出色表现,预计这些模型在其他地区也会取得可喜的成果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信