A Study on Development of Landslide Susceptibility Map in Malaysia Landslide Prone Areas by Using Geographic Information System (GIS) and Machine Learning

Dorothy Martin, S. Chai
{"title":"A Study on Development of Landslide Susceptibility Map in Malaysia Landslide Prone Areas by Using Geographic Information System (GIS) and Machine Learning","authors":"Dorothy Martin, S. Chai","doi":"10.1109/CSPA55076.2022.9781943","DOIUrl":null,"url":null,"abstract":"Landslide is a natural disaster that is common and frequently occurring in Malaysia. Thus, to reduce the impact of the landslide’s tragedy, a landslide susceptibility map is needed. The ultimate goal of this paper is to use Geographic Information System (GIS) and machine learning to develop a landslide susceptibility map. In two different landslide-prone areas in Malaysia, the performance of the two different machine learning models, Random Forest and Extreme Gradient Boosting (XGBoost) are evaluated and cross-validated. The Cameron Highland and Penang Island, Malaysia which are the subjects of this study, have a total of 233 and 443 landslides locations, respectively. These landslide locations were randomly divided into 70% for training and 30% for testing. The Digital Elevation Model (DEM), slope angle, slope length, Normalized Vegetation Index (NDVI), plan curvature, profile curvature, distance from the stream, distance from roads, Topographic Wetness Index (TWI) and Stream Power Index (SPI) are among the ten landslide conditioning factors, for which the spatial databases were developed by using GIS software. The area under the curve (AUC) of the receiver operating characteristic curve (ROC) had been applied to evaluate the machine learnings prediction accuracy. The result indicated that both XGBoost and Random Forest had a great performance across both study areas. For Penang Island, the AUC of XGBoost is 95.02% and the AUC of Random Forest is 94.99%. Meanwhile, for Cameron Highland, the AUC of XGBoost is 91.99% and the AUC of Random Forest is 92.32%. The final prediction map from this study might be useful for better planning in mitigating the occurrence of landslides.","PeriodicalId":174315,"journal":{"name":"2022 IEEE 18th International Colloquium on Signal Processing & Applications (CSPA)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 18th International Colloquium on Signal Processing & Applications (CSPA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CSPA55076.2022.9781943","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Landslide is a natural disaster that is common and frequently occurring in Malaysia. Thus, to reduce the impact of the landslide’s tragedy, a landslide susceptibility map is needed. The ultimate goal of this paper is to use Geographic Information System (GIS) and machine learning to develop a landslide susceptibility map. In two different landslide-prone areas in Malaysia, the performance of the two different machine learning models, Random Forest and Extreme Gradient Boosting (XGBoost) are evaluated and cross-validated. The Cameron Highland and Penang Island, Malaysia which are the subjects of this study, have a total of 233 and 443 landslides locations, respectively. These landslide locations were randomly divided into 70% for training and 30% for testing. The Digital Elevation Model (DEM), slope angle, slope length, Normalized Vegetation Index (NDVI), plan curvature, profile curvature, distance from the stream, distance from roads, Topographic Wetness Index (TWI) and Stream Power Index (SPI) are among the ten landslide conditioning factors, for which the spatial databases were developed by using GIS software. The area under the curve (AUC) of the receiver operating characteristic curve (ROC) had been applied to evaluate the machine learnings prediction accuracy. The result indicated that both XGBoost and Random Forest had a great performance across both study areas. For Penang Island, the AUC of XGBoost is 95.02% and the AUC of Random Forest is 94.99%. Meanwhile, for Cameron Highland, the AUC of XGBoost is 91.99% and the AUC of Random Forest is 92.32%. The final prediction map from this study might be useful for better planning in mitigating the occurrence of landslides.
基于地理信息系统(GIS)和机器学习的马来西亚滑坡易发地区滑坡易发图开发研究
滑坡是马来西亚常见且频繁发生的自然灾害。因此,为了减少滑坡悲剧的影响,需要绘制滑坡易感性图。本文的最终目标是利用地理信息系统(GIS)和机器学习来开发滑坡易感性图。在马来西亚两个不同的滑坡易发地区,对随机森林和极端梯度增强(XGBoost)两种不同机器学习模型的性能进行了评估和交叉验证。作为本次研究对象的马来西亚金马仑高原和槟城岛,分别有233个和443个滑坡地点。这些滑坡位置随机分为70%用于训练,30%用于测试。数字高程模型(DEM)、坡角、坡长、归一化植被指数(NDVI)、平面曲率、剖面曲率、距河流距离、距道路距离、地形湿度指数(TWI)和河流动力指数(SPI)是滑坡的10个影响因素,利用GIS软件建立了滑坡空间数据库。采用受试者工作特征曲线(ROC)的曲线下面积(AUC)来评价机器学习的预测精度。结果表明,XGBoost和Random Forest在两个研究领域都有很好的表现。对于Penang Island, XGBoost的AUC为95.02%,Random Forest的AUC为94.99%。同时,对于Cameron Highland, XGBoost的AUC为91.99%,Random Forest的AUC为92.32%。本研究的最终预测图可能有助于更好地规划减少滑坡的发生。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信