{"title":"基于地理信息系统(GIS)和机器学习的马来西亚滑坡易发地区滑坡易发图开发研究","authors":"Dorothy Martin, S. Chai","doi":"10.1109/CSPA55076.2022.9781943","DOIUrl":null,"url":null,"abstract":"Landslide is a natural disaster that is common and frequently occurring in Malaysia. Thus, to reduce the impact of the landslide’s tragedy, a landslide susceptibility map is needed. The ultimate goal of this paper is to use Geographic Information System (GIS) and machine learning to develop a landslide susceptibility map. In two different landslide-prone areas in Malaysia, the performance of the two different machine learning models, Random Forest and Extreme Gradient Boosting (XGBoost) are evaluated and cross-validated. The Cameron Highland and Penang Island, Malaysia which are the subjects of this study, have a total of 233 and 443 landslides locations, respectively. These landslide locations were randomly divided into 70% for training and 30% for testing. The Digital Elevation Model (DEM), slope angle, slope length, Normalized Vegetation Index (NDVI), plan curvature, profile curvature, distance from the stream, distance from roads, Topographic Wetness Index (TWI) and Stream Power Index (SPI) are among the ten landslide conditioning factors, for which the spatial databases were developed by using GIS software. The area under the curve (AUC) of the receiver operating characteristic curve (ROC) had been applied to evaluate the machine learnings prediction accuracy. The result indicated that both XGBoost and Random Forest had a great performance across both study areas. For Penang Island, the AUC of XGBoost is 95.02% and the AUC of Random Forest is 94.99%. Meanwhile, for Cameron Highland, the AUC of XGBoost is 91.99% and the AUC of Random Forest is 92.32%. The final prediction map from this study might be useful for better planning in mitigating the occurrence of landslides.","PeriodicalId":174315,"journal":{"name":"2022 IEEE 18th International Colloquium on Signal Processing & Applications (CSPA)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"A Study on Development of Landslide Susceptibility Map in Malaysia Landslide Prone Areas by Using Geographic Information System (GIS) and Machine Learning\",\"authors\":\"Dorothy Martin, S. Chai\",\"doi\":\"10.1109/CSPA55076.2022.9781943\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Landslide is a natural disaster that is common and frequently occurring in Malaysia. Thus, to reduce the impact of the landslide’s tragedy, a landslide susceptibility map is needed. The ultimate goal of this paper is to use Geographic Information System (GIS) and machine learning to develop a landslide susceptibility map. In two different landslide-prone areas in Malaysia, the performance of the two different machine learning models, Random Forest and Extreme Gradient Boosting (XGBoost) are evaluated and cross-validated. The Cameron Highland and Penang Island, Malaysia which are the subjects of this study, have a total of 233 and 443 landslides locations, respectively. These landslide locations were randomly divided into 70% for training and 30% for testing. The Digital Elevation Model (DEM), slope angle, slope length, Normalized Vegetation Index (NDVI), plan curvature, profile curvature, distance from the stream, distance from roads, Topographic Wetness Index (TWI) and Stream Power Index (SPI) are among the ten landslide conditioning factors, for which the spatial databases were developed by using GIS software. The area under the curve (AUC) of the receiver operating characteristic curve (ROC) had been applied to evaluate the machine learnings prediction accuracy. The result indicated that both XGBoost and Random Forest had a great performance across both study areas. For Penang Island, the AUC of XGBoost is 95.02% and the AUC of Random Forest is 94.99%. Meanwhile, for Cameron Highland, the AUC of XGBoost is 91.99% and the AUC of Random Forest is 92.32%. The final prediction map from this study might be useful for better planning in mitigating the occurrence of landslides.\",\"PeriodicalId\":174315,\"journal\":{\"name\":\"2022 IEEE 18th International Colloquium on Signal Processing & Applications (CSPA)\",\"volume\":\"13 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-05-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE 18th International Colloquium on Signal Processing & Applications (CSPA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CSPA55076.2022.9781943\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 18th International Colloquium on Signal Processing & Applications (CSPA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CSPA55076.2022.9781943","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
摘要
滑坡是马来西亚常见且频繁发生的自然灾害。因此,为了减少滑坡悲剧的影响,需要绘制滑坡易感性图。本文的最终目标是利用地理信息系统(GIS)和机器学习来开发滑坡易感性图。在马来西亚两个不同的滑坡易发地区,对随机森林和极端梯度增强(XGBoost)两种不同机器学习模型的性能进行了评估和交叉验证。作为本次研究对象的马来西亚金马仑高原和槟城岛,分别有233个和443个滑坡地点。这些滑坡位置随机分为70%用于训练,30%用于测试。数字高程模型(DEM)、坡角、坡长、归一化植被指数(NDVI)、平面曲率、剖面曲率、距河流距离、距道路距离、地形湿度指数(TWI)和河流动力指数(SPI)是滑坡的10个影响因素,利用GIS软件建立了滑坡空间数据库。采用受试者工作特征曲线(ROC)的曲线下面积(AUC)来评价机器学习的预测精度。结果表明,XGBoost和Random Forest在两个研究领域都有很好的表现。对于Penang Island, XGBoost的AUC为95.02%,Random Forest的AUC为94.99%。同时,对于Cameron Highland, XGBoost的AUC为91.99%,Random Forest的AUC为92.32%。本研究的最终预测图可能有助于更好地规划减少滑坡的发生。
A Study on Development of Landslide Susceptibility Map in Malaysia Landslide Prone Areas by Using Geographic Information System (GIS) and Machine Learning
Landslide is a natural disaster that is common and frequently occurring in Malaysia. Thus, to reduce the impact of the landslide’s tragedy, a landslide susceptibility map is needed. The ultimate goal of this paper is to use Geographic Information System (GIS) and machine learning to develop a landslide susceptibility map. In two different landslide-prone areas in Malaysia, the performance of the two different machine learning models, Random Forest and Extreme Gradient Boosting (XGBoost) are evaluated and cross-validated. The Cameron Highland and Penang Island, Malaysia which are the subjects of this study, have a total of 233 and 443 landslides locations, respectively. These landslide locations were randomly divided into 70% for training and 30% for testing. The Digital Elevation Model (DEM), slope angle, slope length, Normalized Vegetation Index (NDVI), plan curvature, profile curvature, distance from the stream, distance from roads, Topographic Wetness Index (TWI) and Stream Power Index (SPI) are among the ten landslide conditioning factors, for which the spatial databases were developed by using GIS software. The area under the curve (AUC) of the receiver operating characteristic curve (ROC) had been applied to evaluate the machine learnings prediction accuracy. The result indicated that both XGBoost and Random Forest had a great performance across both study areas. For Penang Island, the AUC of XGBoost is 95.02% and the AUC of Random Forest is 94.99%. Meanwhile, for Cameron Highland, the AUC of XGBoost is 91.99% and the AUC of Random Forest is 92.32%. The final prediction map from this study might be useful for better planning in mitigating the occurrence of landslides.