Landslide susceptibility mapping using hybrid machine learning classifiers: a case study of Neelum Valley, Pakistan

IF 3.7 2区 工程技术 Q3 ENGINEERING, ENVIRONMENTAL
Sansar Raj Meena, Muhammad Afaq Hussain, Hafiz Ullah, Ibad Ullah
{"title":"Landslide susceptibility mapping using hybrid machine learning classifiers: a case study of Neelum Valley, Pakistan","authors":"Sansar Raj Meena,&nbsp;Muhammad Afaq Hussain,&nbsp;Hafiz Ullah,&nbsp;Ibad Ullah","doi":"10.1007/s10064-025-04270-7","DOIUrl":null,"url":null,"abstract":"<div><p>The Neelum Valley in the Himalayan region of Pakistan frequently experiences landslides triggered by heavy rainfall, seismic activity, and human interventions, leading to significant loss of life and damage to infrastructure. Studying landslides in this region is essential for effective disaster management and risk assessment. Landslide susceptibility mapping (LSM) in areas like the Neelum Valley is crucial for proactive hazard mitigation and sustainable development. In this study, innovative data visualization approaches are used to create hybrid LSM, representing an innovation in LSM studies. This study aims to examine and compare the performance of various Machine Learning (ML) classifiers, including Extreme Gradient Boosting (XGBoost), Light Gradient Boosting Machine (LightGBM), Categorical Boosting (CatBoost) as well as a Hybrid classifier (XGB + LGBM + CatB), for mapping landslide susceptibility. Utilizing various data sources, a total of 360 landslide locations were initially identified in the Neelum Valley to create a comprehensive landslide inventory map. These locations were then randomly split into two datasets, using a 70/30 ratio, for training and validation purposes. In the second step, a landslide factor database was developed, consisting of 14 factors related to hydrology, climate, geology, topography, and human activities. Subsequently, Pearson's correlation coefficient and the ReliefF technique were applied to rank the importance of these factors. Additionally, several performance metrics were used to evaluate the classifiers, including the area under the receiver operating characteristic curve (AUROC), accuracy, precision, recall, F-measure, Matthew's correlation coefficients, root mean square error, mean square error, Cohens Kappa and Jaccard Index. The Hybrid classifier (XGB + LGBM + CATB) achieved the highest AUC value (0.9039), indicating it was the most efficient model, followed by XGBoost, LightGBM, and CatBoost, with AUC values of 0.8920, 0.8935, and 0.8945, respectively. The study concludes that the stacking ensemble classifier shows significant potential for LSM in Neelum Valley, effectively identifying landslide-prone areas.</p></div>","PeriodicalId":500,"journal":{"name":"Bulletin of Engineering Geology and the Environment","volume":"84 5","pages":""},"PeriodicalIF":3.7000,"publicationDate":"2025-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bulletin of Engineering Geology and the Environment","FirstCategoryId":"5","ListUrlMain":"https://link.springer.com/article/10.1007/s10064-025-04270-7","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, ENVIRONMENTAL","Score":null,"Total":0}
引用次数: 0

Abstract

The Neelum Valley in the Himalayan region of Pakistan frequently experiences landslides triggered by heavy rainfall, seismic activity, and human interventions, leading to significant loss of life and damage to infrastructure. Studying landslides in this region is essential for effective disaster management and risk assessment. Landslide susceptibility mapping (LSM) in areas like the Neelum Valley is crucial for proactive hazard mitigation and sustainable development. In this study, innovative data visualization approaches are used to create hybrid LSM, representing an innovation in LSM studies. This study aims to examine and compare the performance of various Machine Learning (ML) classifiers, including Extreme Gradient Boosting (XGBoost), Light Gradient Boosting Machine (LightGBM), Categorical Boosting (CatBoost) as well as a Hybrid classifier (XGB + LGBM + CatB), for mapping landslide susceptibility. Utilizing various data sources, a total of 360 landslide locations were initially identified in the Neelum Valley to create a comprehensive landslide inventory map. These locations were then randomly split into two datasets, using a 70/30 ratio, for training and validation purposes. In the second step, a landslide factor database was developed, consisting of 14 factors related to hydrology, climate, geology, topography, and human activities. Subsequently, Pearson's correlation coefficient and the ReliefF technique were applied to rank the importance of these factors. Additionally, several performance metrics were used to evaluate the classifiers, including the area under the receiver operating characteristic curve (AUROC), accuracy, precision, recall, F-measure, Matthew's correlation coefficients, root mean square error, mean square error, Cohens Kappa and Jaccard Index. The Hybrid classifier (XGB + LGBM + CATB) achieved the highest AUC value (0.9039), indicating it was the most efficient model, followed by XGBoost, LightGBM, and CatBoost, with AUC values of 0.8920, 0.8935, and 0.8945, respectively. The study concludes that the stacking ensemble classifier shows significant potential for LSM in Neelum Valley, effectively identifying landslide-prone areas.

使用混合机器学习分类器绘制滑坡易感性图:以巴基斯坦尼勒姆山谷为例
巴基斯坦喜马拉雅地区的尼勒姆山谷经常发生由强降雨、地震活动和人为干预引发的山体滑坡,导致重大生命损失和基础设施破坏。研究该地区的山体滑坡对有效的灾害管理和风险评估至关重要。在尼勒姆河谷等地区进行滑坡易感性测绘(LSM)对于主动减轻灾害和可持续发展至关重要。在本研究中,创新的数据可视化方法被用于创建混合LSM,代表了LSM研究的创新。本研究旨在检查和比较各种机器学习(ML)分类器的性能,包括极端梯度增强(XGBoost)、轻梯度增强机(LightGBM)、分类增强(CatBoost)以及用于绘制滑坡易感性的混合分类器(XGB + LGBM + CatB)。利用各种数据源,初步确定了尼勒姆河谷共360个滑坡位置,创建了一个全面的滑坡清单图。然后将这些位置随机分成两个数据集,使用70/30的比例,用于训练和验证目的。第二步,建立滑坡因子数据库,由水文、气候、地质、地形、人类活动等14个因子组成。随后,应用Pearson相关系数和ReliefF技术对这些因素的重要性进行排序。此外,还使用了几个性能指标来评估分类器,包括接收者工作特征曲线下面积(AUROC)、准确度、精密度、召回率、f测量、马修相关系数、均方根误差、均方误差、科恩斯卡帕和贾卡德指数。Hybrid分类器(XGB + LGBM + CATB)的AUC值最高(0.9039),是最高效的分类器,其次是XGBoost、LightGBM和CatBoost, AUC值分别为0.8920、0.8935和0.8945。研究表明,层叠集成分类器在Neelum河谷的滑坡易发区具有显著的识别潜力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Bulletin of Engineering Geology and the Environment
Bulletin of Engineering Geology and the Environment 工程技术-地球科学综合
CiteScore
7.10
自引率
11.90%
发文量
445
审稿时长
4.1 months
期刊介绍: Engineering geology is defined in the statutes of the IAEG as the science devoted to the investigation, study and solution of engineering and environmental problems which may arise as the result of the interaction between geology and the works or activities of man, as well as of the prediction of and development of measures for the prevention or remediation of geological hazards. Engineering geology embraces: • the applications/implications of the geomorphology, structural geology, and hydrogeological conditions of geological formations; • the characterisation of the mineralogical, physico-geomechanical, chemical and hydraulic properties of all earth materials involved in construction, resource recovery and environmental change; • the assessment of the mechanical and hydrological behaviour of soil and rock masses; • the prediction of changes to the above properties with time; • the determination of the parameters to be considered in the stability analysis of engineering works and earth masses.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信