Sansar Raj Meena, Muhammad Afaq Hussain, Hafiz Ullah, Ibad Ullah
{"title":"使用混合机器学习分类器绘制滑坡易感性图:以巴基斯坦尼勒姆山谷为例","authors":"Sansar Raj Meena, Muhammad Afaq Hussain, Hafiz Ullah, Ibad Ullah","doi":"10.1007/s10064-025-04270-7","DOIUrl":null,"url":null,"abstract":"<div><p>The Neelum Valley in the Himalayan region of Pakistan frequently experiences landslides triggered by heavy rainfall, seismic activity, and human interventions, leading to significant loss of life and damage to infrastructure. Studying landslides in this region is essential for effective disaster management and risk assessment. Landslide susceptibility mapping (LSM) in areas like the Neelum Valley is crucial for proactive hazard mitigation and sustainable development. In this study, innovative data visualization approaches are used to create hybrid LSM, representing an innovation in LSM studies. This study aims to examine and compare the performance of various Machine Learning (ML) classifiers, including Extreme Gradient Boosting (XGBoost), Light Gradient Boosting Machine (LightGBM), Categorical Boosting (CatBoost) as well as a Hybrid classifier (XGB + LGBM + CatB), for mapping landslide susceptibility. Utilizing various data sources, a total of 360 landslide locations were initially identified in the Neelum Valley to create a comprehensive landslide inventory map. These locations were then randomly split into two datasets, using a 70/30 ratio, for training and validation purposes. In the second step, a landslide factor database was developed, consisting of 14 factors related to hydrology, climate, geology, topography, and human activities. Subsequently, Pearson's correlation coefficient and the ReliefF technique were applied to rank the importance of these factors. Additionally, several performance metrics were used to evaluate the classifiers, including the area under the receiver operating characteristic curve (AUROC), accuracy, precision, recall, F-measure, Matthew's correlation coefficients, root mean square error, mean square error, Cohens Kappa and Jaccard Index. The Hybrid classifier (XGB + LGBM + CATB) achieved the highest AUC value (0.9039), indicating it was the most efficient model, followed by XGBoost, LightGBM, and CatBoost, with AUC values of 0.8920, 0.8935, and 0.8945, respectively. The study concludes that the stacking ensemble classifier shows significant potential for LSM in Neelum Valley, effectively identifying landslide-prone areas.</p></div>","PeriodicalId":500,"journal":{"name":"Bulletin of Engineering Geology and the Environment","volume":"84 5","pages":""},"PeriodicalIF":3.7000,"publicationDate":"2025-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Landslide susceptibility mapping using hybrid machine learning classifiers: a case study of Neelum Valley, Pakistan\",\"authors\":\"Sansar Raj Meena, Muhammad Afaq Hussain, Hafiz Ullah, Ibad Ullah\",\"doi\":\"10.1007/s10064-025-04270-7\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>The Neelum Valley in the Himalayan region of Pakistan frequently experiences landslides triggered by heavy rainfall, seismic activity, and human interventions, leading to significant loss of life and damage to infrastructure. Studying landslides in this region is essential for effective disaster management and risk assessment. Landslide susceptibility mapping (LSM) in areas like the Neelum Valley is crucial for proactive hazard mitigation and sustainable development. In this study, innovative data visualization approaches are used to create hybrid LSM, representing an innovation in LSM studies. This study aims to examine and compare the performance of various Machine Learning (ML) classifiers, including Extreme Gradient Boosting (XGBoost), Light Gradient Boosting Machine (LightGBM), Categorical Boosting (CatBoost) as well as a Hybrid classifier (XGB + LGBM + CatB), for mapping landslide susceptibility. Utilizing various data sources, a total of 360 landslide locations were initially identified in the Neelum Valley to create a comprehensive landslide inventory map. These locations were then randomly split into two datasets, using a 70/30 ratio, for training and validation purposes. In the second step, a landslide factor database was developed, consisting of 14 factors related to hydrology, climate, geology, topography, and human activities. Subsequently, Pearson's correlation coefficient and the ReliefF technique were applied to rank the importance of these factors. Additionally, several performance metrics were used to evaluate the classifiers, including the area under the receiver operating characteristic curve (AUROC), accuracy, precision, recall, F-measure, Matthew's correlation coefficients, root mean square error, mean square error, Cohens Kappa and Jaccard Index. The Hybrid classifier (XGB + LGBM + CATB) achieved the highest AUC value (0.9039), indicating it was the most efficient model, followed by XGBoost, LightGBM, and CatBoost, with AUC values of 0.8920, 0.8935, and 0.8945, respectively. The study concludes that the stacking ensemble classifier shows significant potential for LSM in Neelum Valley, effectively identifying landslide-prone areas.</p></div>\",\"PeriodicalId\":500,\"journal\":{\"name\":\"Bulletin of Engineering Geology and the Environment\",\"volume\":\"84 5\",\"pages\":\"\"},\"PeriodicalIF\":3.7000,\"publicationDate\":\"2025-04-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Bulletin of Engineering Geology and the Environment\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s10064-025-04270-7\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"ENGINEERING, ENVIRONMENTAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bulletin of Engineering Geology and the Environment","FirstCategoryId":"5","ListUrlMain":"https://link.springer.com/article/10.1007/s10064-025-04270-7","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, ENVIRONMENTAL","Score":null,"Total":0}
Landslide susceptibility mapping using hybrid machine learning classifiers: a case study of Neelum Valley, Pakistan
The Neelum Valley in the Himalayan region of Pakistan frequently experiences landslides triggered by heavy rainfall, seismic activity, and human interventions, leading to significant loss of life and damage to infrastructure. Studying landslides in this region is essential for effective disaster management and risk assessment. Landslide susceptibility mapping (LSM) in areas like the Neelum Valley is crucial for proactive hazard mitigation and sustainable development. In this study, innovative data visualization approaches are used to create hybrid LSM, representing an innovation in LSM studies. This study aims to examine and compare the performance of various Machine Learning (ML) classifiers, including Extreme Gradient Boosting (XGBoost), Light Gradient Boosting Machine (LightGBM), Categorical Boosting (CatBoost) as well as a Hybrid classifier (XGB + LGBM + CatB), for mapping landslide susceptibility. Utilizing various data sources, a total of 360 landslide locations were initially identified in the Neelum Valley to create a comprehensive landslide inventory map. These locations were then randomly split into two datasets, using a 70/30 ratio, for training and validation purposes. In the second step, a landslide factor database was developed, consisting of 14 factors related to hydrology, climate, geology, topography, and human activities. Subsequently, Pearson's correlation coefficient and the ReliefF technique were applied to rank the importance of these factors. Additionally, several performance metrics were used to evaluate the classifiers, including the area under the receiver operating characteristic curve (AUROC), accuracy, precision, recall, F-measure, Matthew's correlation coefficients, root mean square error, mean square error, Cohens Kappa and Jaccard Index. The Hybrid classifier (XGB + LGBM + CATB) achieved the highest AUC value (0.9039), indicating it was the most efficient model, followed by XGBoost, LightGBM, and CatBoost, with AUC values of 0.8920, 0.8935, and 0.8945, respectively. The study concludes that the stacking ensemble classifier shows significant potential for LSM in Neelum Valley, effectively identifying landslide-prone areas.
期刊介绍:
Engineering geology is defined in the statutes of the IAEG as the science devoted to the investigation, study and solution of engineering and environmental problems which may arise as the result of the interaction between geology and the works or activities of man, as well as of the prediction of and development of measures for the prevention or remediation of geological hazards. Engineering geology embraces:
• the applications/implications of the geomorphology, structural geology, and hydrogeological conditions of geological formations;
• the characterisation of the mineralogical, physico-geomechanical, chemical and hydraulic properties of all earth materials involved in construction, resource recovery and environmental change;
• the assessment of the mechanical and hydrological behaviour of soil and rock masses;
• the prediction of changes to the above properties with time;
• the determination of the parameters to be considered in the stability analysis of engineering works and earth masses.