Enhancing landslide susceptibility mapping in the Himalayas: geospatial and machine learning with explainable AI (XAI)
Authors: Manas Utthasini, Idhayachandhiran Ilampooranan, Suraj Kumar Singh, Shruti Kanga, Pankaj Kumar, Krishnagopal Halder, Biswajeet Pradhan, Amit Kumar Srivastava, Ranit Sundar Chatterjee, Rabin Chakrabortty, Tarig Ali, Gowhar Meraj
Journal: Gondwana Research, Volume 149, Pages 262-290 (Impact Factor 7.2; JCR Q1, Geosciences, Multidisciplinary)
Published: 2025-09-06
DOI: 10.1016/j.gr.2025.08.003
URL: https://www.sciencedirect.com/science/article/pii/S1342937X25002679
Citations: 0
Abstract
Landslides present a critical hazard in the Himalayas, where steep topography, intense rainfall, and tectonic activity converge to destabilize slopes. Accurate delineation of high-susceptibility zones is essential to safeguard lives, infrastructure, and ecosystems. Here, we construct a comprehensive Landslide Susceptibility Map (LSM) for Uttarakhand, a landslide-prone state in northern India, by integrating advanced ensemble machine learning (ML) with explainable AI. Our analysis comprises 35 geo-environmental variables, ranging from historical landslide inventories and remote sensing data to GIS-based geomorphological, hydrological, and anthropogenic layers. We evaluate six ML models (Logistic Regression, Support Vector Machine, Random Forest, Extra Trees, Gradient Boosting, and eXtreme Gradient Boosting) before consolidating them into a stacking ensemble (SE), achieving an Area Under the Curve (AUC) of 0.987 on the training set and 0.979 on the test set. Across models, false-negative rates were low; Extra Trees minimized missed events (FNR = 3.5 %) but with a high false-positive rate (23.6 %), whereas XGBoost and the SE achieved a better sensitivity–specificity balance (FNR = 5.6 and 5.5 %, respectively) with comparatively lower false positives, favoring operational use. Spatial transferability to Sikkim was strong (Uttarakhand test accuracies 0.864–0.917; Sikkim 0.905–0.971), with XGBoost yielding the highest Sikkim test accuracy (0.971) and ensemble approaches (GB, XGBoost, SE) all exceeding 0.96, highlighting robust generalization across different Himalayan regions. Our ensemble model surpasses all individual models and classifies the study area into five susceptibility zones (very low to very high), with 18.20 % of Uttarakhand, particularly in Pithoragarh, Chamoli, and Rudraprayag districts, falling under high-susceptibility zones. 
Further interpretability is provided by SHapley Additive exPlanations (SHAP), which highlight key drivers of slope failure, including slope angle, fault proximity, and rainfall. Our findings highlight the value of combining robust ML techniques with geoscientific data, thereby enhancing hazard assessments and informing disaster risk reduction across the Himalayas and similarly vulnerable terrains worldwide.
About the journal:
Gondwana Research (GR) is an international journal that aims to promote high-quality research publications on all topics related to the solid Earth, particularly with reference to the origin and evolution of continents, continental assemblies, and their resources. GR is an "all earth science" journal with no restrictions on geological time, terrane, or theme. It covers a wide spectrum of topics in the geosciences, such as geology, geomorphology, palaeontology, structure, petrology, geochemistry, stable isotopes, geochronology, economic geology, exploration geology, engineering geology, geophysics, and environmental geology, among other themes, and provides a forum for integrating studies from different disciplines and different terrains. In addition to regular articles and thematic issues, the journal invites high-profile, state-of-the-art reviews on thrust-area topics for its column, "GR FOCUS". Focus articles include short biographies and photographs of the authors. Short articles (within ten printed pages) for rapid publication, reporting important discoveries or innovative models of global interest, are considered under the category "GR LETTERS".