利用机器学习方法增强印度锡金滑坡易感性制图

IF 1.4 4区 地球科学 Q3 GEOSCIENCES, MULTIDISCIPLINARY
Geological Journal Pub Date : 2025-04-10 DOI:10.1002/gj.5198
Sujit Kumar Roy, Sumon Dey, Jayanta Das, Billal Hossen, Swarup Das, Md. Mahmudul Hasan, Pratik Mojumder
{"title":"利用机器学习方法增强印度锡金滑坡易感性制图","authors":"Sujit Kumar Roy,&nbsp;Sumon Dey,&nbsp;Jayanta Das,&nbsp;Billal Hossen,&nbsp;Swarup Das,&nbsp;Md. Mahmudul Hasan,&nbsp;Pratik Mojumder","doi":"10.1002/gj.5198","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>Landslides pose significant hazards in the mountainous region of Sikkim, India, necessitating accurate susceptibility mapping to mitigate risks. This study applies four machine learning models: Boosted Tree (BT), Gradient Boosting Machine (GBM), K-Nearest Neighbour (KNN), and Multilayer Perceptron (MLP) to develop a detailed landslide susceptibility map. Feature selection was performed using correlation analysis, the Boruta model, and multicollinearity tests, which identified 13 key landslide conditioning factors based on 1456 landslide inventory points. The GBM model demonstrated the highest predictive performance with an AUC of 0.99, followed by BT (AUC: 0.965), MLP (AUC: 0.940), and KNN (AUC: 0.895) in the testing dataset. The confusion matrix validation confirmed that GBM outperformed other models, achieving the highest F1 score (0.894) and accuracy (89.4%), followed by BT with an F1 score of 0.874 and accuracy of 87.8%. KNN and MLP displayed lower performance, with KNN showing an F1 score of 0.724 and accuracy of 72.6%, and MLP significantly underperforming with an F1 score of 0.096 and accuracy of 48.6%. Statistical significance testing using the Wilcoxon Signed-Rank Test revealed significant differences between BT and MLP (<i>p</i> = 0.018), while other model pairs exhibited no statistically significant performance differences. Additionally, the variable importance analysis highlighted Diurnal Temperature Range (DTR) as the most critical factor influencing landslide occurrence (43.99%), followed by elevation (21.59%). These findings provide valuable insights for policymakers and government authorities, enabling them to take necessary measures for effective landslide management in the vulnerable areas of Sikkim, confirming the efficacy of machine learning models for geohazard assessments.</p>\n </div>","PeriodicalId":12784,"journal":{"name":"Geological Journal","volume":"60 5","pages":"1150-1169"},"PeriodicalIF":1.4000,"publicationDate":"2025-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Utilising Machine Learning Approaches for Enhanced Landslide Susceptibility Mapping in Sikkim, India\",\"authors\":\"Sujit Kumar Roy,&nbsp;Sumon Dey,&nbsp;Jayanta Das,&nbsp;Billal Hossen,&nbsp;Swarup Das,&nbsp;Md. Mahmudul Hasan,&nbsp;Pratik Mojumder\",\"doi\":\"10.1002/gj.5198\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div>\\n \\n <p>Landslides pose significant hazards in the mountainous region of Sikkim, India, necessitating accurate susceptibility mapping to mitigate risks. This study applies four machine learning models: Boosted Tree (BT), Gradient Boosting Machine (GBM), K-Nearest Neighbour (KNN), and Multilayer Perceptron (MLP) to develop a detailed landslide susceptibility map. Feature selection was performed using correlation analysis, the Boruta model, and multicollinearity tests, which identified 13 key landslide conditioning factors based on 1456 landslide inventory points. The GBM model demonstrated the highest predictive performance with an AUC of 0.99, followed by BT (AUC: 0.965), MLP (AUC: 0.940), and KNN (AUC: 0.895) in the testing dataset. The confusion matrix validation confirmed that GBM outperformed other models, achieving the highest F1 score (0.894) and accuracy (89.4%), followed by BT with an F1 score of 0.874 and accuracy of 87.8%. KNN and MLP displayed lower performance, with KNN showing an F1 score of 0.724 and accuracy of 72.6%, and MLP significantly underperforming with an F1 score of 0.096 and accuracy of 48.6%. Statistical significance testing using the Wilcoxon Signed-Rank Test revealed significant differences between BT and MLP (<i>p</i> = 0.018), while other model pairs exhibited no statistically significant performance differences. Additionally, the variable importance analysis highlighted Diurnal Temperature Range (DTR) as the most critical factor influencing landslide occurrence (43.99%), followed by elevation (21.59%). These findings provide valuable insights for policymakers and government authorities, enabling them to take necessary measures for effective landslide management in the vulnerable areas of Sikkim, confirming the efficacy of machine learning models for geohazard assessments.</p>\\n </div>\",\"PeriodicalId\":12784,\"journal\":{\"name\":\"Geological Journal\",\"volume\":\"60 5\",\"pages\":\"1150-1169\"},\"PeriodicalIF\":1.4000,\"publicationDate\":\"2025-04-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Geological Journal\",\"FirstCategoryId\":\"89\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/gj.5198\",\"RegionNum\":4,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"GEOSCIENCES, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Geological Journal","FirstCategoryId":"89","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/gj.5198","RegionNum":4,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"GEOSCIENCES, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

摘要

山体滑坡对印度锡金山区造成重大危害,需要精确的易感性测绘来减轻风险。本研究应用四种机器学习模型:增强树(BT)、梯度增强机(GBM)、k近邻(KNN)和多层感知器(MLP)来开发详细的滑坡易感性图。利用相关分析、Boruta模型和多重共线性检验进行特征选择,基于1456个滑坡盘查点识别出13个关键的滑坡影响因素。在测试数据集中,GBM模型的预测性能最高,AUC为0.99,其次是BT (AUC: 0.965)、MLP (AUC: 0.940)和KNN (AUC: 0.895)。混淆矩阵验证证实,GBM的F1得分最高(0.894),准确率最高(89.4%),其次是BT, F1得分为0.874,准确率为87.8%。KNN和MLP表现较差,KNN的F1得分为0.724,准确率为72.6%,MLP的F1得分为0.096,准确率为48.6%。采用Wilcoxon sign - rank检验的统计显著性检验显示,BT与MLP之间存在显著性差异(p = 0.018),而其他模型对的性能差异无统计学意义。此外,变量重要性分析显示,日温差(DTR)是影响滑坡发生的最关键因素(43.99%),其次是海拔(21.59%)。这些发现为政策制定者和政府当局提供了有价值的见解,使他们能够采取必要的措施,在锡金的脆弱地区进行有效的滑坡管理,证实了机器学习模型在地质灾害评估中的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Utilising Machine Learning Approaches for Enhanced Landslide Susceptibility Mapping in Sikkim, India

Utilising Machine Learning Approaches for Enhanced Landslide Susceptibility Mapping in Sikkim, India

Landslides pose significant hazards in the mountainous region of Sikkim, India, necessitating accurate susceptibility mapping to mitigate risks. This study applies four machine learning models: Boosted Tree (BT), Gradient Boosting Machine (GBM), K-Nearest Neighbour (KNN), and Multilayer Perceptron (MLP) to develop a detailed landslide susceptibility map. Feature selection was performed using correlation analysis, the Boruta model, and multicollinearity tests, which identified 13 key landslide conditioning factors based on 1456 landslide inventory points. The GBM model demonstrated the highest predictive performance with an AUC of 0.99, followed by BT (AUC: 0.965), MLP (AUC: 0.940), and KNN (AUC: 0.895) in the testing dataset. The confusion matrix validation confirmed that GBM outperformed other models, achieving the highest F1 score (0.894) and accuracy (89.4%), followed by BT with an F1 score of 0.874 and accuracy of 87.8%. KNN and MLP displayed lower performance, with KNN showing an F1 score of 0.724 and accuracy of 72.6%, and MLP significantly underperforming with an F1 score of 0.096 and accuracy of 48.6%. Statistical significance testing using the Wilcoxon Signed-Rank Test revealed significant differences between BT and MLP (p = 0.018), while other model pairs exhibited no statistically significant performance differences. Additionally, the variable importance analysis highlighted Diurnal Temperature Range (DTR) as the most critical factor influencing landslide occurrence (43.99%), followed by elevation (21.59%). These findings provide valuable insights for policymakers and government authorities, enabling them to take necessary measures for effective landslide management in the vulnerable areas of Sikkim, confirming the efficacy of machine learning models for geohazard assessments.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Geological Journal
Geological Journal 地学-地球科学综合
CiteScore
4.20
自引率
11.10%
发文量
269
审稿时长
3 months
期刊介绍: In recent years there has been a growth of specialist journals within geological sciences. Nevertheless, there is an important role for a journal of an interdisciplinary kind. Traditionally, GEOLOGICAL JOURNAL has been such a journal and continues in its aim of promoting interest in all branches of the Geological Sciences, through publication of original research papers and review articles. The journal publishes Special Issues with a common theme or regional coverage e.g. Chinese Dinosaurs; Tectonics of the Eastern Mediterranean, Triassic basins of the Central and North Atlantic Borderlands). These are extensively cited. The Journal has a particular interest in publishing papers on regional case studies from any global locality which have conclusions of general interest. Such papers may emphasize aspects across the full spectrum of geological sciences.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信