Investigating the effects of different data classification methods on landslide susceptibility mapping

IF 2.8 3区 地球科学 Q2 ASTRONOMY & ASTROPHYSICS
Halil Akinci, Ayse Yavuz Ozalp
{"title":"Investigating the effects of different data classification methods on landslide susceptibility mapping","authors":"Halil Akinci,&nbsp;Ayse Yavuz Ozalp","doi":"10.1016/j.asr.2024.12.020","DOIUrl":null,"url":null,"abstract":"<div><div>In this study, landslide susceptibility maps (LSMs) were produced for three regions where landslides are common in the Eastern Black Sea Region of Türkiye. The regions studied include the districts of Trabzon, Rize and Artvin. The eXtreme Gradient Boosting (XGBoost) machine learning algorithm was used to generate the LSMs. Ten different factors that can affect landslides including lithology, land cover, topographic wetness index (TWI), plan and profile curvature, slope, elevation, aspect, distance to roads and drainages were used for the research. The study tested various spatial data classification methods for these factors. Specifically, the data was categorized using five distinct classification methods: “geometric interval,” “equal interval,” “manual interval,” “natural breaks,” and “quantile.” The main objective of the study was to see how these classification methods affect the accuracy of LSMs. For this purpose, six different models using the XGBoost algorithm were created. In the first model, continuous data was used for most of the factors, while some factors (aspect, land cover and lithology) were used as discrete data. The other five models categorized the data using the different classification methods mentioned above. The receiver operating characteristic (ROC) curve and area under the curve (AUC) approach were used to measure how well each model performed. The results showed that the Model_1 using mostly continuous data performed the best among all three study areas with the highest AUC value. The model with the lowest AUC value was the model using the equal interval classification method (Model_3). The most important finding gained from this study was that when producing LSMs, it is preferable to maintain continuous data as is rather than reclassifying it, as this improves the accuracy of the susceptibility model.</div></div>","PeriodicalId":50850,"journal":{"name":"Advances in Space Research","volume":"75 4","pages":"Pages 3427-3450"},"PeriodicalIF":2.8000,"publicationDate":"2025-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advances in Space Research","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0273117724012328","RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ASTRONOMY & ASTROPHYSICS","Score":null,"Total":0}
引用次数: 0

Abstract

In this study, landslide susceptibility maps (LSMs) were produced for three regions where landslides are common in the Eastern Black Sea Region of Türkiye. The regions studied include the districts of Trabzon, Rize and Artvin. The eXtreme Gradient Boosting (XGBoost) machine learning algorithm was used to generate the LSMs. Ten different factors that can affect landslides including lithology, land cover, topographic wetness index (TWI), plan and profile curvature, slope, elevation, aspect, distance to roads and drainages were used for the research. The study tested various spatial data classification methods for these factors. Specifically, the data was categorized using five distinct classification methods: “geometric interval,” “equal interval,” “manual interval,” “natural breaks,” and “quantile.” The main objective of the study was to see how these classification methods affect the accuracy of LSMs. For this purpose, six different models using the XGBoost algorithm were created. In the first model, continuous data was used for most of the factors, while some factors (aspect, land cover and lithology) were used as discrete data. The other five models categorized the data using the different classification methods mentioned above. The receiver operating characteristic (ROC) curve and area under the curve (AUC) approach were used to measure how well each model performed. The results showed that the Model_1 using mostly continuous data performed the best among all three study areas with the highest AUC value. The model with the lowest AUC value was the model using the equal interval classification method (Model_3). The most important finding gained from this study was that when producing LSMs, it is preferable to maintain continuous data as is rather than reclassifying it, as this improves the accuracy of the susceptibility model.
探讨不同数据分类方法对滑坡易感性填图的影响
在这项研究中,滑坡易感性地图(LSMs)制作了三个地区,其中滑坡是常见的东黑海地区的 rkiye。研究的地区包括特拉布宗、里泽和阿尔文地区。使用极限梯度增强(XGBoost)机器学习算法生成lsm。影响滑坡的10个不同因素包括岩性、土地覆盖、地形湿度指数(TWI)、平面和剖面曲率、坡度、高程、坡向、与道路的距离和排水系统。本研究对这些因素的不同空间数据分类方法进行了检验。具体来说,使用五种不同的分类方法对数据进行分类:“几何间隔”、“相等间隔”、“人工间隔”、“自然间隔”和“分位数”。本研究的主要目的是了解这些分类方法如何影响lsm的准确性。为此,使用XGBoost算法创建了六个不同的模型。在第一个模型中,大部分因子使用连续数据,而一些因子(坡向、土地覆盖和岩性)使用离散数据。其他五个模型使用上述不同的分类方法对数据进行分类。采用受试者工作特征(ROC)曲线和曲线下面积(AUC)方法来衡量每个模型的表现。结果表明,在AUC值最高的3个研究区域中,使用大部分连续数据的Model_1表现最好。AUC值最低的模型是采用等区间分类方法的模型(Model_3)。从这项研究中获得的最重要的发现是,当产生lsm时,最好保持连续的数据,而不是重新分类,因为这提高了敏感性模型的准确性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Advances in Space Research
Advances in Space Research 地学天文-地球科学综合
CiteScore
5.20
自引率
11.50%
发文量
800
审稿时长
5.8 months
期刊介绍: The COSPAR publication Advances in Space Research (ASR) is an open journal covering all areas of space research including: space studies of the Earth''s surface, meteorology, climate, the Earth-Moon system, planets and small bodies of the solar system, upper atmospheres, ionospheres and magnetospheres of the Earth and planets including reference atmospheres, space plasmas in the solar system, astrophysics from space, materials sciences in space, fundamental physics in space, space debris, space weather, Earth observations of space phenomena, etc. NB: Please note that manuscripts related to life sciences as related to space are no more accepted for submission to Advances in Space Research. Such manuscripts should now be submitted to the new COSPAR Journal Life Sciences in Space Research (LSSR). All submissions are reviewed by two scientists in the field. COSPAR is an interdisciplinary scientific organization concerned with the progress of space research on an international scale. Operating under the rules of ICSU, COSPAR ignores political considerations and considers all questions solely from the scientific viewpoint.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信