{"title":"Uncertainties in landslide susceptibility prediction modeling: A review on the incompleteness of landslide inventory and its influence rules","authors":"Faming Huang , Daxiong Mao , Shui-Hua Jiang , Chuangbing Zhou , Xuanmei Fan , Ziqiang Zeng , Filippo Catani , Changshi Yu , Zhilu Chang , Jinsong Huang , Bingchen Jiang , Yijing Li","doi":"10.1016/j.gsf.2024.101886","DOIUrl":null,"url":null,"abstract":"<div><p>Landslide inventory is an indispensable output variable of landslide susceptibility prediction (LSP) modelling. However, the influence of landslide inventory incompleteness on LSP and the transfer rules of LSP resulting error in the model have not been explored. Adopting Xunwu County, China, as an example, the existing landslide inventory is first obtained and assumed to contain all landslide inventory samples under ideal conditions, after which different landslide inventory sample missing conditions are simulated by random sampling. It includes the condition that the landslide inventory samples in the whole study area are missing randomly at the proportions of 10%, 20%, 30%, 40% and 50%, as well as the condition that the landslide inventory samples in the south of Xunwu County are missing in aggregation. Then, five machine learning models, namely, Random Forest (RF), and Support Vector Machine (SVM), are used to perform LSP. Finally, the LSP results are evaluated to analyze the LSP uncertainties under various conditions. In addition, this study introduces various interpretability methods of machine learning model to explore the changes in the decision basis of the RF model under various conditions. Results show that (1) randomly missing landslide inventory samples at certain proportions (10%–50%) may affect the LSP results for local areas. (2) Aggregation of missing landslide inventory samples may cause significant biases in LSP, particularly in areas where samples are missing. (3) When 50% of landslide samples are missing (either randomly or aggregated), the changes in the decision basis of the RF model are mainly manifested in two aspects: first, the importance ranking of environmental factors slightly differs; second, in regard to LSP modelling in the same test grid unit, the weights of individual model factors may drastically vary.</p></div>","PeriodicalId":12711,"journal":{"name":"Geoscience frontiers","volume":"15 6","pages":"Article 101886"},"PeriodicalIF":8.5000,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S1674987124001105/pdfft?md5=d219d857960e78562f2dcaa80cfe9ca3&pid=1-s2.0-S1674987124001105-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Geoscience frontiers","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1674987124001105","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GEOSCIENCES, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
Landslide inventory is an indispensable output variable of landslide susceptibility prediction (LSP) modelling. However, the influence of landslide inventory incompleteness on LSP and the transfer rules of LSP resulting error in the model have not been explored. Adopting Xunwu County, China, as an example, the existing landslide inventory is first obtained and assumed to contain all landslide inventory samples under ideal conditions, after which different landslide inventory sample missing conditions are simulated by random sampling. It includes the condition that the landslide inventory samples in the whole study area are missing randomly at the proportions of 10%, 20%, 30%, 40% and 50%, as well as the condition that the landslide inventory samples in the south of Xunwu County are missing in aggregation. Then, five machine learning models, namely, Random Forest (RF), and Support Vector Machine (SVM), are used to perform LSP. Finally, the LSP results are evaluated to analyze the LSP uncertainties under various conditions. In addition, this study introduces various interpretability methods of machine learning model to explore the changes in the decision basis of the RF model under various conditions. Results show that (1) randomly missing landslide inventory samples at certain proportions (10%–50%) may affect the LSP results for local areas. (2) Aggregation of missing landslide inventory samples may cause significant biases in LSP, particularly in areas where samples are missing. (3) When 50% of landslide samples are missing (either randomly or aggregated), the changes in the decision basis of the RF model are mainly manifested in two aspects: first, the importance ranking of environmental factors slightly differs; second, in regard to LSP modelling in the same test grid unit, the weights of individual model factors may drastically vary.
Geoscience frontiersEarth and Planetary Sciences-General Earth and Planetary Sciences
CiteScore
17.80
自引率
3.40%
发文量
147
审稿时长
35 days
期刊介绍:
Geoscience Frontiers (GSF) is the Journal of China University of Geosciences (Beijing) and Peking University. It publishes peer-reviewed research articles and reviews in interdisciplinary fields of Earth and Planetary Sciences. GSF covers various research areas including petrology and geochemistry, lithospheric architecture and mantle dynamics, global tectonics, economic geology and fuel exploration, geophysics, stratigraphy and paleontology, environmental and engineering geology, astrogeology, and the nexus of resources-energy-emissions-climate under Sustainable Development Goals. The journal aims to bridge innovative, provocative, and challenging concepts and models in these fields, providing insights on correlations and evolution.