A Comparative Study of Susceptibility and Hazard for Mass Movements Applying Quantitative Machine Learning Techniques—Case Study: Northern Lima Commonwealth, Peru
Edwin Badillo-Rivera, Manuel Olcese, Ramiro Santiago, Teófilo Poma, Neftalí Muñoz, Carlos Rojas-León, Teodosio Chávez, Luz Eyzaguirre, César Rodríguez, Fernando Oyanguren
{"title":"A Comparative Study of Susceptibility and Hazard for Mass Movements Applying Quantitative Machine Learning Techniques—Case Study: Northern Lima Commonwealth, Peru","authors":"Edwin Badillo-Rivera, Manuel Olcese, Ramiro Santiago, Teófilo Poma, Neftalí Muñoz, Carlos Rojas-León, Teodosio Chávez, Luz Eyzaguirre, César Rodríguez, Fernando Oyanguren","doi":"10.3390/geosciences14060168","DOIUrl":null,"url":null,"abstract":"This study addresses the importance of conducting mass movement susceptibility mapping and hazard assessment using quantitative techniques, including machine learning, in the Northern Lima Commonwealth (NLC). A previous exploration of the topographic variables revealed a high correlation and multicollinearity among some of them, which led to dimensionality reduction through a principal component analysis (PCA). Six susceptibility models were generated using weights of evidence, logistic regression, multilayer perceptron, support vector machine, random forest, and naive Bayes methods to produce quantitative susceptibility maps and assess the hazard associated with two scenarios: the first being El Niño phenomenon and the second being an earthquake exceeding 8.8 Mw. The main findings indicate that machine learning models exhibit excellent predictive performance for the presence and absence of mass movement events, as all models surpassed an AUC value of >0.9, with the random forest model standing out. In terms of hazard levels, in the event of an El Niño phenomenon or an earthquake exceeding 8.8 Mw, approximately 40% and 35% respectively, of the NLC area would be exposed to the highest hazard levels. The importance of integrating methodologies in mass movement susceptibility models is also emphasized; these methodologies include the correlation analysis, multicollinearity assessment, dimensionality reduction of variables, and coupling statistical models with machine learning models to improve the predictive accuracy of machine learning models. The findings of this research are expected to serve as a supportive tool for land managers in formulating effective disaster prevention and risk reduction strategies.","PeriodicalId":509137,"journal":{"name":"Geosciences","volume":"37 24","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-06-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Geosciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/geosciences14060168","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
This study addresses the importance of conducting mass movement susceptibility mapping and hazard assessment using quantitative techniques, including machine learning, in the Northern Lima Commonwealth (NLC). A previous exploration of the topographic variables revealed a high correlation and multicollinearity among some of them, which led to dimensionality reduction through a principal component analysis (PCA). Six susceptibility models were generated using weights of evidence, logistic regression, multilayer perceptron, support vector machine, random forest, and naive Bayes methods to produce quantitative susceptibility maps and assess the hazard associated with two scenarios: the first being El Niño phenomenon and the second being an earthquake exceeding 8.8 Mw. The main findings indicate that machine learning models exhibit excellent predictive performance for the presence and absence of mass movement events, as all models surpassed an AUC value of >0.9, with the random forest model standing out. In terms of hazard levels, in the event of an El Niño phenomenon or an earthquake exceeding 8.8 Mw, approximately 40% and 35% respectively, of the NLC area would be exposed to the highest hazard levels. The importance of integrating methodologies in mass movement susceptibility models is also emphasized; these methodologies include the correlation analysis, multicollinearity assessment, dimensionality reduction of variables, and coupling statistical models with machine learning models to improve the predictive accuracy of machine learning models. The findings of this research are expected to serve as a supportive tool for land managers in formulating effective disaster prevention and risk reduction strategies.