Automated Machine Learning-Based Landslide Susceptibility Mapping for the Three Gorges Reservoir Area, China

IF 3.6 3区地球科学 Q2 GEOSCIENCES, MULTIDISCIPLINARY

Mathematical Geosciences Pub Date : 2023-11-28 DOI:10.1007/s11004-023-10116-3

Junwei Ma, Dongze Lei, Zhiyuan Ren, Chunhai Tan, Ding Xia, Haixiang Guo

{"title":"Automated Machine Learning-Based Landslide Susceptibility Mapping for the Three Gorges Reservoir Area, China","authors":"Junwei Ma, Dongze Lei, Zhiyuan Ren, Chunhai Tan, Ding Xia, Haixiang Guo","doi":"10.1007/s11004-023-10116-3","DOIUrl":null,"url":null,"abstract":"<p>Machine learning (ML)-based landslide susceptibility mapping (LSM) has achieved substantial success in landslide risk management applications. However, the complexity of classically trained ML is often beyond nonexperts. With the rapid growth of practical applications, an “off-the-shelf” ML technique that can be easily used by nonexperts is highly relevant. In the present study, a new paradigm for an end-to-end ML modeling was adopted for LSM in the Three Gorges Reservoir area (TGRA) using automated machine learning (AutoML) as the backend model support for the paradigm. A well-defined database consisting of data from 290 landslides and 11 conditioning factors was collected for implementing AutoML and compared with classically trained ML approaches. The stacked ensemble model from AutoML achieved the best performance (0.954), surpassing the support vector machine with artificial bee colony optimization (ABC-SVM, 0.931), gray wolf optimization (GWO-SVM, 0.925), particle swarm optimization (PSO-SVM, 0.925), water cycle algorithm (WCA-SVM, 0.925), grid search (GS-SVM, 0.920), multilayer perceptron (MLP, 0.908), classification and regression tree (CART, 0.891), K-nearest neighbor (KNN, 0.898), and random forest (RF, 0.909) in terms of the area under the receiver operating characteristic curve (AUC). Notable improvements of up to 11% in AUC demonstrate that the AutoML approach succeeded in LSM and could be used to select the best model with minimal effort or intervention from the user. Moreover, a simple model that has been customarily ignored by practitioners and researchers has been identified with performance satisfying practical requirements. The experimental results indicate that AutoML provides an attractive alternative to manual ML practice, especially for practitioners with little expert knowledge in ML, by delivering a high-performance off-the-shelf solution for ML model development for LSM.</p>","PeriodicalId":51117,"journal":{"name":"Mathematical Geosciences","volume":"1 1","pages":""},"PeriodicalIF":3.6000,"publicationDate":"2023-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Mathematical Geosciences","FirstCategoryId":"89","ListUrlMain":"https://doi.org/10.1007/s11004-023-10116-3","RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"GEOSCIENCES, MULTIDISCIPLINARY","Score":null,"Total":0}

引用次数: 0

Abstract

Machine learning (ML)-based landslide susceptibility mapping (LSM) has achieved substantial success in landslide risk management applications. However, the complexity of classically trained ML is often beyond nonexperts. With the rapid growth of practical applications, an “off-the-shelf” ML technique that can be easily used by nonexperts is highly relevant. In the present study, a new paradigm for an end-to-end ML modeling was adopted for LSM in the Three Gorges Reservoir area (TGRA) using automated machine learning (AutoML) as the backend model support for the paradigm. A well-defined database consisting of data from 290 landslides and 11 conditioning factors was collected for implementing AutoML and compared with classically trained ML approaches. The stacked ensemble model from AutoML achieved the best performance (0.954), surpassing the support vector machine with artificial bee colony optimization (ABC-SVM, 0.931), gray wolf optimization (GWO-SVM, 0.925), particle swarm optimization (PSO-SVM, 0.925), water cycle algorithm (WCA-SVM, 0.925), grid search (GS-SVM, 0.920), multilayer perceptron (MLP, 0.908), classification and regression tree (CART, 0.891), K-nearest neighbor (KNN, 0.898), and random forest (RF, 0.909) in terms of the area under the receiver operating characteristic curve (AUC). Notable improvements of up to 11% in AUC demonstrate that the AutoML approach succeeded in LSM and could be used to select the best model with minimal effort or intervention from the user. Moreover, a simple model that has been customarily ignored by practitioners and researchers has been identified with performance satisfying practical requirements. The experimental results indicate that AutoML provides an attractive alternative to manual ML practice, especially for practitioners with little expert knowledge in ML, by delivering a high-performance off-the-shelf solution for ML model development for LSM.

Abstract Image

查看原文本刊更多论文

基于机器学习的三峡库区滑坡敏感性自动制图

基于机器学习(ML)的滑坡敏感性制图(LSM)在滑坡风险管理应用中取得了巨大的成功。然而，经过经典训练的机器学习的复杂性往往超出非专业人士。随着实际应用的快速增长，一种非专家可以轻松使用的“现成的”ML技术是高度相关的。在本研究中，采用一种新的端到端机器学习建模范式，以自动机器学习(AutoML)作为该范式的后端模型支持，对三峡库区(TGRA) LSM进行端到端机器学习建模。收集了来自290个滑坡和11个条件因素的数据组成的定义良好的数据库，用于实现AutoML，并与经典训练的ML方法进行了比较。AutoML的堆叠集成模型获得了最好的性能(0.954)，超过了人工蜂群优化(ABC-SVM, 0.931)、灰狼优化(GWO-SVM, 0.925)、粒子群优化(PSO-SVM, 0.925)、水循环算法(WCA-SVM, 0.925)、网格搜索(GS-SVM, 0.920)、多层感知器(MLP, 0.908)、分类与回归树(CART, 0.891)、k近邻(KNN, 0.898)和随机森林(RF, 0.908)的支持向量机。0.909)，表示接收器工作特性曲线(AUC)下的面积。AUC的显著改进高达11%，这表明AutoML方法在LSM中取得了成功，并且可以在用户最小的努力或干预下选择最佳模型。此外，一个通常被实践者和研究者忽视的简单模型也被证明具有满足实际要求的性能。实验结果表明，通过为LSM的ML模型开发提供高性能的现成解决方案，AutoML为人工ML实践提供了一个有吸引力的替代方案，特别是对于缺乏ML专业知识的从业者。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Mathematical Geosciences 地学-地球科学综合

CiteScore

5.30

自引率

15.40%

发文量

审稿时长

>12 weeks

期刊介绍： Mathematical Geosciences (formerly Mathematical Geology) publishes original, high-quality, interdisciplinary papers in geomathematics focusing on quantitative methods and studies of the Earth, its natural resources and the environment. This international publication is the official journal of the IAMG. Mathematical Geosciences is an essential reference for researchers and practitioners of geomathematics who develop and apply quantitative models to earth science and geo-engineering problems.