用于训练样本不平衡情况下滑坡预测的生成对抗神经网络与多层堆叠集合混合模型

IF 3.9 3区 环境科学与生态学 Q1 ENGINEERING, CIVIL
Wajid Hussain, Hong Shu, Hasnain Abbas, Sajid Hussain, Isma Kulsoom, Saqib Hussain, Hajra Mustafa, Aftab Ahmed Khan, Muhammad Ismail, Javed Iqbal
{"title":"用于训练样本不平衡情况下滑坡预测的生成对抗神经网络与多层堆叠集合混合模型","authors":"Wajid Hussain, Hong Shu, Hasnain Abbas, Sajid Hussain, Isma Kulsoom, Saqib Hussain, Hajra Mustafa, Aftab Ahmed Khan, Muhammad Ismail, Javed Iqbal","doi":"10.1007/s00477-024-02722-2","DOIUrl":null,"url":null,"abstract":"<p>Gilgit-Baltistan, Pakistan, is particularly susceptible to landslides due to various geological, tectonics, meteorological, and anthropogenic factors consequently. However, the persisting conundrum of landslide database/data imbalance stands as a formidable challenge within this domain. To better stabilize the objective of landslide prediction, stacking ensemble Machine Learning and Generative Adversarial Network (GAN) were applied, because previous research in this area has mostly been limited by a lack of data. GAN is employed to synthesize training samples, ensuring the creation of a balanced dataset. Stacking ensemble architecture involves two stages of learning: the first class of learners incorporates diverse machine learning algorithms, while, the second level logistic regression model integrates prediction based on the strong learner, thereby enhancing overall prediction performance. To investigate landslide susceptibility in District Chilas, Northern Pakistan, we employed optical remote sensing and introduced a GAN with a Multi-Layers Hybrid Model (MLHM). This study involved the preparation of a spatial database with a total of 106 landslides and ten major landslide factors. We utilized a hybrid ensemble model and compared its performance with different algorithms like Conventional Neural Network, Artificial Neural network, Decision Tree, K-Nearest Neighbouring, and Hybrid Model, achieving accuracies of 0.91, 0.92, 0.90, 0.89, and 0.93, respectively. this approach has with Hybrid architecture learning accuracy of 0.98. The GAN with MLHM developed improved landslide susceptibility assessment with cross-comparison of Persistent Scattered Interferometric Synthetic Aperture Radar (PS-InSAR) investigation to ensure the safe functioning of KKH. </p>","PeriodicalId":21987,"journal":{"name":"Stochastic Environmental Research and Risk Assessment","volume":"71 1","pages":""},"PeriodicalIF":3.9000,"publicationDate":"2024-04-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"The generative adversarial neural network with multi-layers stack ensemble hybrid model for landslide prediction in case of training sample imbalance\",\"authors\":\"Wajid Hussain, Hong Shu, Hasnain Abbas, Sajid Hussain, Isma Kulsoom, Saqib Hussain, Hajra Mustafa, Aftab Ahmed Khan, Muhammad Ismail, Javed Iqbal\",\"doi\":\"10.1007/s00477-024-02722-2\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Gilgit-Baltistan, Pakistan, is particularly susceptible to landslides due to various geological, tectonics, meteorological, and anthropogenic factors consequently. However, the persisting conundrum of landslide database/data imbalance stands as a formidable challenge within this domain. To better stabilize the objective of landslide prediction, stacking ensemble Machine Learning and Generative Adversarial Network (GAN) were applied, because previous research in this area has mostly been limited by a lack of data. GAN is employed to synthesize training samples, ensuring the creation of a balanced dataset. Stacking ensemble architecture involves two stages of learning: the first class of learners incorporates diverse machine learning algorithms, while, the second level logistic regression model integrates prediction based on the strong learner, thereby enhancing overall prediction performance. To investigate landslide susceptibility in District Chilas, Northern Pakistan, we employed optical remote sensing and introduced a GAN with a Multi-Layers Hybrid Model (MLHM). This study involved the preparation of a spatial database with a total of 106 landslides and ten major landslide factors. We utilized a hybrid ensemble model and compared its performance with different algorithms like Conventional Neural Network, Artificial Neural network, Decision Tree, K-Nearest Neighbouring, and Hybrid Model, achieving accuracies of 0.91, 0.92, 0.90, 0.89, and 0.93, respectively. this approach has with Hybrid architecture learning accuracy of 0.98. The GAN with MLHM developed improved landslide susceptibility assessment with cross-comparison of Persistent Scattered Interferometric Synthetic Aperture Radar (PS-InSAR) investigation to ensure the safe functioning of KKH. </p>\",\"PeriodicalId\":21987,\"journal\":{\"name\":\"Stochastic Environmental Research and Risk Assessment\",\"volume\":\"71 1\",\"pages\":\"\"},\"PeriodicalIF\":3.9000,\"publicationDate\":\"2024-04-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Stochastic Environmental Research and Risk Assessment\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://doi.org/10.1007/s00477-024-02722-2\",\"RegionNum\":3,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, CIVIL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Stochastic Environmental Research and Risk Assessment","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.1007/s00477-024-02722-2","RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, CIVIL","Score":null,"Total":0}
引用次数: 0

摘要

由于地质、构造、气象和人为因素的影响,巴基斯坦吉尔吉特-巴尔蒂斯坦特别容易发生山体滑坡。然而,山体滑坡数据库/数据不平衡的难题始终是这一领域面临的巨大挑战。为了更好地稳定滑坡预测目标,我们应用了堆叠集合机器学习和生成对抗网络(GAN),因为该领域以往的研究大多受限于数据的缺乏。GAN 用于合成训练样本,确保创建一个平衡的数据集。堆叠集合架构包括两个学习阶段:第一级学习器包含多种机器学习算法,而第二级逻辑回归模型则基于强学习器进行综合预测,从而提高整体预测性能。为了研究巴基斯坦北部奇拉斯地区的滑坡易发性,我们采用了光学遥感技术,并引入了具有多层混合模型(MLHM)的 GAN。这项研究包括建立一个空间数据库,其中包含 106 个滑坡点和 10 个主要滑坡因素。我们使用了混合集合模型,并将其性能与传统神经网络、人工神经网络、决策树、K-最近邻和混合模型等不同算法进行了比较,其准确率分别为 0.91、0.92、0.90、0.89 和 0.93。通过与持久散射干涉合成孔径雷达(PS-InSAR)调查的交叉比较,利用 MLHM 开发的 GAN 改进了滑坡易感性评估,以确保 KKH 的安全运行。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

The generative adversarial neural network with multi-layers stack ensemble hybrid model for landslide prediction in case of training sample imbalance

The generative adversarial neural network with multi-layers stack ensemble hybrid model for landslide prediction in case of training sample imbalance

Gilgit-Baltistan, Pakistan, is particularly susceptible to landslides due to various geological, tectonics, meteorological, and anthropogenic factors consequently. However, the persisting conundrum of landslide database/data imbalance stands as a formidable challenge within this domain. To better stabilize the objective of landslide prediction, stacking ensemble Machine Learning and Generative Adversarial Network (GAN) were applied, because previous research in this area has mostly been limited by a lack of data. GAN is employed to synthesize training samples, ensuring the creation of a balanced dataset. Stacking ensemble architecture involves two stages of learning: the first class of learners incorporates diverse machine learning algorithms, while, the second level logistic regression model integrates prediction based on the strong learner, thereby enhancing overall prediction performance. To investigate landslide susceptibility in District Chilas, Northern Pakistan, we employed optical remote sensing and introduced a GAN with a Multi-Layers Hybrid Model (MLHM). This study involved the preparation of a spatial database with a total of 106 landslides and ten major landslide factors. We utilized a hybrid ensemble model and compared its performance with different algorithms like Conventional Neural Network, Artificial Neural network, Decision Tree, K-Nearest Neighbouring, and Hybrid Model, achieving accuracies of 0.91, 0.92, 0.90, 0.89, and 0.93, respectively. this approach has with Hybrid architecture learning accuracy of 0.98. The GAN with MLHM developed improved landslide susceptibility assessment with cross-comparison of Persistent Scattered Interferometric Synthetic Aperture Radar (PS-InSAR) investigation to ensure the safe functioning of KKH.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
7.10
自引率
9.50%
发文量
189
审稿时长
3.8 months
期刊介绍: Stochastic Environmental Research and Risk Assessment (SERRA) will publish research papers, reviews and technical notes on stochastic and probabilistic approaches to environmental sciences and engineering, including interactions of earth and atmospheric environments with people and ecosystems. The basic idea is to bring together research papers on stochastic modelling in various fields of environmental sciences and to provide an interdisciplinary forum for the exchange of ideas, for communicating on issues that cut across disciplinary barriers, and for the dissemination of stochastic techniques used in different fields to the community of interested researchers. Original contributions will be considered dealing with modelling (theoretical and computational), measurements and instrumentation in one or more of the following topical areas: - Spatiotemporal analysis and mapping of natural processes. - Enviroinformatics. - Environmental risk assessment, reliability analysis and decision making. - Surface and subsurface hydrology and hydraulics. - Multiphase porous media domains and contaminant transport modelling. - Hazardous waste site characterization. - Stochastic turbulence and random hydrodynamic fields. - Chaotic and fractal systems. - Random waves and seafloor morphology. - Stochastic atmospheric and climate processes. - Air pollution and quality assessment research. - Modern geostatistics. - Mechanisms of pollutant formation, emission, exposure and absorption. - Physical, chemical and biological analysis of human exposure from single and multiple media and routes; control and protection. - Bioinformatics. - Probabilistic methods in ecology and population biology. - Epidemiological investigations. - Models using stochastic differential equations stochastic or partial differential equations. - Hazardous waste site characterization.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信