基于超采样技术的风险预测和通过树状结构parzed估计器优化的集合模型

IF 4.2 1区 地球科学 Q1 GEOSCIENCES, MULTIDISCIPLINARY
{"title":"基于超采样技术的风险预测和通过树状结构parzed估计器优化的集合模型","authors":"","doi":"10.1016/j.ijdrr.2024.104753","DOIUrl":null,"url":null,"abstract":"<div><p>High accuracy prediction of urban flood risk is conducive to avoid potential losses, however, it's negatively affected by unbalanced data. Furthermore, ensemble model has been demonstrated to have the ability to improve to prediction accuracy. Nevertheless, the performance of ensemble model is influenced by basic model and ensemble rules, and determining the best ensemble model remains an open issue. To improve the accuracy of flood risk prediction, an approach covering data optimization and ensemble modeling was presented to optimize unbalanced flood data and the selection of various ensemble models based on efficiency and performance. A practical application in Zhengzhou City shows that Borderline-SMOTE2 is the most applicable for optimizing the flood risk data among the state-of-the-art oversampling algorithm utilized, because of the excellent entropy value. The effect of unbalanced data on the performance of the basic models was pervasive according to changes of the common indicators. The optimal ensemble model for flood risk prediction is composed of K-Nearest Neighbor, Decision Tree, Gaussian Naive Bayes and Extreme Gradient Boosting under Stacking rule in the current study. The results of this study supply the valuable reference for the flood prediction and mitigation.</p></div>","PeriodicalId":13915,"journal":{"name":"International journal of disaster risk reduction","volume":null,"pages":null},"PeriodicalIF":4.2000,"publicationDate":"2024-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Risk prediction based on oversampling technology and ensemble model optimized by tree-structured parzed estimator\",\"authors\":\"\",\"doi\":\"10.1016/j.ijdrr.2024.104753\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>High accuracy prediction of urban flood risk is conducive to avoid potential losses, however, it's negatively affected by unbalanced data. Furthermore, ensemble model has been demonstrated to have the ability to improve to prediction accuracy. Nevertheless, the performance of ensemble model is influenced by basic model and ensemble rules, and determining the best ensemble model remains an open issue. To improve the accuracy of flood risk prediction, an approach covering data optimization and ensemble modeling was presented to optimize unbalanced flood data and the selection of various ensemble models based on efficiency and performance. A practical application in Zhengzhou City shows that Borderline-SMOTE2 is the most applicable for optimizing the flood risk data among the state-of-the-art oversampling algorithm utilized, because of the excellent entropy value. The effect of unbalanced data on the performance of the basic models was pervasive according to changes of the common indicators. The optimal ensemble model for flood risk prediction is composed of K-Nearest Neighbor, Decision Tree, Gaussian Naive Bayes and Extreme Gradient Boosting under Stacking rule in the current study. The results of this study supply the valuable reference for the flood prediction and mitigation.</p></div>\",\"PeriodicalId\":13915,\"journal\":{\"name\":\"International journal of disaster risk reduction\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":4.2000,\"publicationDate\":\"2024-08-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International journal of disaster risk reduction\",\"FirstCategoryId\":\"89\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2212420924005156\",\"RegionNum\":1,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"GEOSCIENCES, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International journal of disaster risk reduction","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2212420924005156","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GEOSCIENCES, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

摘要

高精度的城市洪水风险预测有利于避免潜在损失,但不平衡数据会对其产生负面影响。此外,集合模型已被证明具有提高预测精度的能力。然而,集合模型的性能受基本模型和集合规则的影响,如何确定最佳的集合模型仍是一个有待解决的问题。为了提高洪水风险预测的准确性,提出了一种涵盖数据优化和集合建模的方法,以优化不平衡洪水数据,并根据效率和性能选择各种集合模型。在郑州市的实际应用表明,Borderline-SMOTE2 是目前最先进的超采样算法中最适用于洪水风险数据优化的算法,因为它具有出色的熵值。根据共性指标的变化,不平衡数据对基本模型性能的影响是普遍存在的。在本研究中,洪水风险预测的最优集合模型由 K-近邻、决策树、高斯奈维贝叶和堆叠规则下的极梯度提升组成。研究结果为洪水预测和减灾提供了有价值的参考。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Risk prediction based on oversampling technology and ensemble model optimized by tree-structured parzed estimator

High accuracy prediction of urban flood risk is conducive to avoid potential losses, however, it's negatively affected by unbalanced data. Furthermore, ensemble model has been demonstrated to have the ability to improve to prediction accuracy. Nevertheless, the performance of ensemble model is influenced by basic model and ensemble rules, and determining the best ensemble model remains an open issue. To improve the accuracy of flood risk prediction, an approach covering data optimization and ensemble modeling was presented to optimize unbalanced flood data and the selection of various ensemble models based on efficiency and performance. A practical application in Zhengzhou City shows that Borderline-SMOTE2 is the most applicable for optimizing the flood risk data among the state-of-the-art oversampling algorithm utilized, because of the excellent entropy value. The effect of unbalanced data on the performance of the basic models was pervasive according to changes of the common indicators. The optimal ensemble model for flood risk prediction is composed of K-Nearest Neighbor, Decision Tree, Gaussian Naive Bayes and Extreme Gradient Boosting under Stacking rule in the current study. The results of this study supply the valuable reference for the flood prediction and mitigation.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
International journal of disaster risk reduction
International journal of disaster risk reduction GEOSCIENCES, MULTIDISCIPLINARYMETEOROLOGY-METEOROLOGY & ATMOSPHERIC SCIENCES
CiteScore
8.70
自引率
18.00%
发文量
688
审稿时长
79 days
期刊介绍: The International Journal of Disaster Risk Reduction (IJDRR) is the journal for researchers, policymakers and practitioners across diverse disciplines: earth sciences and their implications; environmental sciences; engineering; urban studies; geography; and the social sciences. IJDRR publishes fundamental and applied research, critical reviews, policy papers and case studies with a particular focus on multi-disciplinary research that aims to reduce the impact of natural, technological, social and intentional disasters. IJDRR stimulates exchange of ideas and knowledge transfer on disaster research, mitigation, adaptation, prevention and risk reduction at all geographical scales: local, national and international. Key topics:- -multifaceted disaster and cascading disasters -the development of disaster risk reduction strategies and techniques -discussion and development of effective warning and educational systems for risk management at all levels -disasters associated with climate change -vulnerability analysis and vulnerability trends -emerging risks -resilience against disasters. The journal particularly encourages papers that approach risk from a multi-disciplinary perspective.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信