比较二元统计方法、增强和叠加模型在洪水易感性评估中的性能的新方法

IF 8 2区 环境科学与生态学 Q1 ENVIRONMENTAL SCIENCES
Le Ngoc Hanh , Le Phuc Chi Lang , Phan Anh Hang , Nguyen Van An , Nguyen Hoang Son
{"title":"比较二元统计方法、增强和叠加模型在洪水易感性评估中的性能的新方法","authors":"Le Ngoc Hanh ,&nbsp;Le Phuc Chi Lang ,&nbsp;Phan Anh Hang ,&nbsp;Nguyen Van An ,&nbsp;Nguyen Hoang Son","doi":"10.1016/j.jenvman.2025.125670","DOIUrl":null,"url":null,"abstract":"<div><div>Evaluating the performance of flood susceptibility assessment methodologies is critical for optimizing flood management strategies. This study presents a novel methodology for comparing bivariate statistical methods, boosting, and stacking models to determine the most effective technique for flood susceptibility assessment in Hoa Vang District, Da Nang City, Vietnam. Twelve key factors from an initial set of seventeen factors determine their impact on flooding based on information gain ratio (IGR) and multicollinearity analysis. The study extracted 2,172 samples from Sentinel 1 imagery and field survey data, dividing them into training (70 %) and testing (30 %) sets using a random method. The two primary indices utilized for the bivariate statistical approach were the weight of evidence (WoE) and frequency ratio (FR). In bivariate statistics, the study utilizes two methods for classifying factors influencing flooding: the traditional Jenks natural breaks (JNB) and an improved version of JNB that accounts for correlation with flood data. Boosting models (AdaBoost (AB), XGBoost (XGB), CatBoost (CB), Light Gradient Boosting Machine (LGB), and Gradient Boosting (GB)) were employed both independently and in combination as base learners within the stacking model framework. Performance evaluation utilized the receiver operating characteristic and area under the curve (ROC-AUC), Kappa statistics, and other indices. The results show that the stacking models delivered the highest evaluation performance, with an average score of 0.882, outperforming the boosting models (0.76) and significantly surpassing the flood susceptibility maps generated by the bivariate statistical methods WoE (0.282) and FR (0.136). The study identified high and very high-risk flood zones, encompassing 14 % of the district, focusing on the southern communes. These findings provide valuable insights for enhancing flood susceptibility management and mitigation strategies, offering a robust tool for decision-making in flood-prone areas.</div></div>","PeriodicalId":356,"journal":{"name":"Journal of Environmental Management","volume":"387 ","pages":"Article 125670"},"PeriodicalIF":8.0000,"publicationDate":"2025-05-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A novel approach in comparing the performance of bivariate statistical methods, boosting, and stacking models in flood susceptibility assessment\",\"authors\":\"Le Ngoc Hanh ,&nbsp;Le Phuc Chi Lang ,&nbsp;Phan Anh Hang ,&nbsp;Nguyen Van An ,&nbsp;Nguyen Hoang Son\",\"doi\":\"10.1016/j.jenvman.2025.125670\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Evaluating the performance of flood susceptibility assessment methodologies is critical for optimizing flood management strategies. This study presents a novel methodology for comparing bivariate statistical methods, boosting, and stacking models to determine the most effective technique for flood susceptibility assessment in Hoa Vang District, Da Nang City, Vietnam. Twelve key factors from an initial set of seventeen factors determine their impact on flooding based on information gain ratio (IGR) and multicollinearity analysis. The study extracted 2,172 samples from Sentinel 1 imagery and field survey data, dividing them into training (70 %) and testing (30 %) sets using a random method. The two primary indices utilized for the bivariate statistical approach were the weight of evidence (WoE) and frequency ratio (FR). In bivariate statistics, the study utilizes two methods for classifying factors influencing flooding: the traditional Jenks natural breaks (JNB) and an improved version of JNB that accounts for correlation with flood data. Boosting models (AdaBoost (AB), XGBoost (XGB), CatBoost (CB), Light Gradient Boosting Machine (LGB), and Gradient Boosting (GB)) were employed both independently and in combination as base learners within the stacking model framework. Performance evaluation utilized the receiver operating characteristic and area under the curve (ROC-AUC), Kappa statistics, and other indices. The results show that the stacking models delivered the highest evaluation performance, with an average score of 0.882, outperforming the boosting models (0.76) and significantly surpassing the flood susceptibility maps generated by the bivariate statistical methods WoE (0.282) and FR (0.136). The study identified high and very high-risk flood zones, encompassing 14 % of the district, focusing on the southern communes. These findings provide valuable insights for enhancing flood susceptibility management and mitigation strategies, offering a robust tool for decision-making in flood-prone areas.</div></div>\",\"PeriodicalId\":356,\"journal\":{\"name\":\"Journal of Environmental Management\",\"volume\":\"387 \",\"pages\":\"Article 125670\"},\"PeriodicalIF\":8.0000,\"publicationDate\":\"2025-05-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Environmental Management\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0301479725016469\",\"RegionNum\":2,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENVIRONMENTAL SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Environmental Management","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0301479725016469","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0

摘要

评估洪水易感性评估方法的性能对于优化洪水管理策略至关重要。本研究提出了一种比较二元统计方法、增强和叠加模型的新方法,以确定越南岘港市Hoa Vang地区洪水易感性评估的最有效技术。基于信息增益比(IGR)和多重共线性分析,从最初的17个因素中选出12个关键因素来确定它们对洪水的影响。该研究从Sentinel 1号图像和实地调查数据中提取了2172个样本,使用随机方法将其分为训练集(70%)和测试集(30%)。双变量统计方法使用的两个主要指标是证据权重(WoE)和频率比(FR)。在双变量统计中,本研究使用两种方法对影响洪水的因素进行分类:传统的Jenks自然断裂(JNB)和一种改进的JNB,该方法考虑了与洪水数据的相关性。增强模型(AdaBoost (AB)、XGBoost (XGB)、CatBoost (CB)、Light Gradient Boosting Machine (LGB)和Gradient Boosting (GB))在叠加模型框架内单独或组合使用作为基础学习器。性能评价采用受试者工作特征和曲线下面积(ROC-AUC)、Kappa统计等指标。结果表明,叠加模型的评价性能最高,平均得分为0.882,优于提升模型(0.76),显著优于二元统计方法(WoE)和FR(0.136)生成的洪水敏感性图。该研究确定了高风险区和极高风险区,覆盖了该地区14%的面积,重点是南部的公社。这些发现为加强洪水易感性管理和减灾战略提供了有价值的见解,为洪水易发地区的决策提供了强有力的工具。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A novel approach in comparing the performance of bivariate statistical methods, boosting, and stacking models in flood susceptibility assessment
Evaluating the performance of flood susceptibility assessment methodologies is critical for optimizing flood management strategies. This study presents a novel methodology for comparing bivariate statistical methods, boosting, and stacking models to determine the most effective technique for flood susceptibility assessment in Hoa Vang District, Da Nang City, Vietnam. Twelve key factors from an initial set of seventeen factors determine their impact on flooding based on information gain ratio (IGR) and multicollinearity analysis. The study extracted 2,172 samples from Sentinel 1 imagery and field survey data, dividing them into training (70 %) and testing (30 %) sets using a random method. The two primary indices utilized for the bivariate statistical approach were the weight of evidence (WoE) and frequency ratio (FR). In bivariate statistics, the study utilizes two methods for classifying factors influencing flooding: the traditional Jenks natural breaks (JNB) and an improved version of JNB that accounts for correlation with flood data. Boosting models (AdaBoost (AB), XGBoost (XGB), CatBoost (CB), Light Gradient Boosting Machine (LGB), and Gradient Boosting (GB)) were employed both independently and in combination as base learners within the stacking model framework. Performance evaluation utilized the receiver operating characteristic and area under the curve (ROC-AUC), Kappa statistics, and other indices. The results show that the stacking models delivered the highest evaluation performance, with an average score of 0.882, outperforming the boosting models (0.76) and significantly surpassing the flood susceptibility maps generated by the bivariate statistical methods WoE (0.282) and FR (0.136). The study identified high and very high-risk flood zones, encompassing 14 % of the district, focusing on the southern communes. These findings provide valuable insights for enhancing flood susceptibility management and mitigation strategies, offering a robust tool for decision-making in flood-prone areas.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Journal of Environmental Management
Journal of Environmental Management 环境科学-环境科学
CiteScore
13.70
自引率
5.70%
发文量
2477
审稿时长
84 days
期刊介绍: The Journal of Environmental Management is a journal for the publication of peer reviewed, original research for all aspects of management and the managed use of the environment, both natural and man-made.Critical review articles are also welcome; submission of these is strongly encouraged.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信