Cutting-Edge strategies for absence data identification in natural hazards: Leveraging Voronoi-Entropy in flood susceptibility mapping with advanced AI techniques
Seyed Vahid Razavi-Termeh , Abolghasem Sadeghi-Niaraki , Farman Ali , Rizwan Ali Naqvi , Soo-Mi Choi
{"title":"Cutting-Edge strategies for absence data identification in natural hazards: Leveraging Voronoi-Entropy in flood susceptibility mapping with advanced AI techniques","authors":"Seyed Vahid Razavi-Termeh , Abolghasem Sadeghi-Niaraki , Farman Ali , Rizwan Ali Naqvi , Soo-Mi Choi","doi":"10.1016/j.jhydrol.2024.132337","DOIUrl":null,"url":null,"abstract":"<div><div>One of the non-structural methods for flood management is preparing flood susceptibility mapping (FSM). The performance of flood susceptibility models significantly depends on the learning methods and data sampling. Random sampling is commonly employed for data sampling owing to its practicality and straightforwardness. A purely random method may not be the best choice for non-flood sampling, as floods typically occur in different locations at different times. Therefore, this research aims to propose a new method for determining flood absence points using a combination of Voronoi diagrams and the entropy method. To achieve this, flood susceptibility modeling was conducted using the XGBoost (eXtreme Gradient Boosting) algorithm optimized with the cat swarm optimization (CSO) algorithm, employing the proposed method for absence point determination in the sub-basin of the Khuzestan province of Iran. Therefore, for flood susceptibility modeling, three scenarios for determining no-occurrence points were employed: random sampling (Scenario 1), Voronoi diagram (Scenario 2), and the combination of Voronoi diagram with entropy-based method (Scenario 3). Additionally, three data split ratios (60:40, 70:30, and 80:20) were utilized for partitioning training and validation datasets.</div><div>Scenario 1 demonstrates varying area under the curve (AUC) of the receiver operating characteristic (ROC) curve values across different data split ratios, with the 60:40 ratio showing moderate accuracy (AUC = 0.720). Scenario 2 exhibits improved performance with higher AUC values (0.847 to 0.908) and balanced sensitivity–specificity trade-offs. Scenario 3 demonstrates varying performance across different data split ratios, achieving the highest modeling accuracy with an AUC of 0.888 in the 60:40 split. Overall, Scenarios 2 and 3 outperform Scenario 1, showcasing significant accuracy improvements ranging from 23.33 % to 30.1 % across different data split ratios. Using Voronoi diagrams and entropy-based methods notably enhances accuracy in determining no-occurrence points compared to random selection, emphasizing the importance of method selection in flood susceptibility modeling.</div></div>","PeriodicalId":362,"journal":{"name":"Journal of Hydrology","volume":"648 ","pages":"Article 132337"},"PeriodicalIF":5.9000,"publicationDate":"2024-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Hydrology","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0022169424017335","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, CIVIL","Score":null,"Total":0}
引用次数: 0
Abstract
One of the non-structural methods for flood management is preparing flood susceptibility mapping (FSM). The performance of flood susceptibility models significantly depends on the learning methods and data sampling. Random sampling is commonly employed for data sampling owing to its practicality and straightforwardness. A purely random method may not be the best choice for non-flood sampling, as floods typically occur in different locations at different times. Therefore, this research aims to propose a new method for determining flood absence points using a combination of Voronoi diagrams and the entropy method. To achieve this, flood susceptibility modeling was conducted using the XGBoost (eXtreme Gradient Boosting) algorithm optimized with the cat swarm optimization (CSO) algorithm, employing the proposed method for absence point determination in the sub-basin of the Khuzestan province of Iran. Therefore, for flood susceptibility modeling, three scenarios for determining no-occurrence points were employed: random sampling (Scenario 1), Voronoi diagram (Scenario 2), and the combination of Voronoi diagram with entropy-based method (Scenario 3). Additionally, three data split ratios (60:40, 70:30, and 80:20) were utilized for partitioning training and validation datasets.
Scenario 1 demonstrates varying area under the curve (AUC) of the receiver operating characteristic (ROC) curve values across different data split ratios, with the 60:40 ratio showing moderate accuracy (AUC = 0.720). Scenario 2 exhibits improved performance with higher AUC values (0.847 to 0.908) and balanced sensitivity–specificity trade-offs. Scenario 3 demonstrates varying performance across different data split ratios, achieving the highest modeling accuracy with an AUC of 0.888 in the 60:40 split. Overall, Scenarios 2 and 3 outperform Scenario 1, showcasing significant accuracy improvements ranging from 23.33 % to 30.1 % across different data split ratios. Using Voronoi diagrams and entropy-based methods notably enhances accuracy in determining no-occurrence points compared to random selection, emphasizing the importance of method selection in flood susceptibility modeling.
期刊介绍:
The Journal of Hydrology publishes original research papers and comprehensive reviews in all the subfields of the hydrological sciences including water based management and policy issues that impact on economics and society. These comprise, but are not limited to the physical, chemical, biogeochemical, stochastic and systems aspects of surface and groundwater hydrology, hydrometeorology and hydrogeology. Relevant topics incorporating the insights and methodologies of disciplines such as climatology, water resource systems, hydraulics, agrohydrology, geomorphology, soil science, instrumentation and remote sensing, civil and environmental engineering are included. Social science perspectives on hydrological problems such as resource and ecological economics, environmental sociology, psychology and behavioural science, management and policy analysis are also invited. Multi-and interdisciplinary analyses of hydrological problems are within scope. The science published in the Journal of Hydrology is relevant to catchment scales rather than exclusively to a local scale or site.