Maryam Jahanbani, Mohammad H. Vahidnia, Hossein Aghamohammadi, Zahra Azizi
{"title":"通过地理信息学和集合学习方法绘制伊朗马赞达兰的洪水易感性地图,重点是 AdaBoost-Decision Tree 算法","authors":"Maryam Jahanbani, Mohammad H. Vahidnia, Hossein Aghamohammadi, Zahra Azizi","doi":"10.1007/s12145-023-01213-2","DOIUrl":null,"url":null,"abstract":"<p>Floods, as natural disasters, impose significant human and financial burdens, necessitating stringent mitigation measures. The recurrent annual incidence of floods precipitates considerable economic setbacks and tragic human casualties. In the realm of disaster management, flood susceptibility mapping has evolved into an indispensable instrument for preemptive intervention. In recent years, the amalgamation of machine learning (ML) methodologies and geographic information systems (GIS) has demonstrated remarkable promise in the realm of flood susceptibility mapping. Nonetheless, the inherent limitations of standalone ML models have constrained their predictive efficacy. Several shortcomings are evident in prior research. These include the failure to utilize contemporary ensemble approaches capable of enhancing performance and the limited exploration of diverse classifier combinations, which are instrumental in augmenting reliability. Simultaneously, there is an absence of current and up-to-date flood susceptibility maps on recent floods within the study area. Hence, this study endeavors to enhance the precision of flood susceptibility mapping, within the Haraz-Neka River basin across Mazandaran province, by harnessing an ensemble of ML models. The research methodology encompassed several pivotal phases. Initially, data about 240 flood sites were meticulously compiled. Subsequently, 70% of this dataset was allocated for training and cartographic elucidations, whereas the remaining 30%, selected at random, served to validate the resultant maps. The analytical framework incorporated a spectrum of influential parameters, encompassing Elevation, Slope, Aspect, Rainfall, land use, Vegetation Differentiation Index (NDVI), Soil Hydrology Groups, Proximity to the River, Distance from Landslides, Topographic Wetness Index (TWI), Stream Power Index (SPI), and Sediment Transport Index (STI) for spatial modeling. The results undeniably highlight the superior performance of the ensemble model compared to its individual counterparts. Validation exercises, leveraging historical flood data, prominently endorsed the AdaBoost algorithm integrated with the Decision Tree classifier as the most efficacious. Garnering an Area Under ROC curve surpassing 0.96, accompanied by an accuracy of 0.93%, a sensitivity of 0.95%, and a specificity of 0.92%, this amalgamation substantiates its prowess. The proposed framework stands poised to empower decision-makers in identifying vulnerable regions and devising efficacious flood risk mitigation strategies.</p>","PeriodicalId":49318,"journal":{"name":"Earth Science Informatics","volume":"65 1","pages":""},"PeriodicalIF":2.7000,"publicationDate":"2024-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Flood susceptibility mapping through geoinformatics and ensemble learning methods, with an emphasis on the AdaBoost-Decision Tree algorithm, in Mazandaran, Iran\",\"authors\":\"Maryam Jahanbani, Mohammad H. Vahidnia, Hossein Aghamohammadi, Zahra Azizi\",\"doi\":\"10.1007/s12145-023-01213-2\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Floods, as natural disasters, impose significant human and financial burdens, necessitating stringent mitigation measures. The recurrent annual incidence of floods precipitates considerable economic setbacks and tragic human casualties. In the realm of disaster management, flood susceptibility mapping has evolved into an indispensable instrument for preemptive intervention. In recent years, the amalgamation of machine learning (ML) methodologies and geographic information systems (GIS) has demonstrated remarkable promise in the realm of flood susceptibility mapping. Nonetheless, the inherent limitations of standalone ML models have constrained their predictive efficacy. Several shortcomings are evident in prior research. These include the failure to utilize contemporary ensemble approaches capable of enhancing performance and the limited exploration of diverse classifier combinations, which are instrumental in augmenting reliability. Simultaneously, there is an absence of current and up-to-date flood susceptibility maps on recent floods within the study area. Hence, this study endeavors to enhance the precision of flood susceptibility mapping, within the Haraz-Neka River basin across Mazandaran province, by harnessing an ensemble of ML models. The research methodology encompassed several pivotal phases. Initially, data about 240 flood sites were meticulously compiled. Subsequently, 70% of this dataset was allocated for training and cartographic elucidations, whereas the remaining 30%, selected at random, served to validate the resultant maps. The analytical framework incorporated a spectrum of influential parameters, encompassing Elevation, Slope, Aspect, Rainfall, land use, Vegetation Differentiation Index (NDVI), Soil Hydrology Groups, Proximity to the River, Distance from Landslides, Topographic Wetness Index (TWI), Stream Power Index (SPI), and Sediment Transport Index (STI) for spatial modeling. The results undeniably highlight the superior performance of the ensemble model compared to its individual counterparts. Validation exercises, leveraging historical flood data, prominently endorsed the AdaBoost algorithm integrated with the Decision Tree classifier as the most efficacious. Garnering an Area Under ROC curve surpassing 0.96, accompanied by an accuracy of 0.93%, a sensitivity of 0.95%, and a specificity of 0.92%, this amalgamation substantiates its prowess. The proposed framework stands poised to empower decision-makers in identifying vulnerable regions and devising efficacious flood risk mitigation strategies.</p>\",\"PeriodicalId\":49318,\"journal\":{\"name\":\"Earth Science Informatics\",\"volume\":\"65 1\",\"pages\":\"\"},\"PeriodicalIF\":2.7000,\"publicationDate\":\"2024-01-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Earth Science Informatics\",\"FirstCategoryId\":\"89\",\"ListUrlMain\":\"https://doi.org/10.1007/s12145-023-01213-2\",\"RegionNum\":4,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Earth Science Informatics","FirstCategoryId":"89","ListUrlMain":"https://doi.org/10.1007/s12145-023-01213-2","RegionNum":4,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
摘要
洪水作为自然灾害,给人类和财政带来了沉重负担,必须采取严格的缓解措施。每年反复发生的洪灾造成了巨大的经济损失和惨重的人员伤亡。在灾害管理领域,绘制洪水易发区地图已发展成为先期干预不可或缺的工具。近年来,机器学习(ML)方法与地理信息系统(GIS)的结合在洪水易感性绘图领域展现出了显著的前景。然而,独立 ML 模型的固有局限性限制了其预测效果。在之前的研究中,有几个缺陷是显而易见的。其中包括未能利用能够提高性能的现代集合方法,以及对有助于提高可靠性的多种分类器组合的探索有限。同时,研究区域内也缺乏最新的洪水易感性地图。因此,本研究试图通过利用一组 ML 模型来提高马赞达兰省哈拉兹-内卡河流域洪水易发性绘图的精确度。研究方法包括几个关键阶段。首先,对 240 个洪水点的数据进行了细致的汇编。随后,该数据集的 70% 被分配用于训练和制图阐释,而随机选择的其余 30% 则用于验证所绘制的地图。分析框架纳入了一系列有影响力的参数,包括用于空间建模的海拔高度、坡度、坡向、降雨量、土地利用、植被分异指数(NDVI)、土壤水文组、河流临近度、滑坡距离、地形湿润指数(TWI)、溪流动力指数(SPI)和沉积物迁移指数(STI)。与单个模型相比,这些结果无可否认地凸显了集合模型的卓越性能。在利用历史洪水数据进行的验证工作中,AdaBoost 算法与决策树分类器相结合的效果最为显著。该组合的 ROC 曲线下面积超过 0.96,准确率为 0.93%,灵敏度为 0.95%,特异性为 0.92%,证明了其卓越的能力。所提出的框架可帮助决策者识别脆弱地区并制定有效的洪水风险缓解战略。
Flood susceptibility mapping through geoinformatics and ensemble learning methods, with an emphasis on the AdaBoost-Decision Tree algorithm, in Mazandaran, Iran
Floods, as natural disasters, impose significant human and financial burdens, necessitating stringent mitigation measures. The recurrent annual incidence of floods precipitates considerable economic setbacks and tragic human casualties. In the realm of disaster management, flood susceptibility mapping has evolved into an indispensable instrument for preemptive intervention. In recent years, the amalgamation of machine learning (ML) methodologies and geographic information systems (GIS) has demonstrated remarkable promise in the realm of flood susceptibility mapping. Nonetheless, the inherent limitations of standalone ML models have constrained their predictive efficacy. Several shortcomings are evident in prior research. These include the failure to utilize contemporary ensemble approaches capable of enhancing performance and the limited exploration of diverse classifier combinations, which are instrumental in augmenting reliability. Simultaneously, there is an absence of current and up-to-date flood susceptibility maps on recent floods within the study area. Hence, this study endeavors to enhance the precision of flood susceptibility mapping, within the Haraz-Neka River basin across Mazandaran province, by harnessing an ensemble of ML models. The research methodology encompassed several pivotal phases. Initially, data about 240 flood sites were meticulously compiled. Subsequently, 70% of this dataset was allocated for training and cartographic elucidations, whereas the remaining 30%, selected at random, served to validate the resultant maps. The analytical framework incorporated a spectrum of influential parameters, encompassing Elevation, Slope, Aspect, Rainfall, land use, Vegetation Differentiation Index (NDVI), Soil Hydrology Groups, Proximity to the River, Distance from Landslides, Topographic Wetness Index (TWI), Stream Power Index (SPI), and Sediment Transport Index (STI) for spatial modeling. The results undeniably highlight the superior performance of the ensemble model compared to its individual counterparts. Validation exercises, leveraging historical flood data, prominently endorsed the AdaBoost algorithm integrated with the Decision Tree classifier as the most efficacious. Garnering an Area Under ROC curve surpassing 0.96, accompanied by an accuracy of 0.93%, a sensitivity of 0.95%, and a specificity of 0.92%, this amalgamation substantiates its prowess. The proposed framework stands poised to empower decision-makers in identifying vulnerable regions and devising efficacious flood risk mitigation strategies.
期刊介绍:
The Earth Science Informatics [ESIN] journal aims at rapid publication of high-quality, current, cutting-edge, and provocative scientific work in the area of Earth Science Informatics as it relates to Earth systems science and space science. This includes articles on the application of formal and computational methods, computational Earth science, spatial and temporal analyses, and all aspects of computer applications to the acquisition, storage, processing, interchange, and visualization of data and information about the materials, properties, processes, features, and phenomena that occur at all scales and locations in the Earth system’s five components (atmosphere, hydrosphere, geosphere, biosphere, cryosphere) and in space (see "About this journal" for more detail). The quarterly journal publishes research, methodology, and software articles, as well as editorials, comments, and book and software reviews. Review articles of relevant findings, topics, and methodologies are also considered.