A novel technology for unraveling the spatial risk of Natech disasters based on machine learning and GIS: a case study from the city of Changzhou, China
IF 2.7 4区 地球科学Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS
{"title":"A novel technology for unraveling the spatial risk of Natech disasters based on machine learning and GIS: a case study from the city of Changzhou, China","authors":"Weiyi Ju, Zhixiang Xing","doi":"10.1007/s12145-024-01484-3","DOIUrl":null,"url":null,"abstract":"<p>In recent years, technical accidents caused by natural disasters have caused huge losses. The purpose of this study is to develop a mathematical model to predict and prevent the risk of such accidents. The model applied machine learning to predict the risk of such accidents in the hope of providing risk visualization results for local governments. The expected impact of this research will benefit residents and public welfare organizations. In this study, Random Forest (RF), the K-Nearest Neighbor (KNN), the Back Propagation (BP) neural network, Adaptive Boosting (AdaBoost), Gradient Boosting Decision Tree (GBDT), and the Extreme Gradient Boosting (XGBoost) was applied to predict the risk value. At the same time, this study applied ArcGIS to spatially interpolate the risk prediction values to generate the risk map. The results demonstrated that the RF algorithm achieved the highest classification performance among the five algorithms tested. Specifically, the RF algorithm attained an accuracy of 0.874, an F1-Score of 0.887, and an Area Under the Curve (AUC) of 0.984. The three townships with the highest risk were Xueyan, Daibu, and Shanghuang, with the proportion of risk area accounting for 48.39%, 44.34% and 79.64% respectively. This study provides a reference for the local government, which can take targeted measures to prevent and control. For disaster managers, the risks for those high-risk areas should receive sufficient attention. The government should establish a real-time updated disaster database to monitor the development of the situation. Moreover, the development and acquisition of historical disaster data is worthy of encouragement.</p>","PeriodicalId":49318,"journal":{"name":"Earth Science Informatics","volume":null,"pages":null},"PeriodicalIF":2.7000,"publicationDate":"2024-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Earth Science Informatics","FirstCategoryId":"89","ListUrlMain":"https://doi.org/10.1007/s12145-024-01484-3","RegionNum":4,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
In recent years, technical accidents caused by natural disasters have caused huge losses. The purpose of this study is to develop a mathematical model to predict and prevent the risk of such accidents. The model applied machine learning to predict the risk of such accidents in the hope of providing risk visualization results for local governments. The expected impact of this research will benefit residents and public welfare organizations. In this study, Random Forest (RF), the K-Nearest Neighbor (KNN), the Back Propagation (BP) neural network, Adaptive Boosting (AdaBoost), Gradient Boosting Decision Tree (GBDT), and the Extreme Gradient Boosting (XGBoost) was applied to predict the risk value. At the same time, this study applied ArcGIS to spatially interpolate the risk prediction values to generate the risk map. The results demonstrated that the RF algorithm achieved the highest classification performance among the five algorithms tested. Specifically, the RF algorithm attained an accuracy of 0.874, an F1-Score of 0.887, and an Area Under the Curve (AUC) of 0.984. The three townships with the highest risk were Xueyan, Daibu, and Shanghuang, with the proportion of risk area accounting for 48.39%, 44.34% and 79.64% respectively. This study provides a reference for the local government, which can take targeted measures to prevent and control. For disaster managers, the risks for those high-risk areas should receive sufficient attention. The government should establish a real-time updated disaster database to monitor the development of the situation. Moreover, the development and acquisition of historical disaster data is worthy of encouragement.
期刊介绍:
The Earth Science Informatics [ESIN] journal aims at rapid publication of high-quality, current, cutting-edge, and provocative scientific work in the area of Earth Science Informatics as it relates to Earth systems science and space science. This includes articles on the application of formal and computational methods, computational Earth science, spatial and temporal analyses, and all aspects of computer applications to the acquisition, storage, processing, interchange, and visualization of data and information about the materials, properties, processes, features, and phenomena that occur at all scales and locations in the Earth system’s five components (atmosphere, hydrosphere, geosphere, biosphere, cryosphere) and in space (see "About this journal" for more detail). The quarterly journal publishes research, methodology, and software articles, as well as editorials, comments, and book and software reviews. Review articles of relevant findings, topics, and methodologies are also considered.