利用交织的土地和建筑环境特征预测城市山洪热点的可解释机器学习

IF 7.1 1区 地球科学 Q1 ENVIRONMENTAL STUDIES
Zhewei Liu , Tyler Felton , Ali Mostafavi
{"title":"利用交织的土地和建筑环境特征预测城市山洪热点的可解释机器学习","authors":"Zhewei Liu ,&nbsp;Tyler Felton ,&nbsp;Ali Mostafavi","doi":"10.1016/j.compenvurbsys.2024.102096","DOIUrl":null,"url":null,"abstract":"<div><p>Pluvial flash floods are fast-moving hazards and causes significant disruptions in urban areas. With the increase in heavy precipitations, the ability to proactively identify flash floods hotspots in cities is critical for flood nowcasting and predictive monitoring of risks. While rainfall runoff models and hydrologic models are useful models for flash flood prediction, these models are computationally expensive and effort intensive to be used for flood nowcasting. To address this challenge, this study presents interpretable machine learning models for predicting urban flash flood hotspots based on intertwined land and built environment features. The task of predicting flash flood hotspots is formulated as a binary classification problem, and three recent flash flood events in U.S. cities are selected for data collection and model validation. Various features related to land and built environment characteristics are constructed using diverse datasets, and the occurrences of flash floods are captured using crowdsource data from the events. Using these features and datasets, the flash flood hotspots of cities are predicted with two ensemble models based on decision trees. The results demonstrate that the models can achieve good accuracy (0.8) in identifying flooded/non-flooded locations. Especially, the models can achieve high true positive rate (0.83–0.89) and low missing rate, demonstrating the methods' practicability for accurately predicting flooded hotspots. The model interpretation results indicate that land features related to hydrological and topological features have greater impacts on flash flood risk, than built environment features. Further analysis reveals that the feature importance, model performance, and model transferability performance vary among cities and localized specifications of the models are needed for accurate prediction of flash flood for a particular city. The data-driven machine learning models presented in this study provide a useful tool for predicting flash flood hotspots based on the intertwined features of land and the built environment in cities to enable nowcasting and proactive monitoring of flash flood hotspots for emergency response and also inform integrated urban design and development towards flash flood risk reduction.</p></div>","PeriodicalId":48241,"journal":{"name":"Computers Environment and Urban Systems","volume":"110 ","pages":"Article 102096"},"PeriodicalIF":7.1000,"publicationDate":"2024-03-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Interpretable machine learning for predicting urban flash flood hotspots using intertwined land and built-environment features\",\"authors\":\"Zhewei Liu ,&nbsp;Tyler Felton ,&nbsp;Ali Mostafavi\",\"doi\":\"10.1016/j.compenvurbsys.2024.102096\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Pluvial flash floods are fast-moving hazards and causes significant disruptions in urban areas. With the increase in heavy precipitations, the ability to proactively identify flash floods hotspots in cities is critical for flood nowcasting and predictive monitoring of risks. While rainfall runoff models and hydrologic models are useful models for flash flood prediction, these models are computationally expensive and effort intensive to be used for flood nowcasting. To address this challenge, this study presents interpretable machine learning models for predicting urban flash flood hotspots based on intertwined land and built environment features. The task of predicting flash flood hotspots is formulated as a binary classification problem, and three recent flash flood events in U.S. cities are selected for data collection and model validation. Various features related to land and built environment characteristics are constructed using diverse datasets, and the occurrences of flash floods are captured using crowdsource data from the events. Using these features and datasets, the flash flood hotspots of cities are predicted with two ensemble models based on decision trees. The results demonstrate that the models can achieve good accuracy (0.8) in identifying flooded/non-flooded locations. Especially, the models can achieve high true positive rate (0.83–0.89) and low missing rate, demonstrating the methods' practicability for accurately predicting flooded hotspots. The model interpretation results indicate that land features related to hydrological and topological features have greater impacts on flash flood risk, than built environment features. Further analysis reveals that the feature importance, model performance, and model transferability performance vary among cities and localized specifications of the models are needed for accurate prediction of flash flood for a particular city. The data-driven machine learning models presented in this study provide a useful tool for predicting flash flood hotspots based on the intertwined features of land and the built environment in cities to enable nowcasting and proactive monitoring of flash flood hotspots for emergency response and also inform integrated urban design and development towards flash flood risk reduction.</p></div>\",\"PeriodicalId\":48241,\"journal\":{\"name\":\"Computers Environment and Urban Systems\",\"volume\":\"110 \",\"pages\":\"Article 102096\"},\"PeriodicalIF\":7.1000,\"publicationDate\":\"2024-03-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computers Environment and Urban Systems\",\"FirstCategoryId\":\"89\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0198971524000255\",\"RegionNum\":1,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENVIRONMENTAL STUDIES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers Environment and Urban Systems","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0198971524000255","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL STUDIES","Score":null,"Total":0}
引用次数: 0

摘要

冲积山洪是瞬息万变的灾害,会给城市地区造成严重破坏。随着强降水的增加,主动识别城市山洪热点的能力对于洪水预报和风险预测监测至关重要。虽然降雨径流模型和水文模型是预测山洪暴发的有用模型,但这些模型用于洪水预报的计算成本高、工作量大。为了应对这一挑战,本研究提出了可解释的机器学习模型,用于根据相互交织的土地和建筑环境特征预测城市山洪热点。预测山洪热点的任务被表述为一个二元分类问题,并选择了美国城市最近发生的三次山洪事件进行数据收集和模型验证。利用不同的数据集构建了与土地和建筑环境特征相关的各种特征,并利用事件中的众包数据捕捉了山洪暴发的情况。利用这些特征和数据集,两个基于决策树的集合模型对城市的山洪热点进行了预测。结果表明,模型在识别洪水/非洪水地点方面可以达到很高的准确率(0.8)。特别是,模型的真阳性率较高(0.83-0.89),缺失率较低,表明这些方法在准确预测洪涝热点方面具有实用性。模型解释结果表明,与建筑环境特征相比,与水文和地形特征相关的土地特征对山洪风险的影响更大。进一步的分析表明,不同城市的特征重要性、模型性能和模型可移植性能各不相同,因此需要对模型进行本地化规范,以准确预测特定城市的山洪灾害。本研究中提出的数据驱动型机器学习模型为根据城市中相互交织的土地和建筑环境特征预测山洪热点提供了有用的工具,从而能够对山洪热点进行预报和主动监测,以采取应急措施,并为降低山洪风险的综合城市设计和发展提供信息。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Interpretable machine learning for predicting urban flash flood hotspots using intertwined land and built-environment features

Pluvial flash floods are fast-moving hazards and causes significant disruptions in urban areas. With the increase in heavy precipitations, the ability to proactively identify flash floods hotspots in cities is critical for flood nowcasting and predictive monitoring of risks. While rainfall runoff models and hydrologic models are useful models for flash flood prediction, these models are computationally expensive and effort intensive to be used for flood nowcasting. To address this challenge, this study presents interpretable machine learning models for predicting urban flash flood hotspots based on intertwined land and built environment features. The task of predicting flash flood hotspots is formulated as a binary classification problem, and three recent flash flood events in U.S. cities are selected for data collection and model validation. Various features related to land and built environment characteristics are constructed using diverse datasets, and the occurrences of flash floods are captured using crowdsource data from the events. Using these features and datasets, the flash flood hotspots of cities are predicted with two ensemble models based on decision trees. The results demonstrate that the models can achieve good accuracy (0.8) in identifying flooded/non-flooded locations. Especially, the models can achieve high true positive rate (0.83–0.89) and low missing rate, demonstrating the methods' practicability for accurately predicting flooded hotspots. The model interpretation results indicate that land features related to hydrological and topological features have greater impacts on flash flood risk, than built environment features. Further analysis reveals that the feature importance, model performance, and model transferability performance vary among cities and localized specifications of the models are needed for accurate prediction of flash flood for a particular city. The data-driven machine learning models presented in this study provide a useful tool for predicting flash flood hotspots based on the intertwined features of land and the built environment in cities to enable nowcasting and proactive monitoring of flash flood hotspots for emergency response and also inform integrated urban design and development towards flash flood risk reduction.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
13.30
自引率
7.40%
发文量
111
审稿时长
32 days
期刊介绍: Computers, Environment and Urban Systemsis an interdisciplinary journal publishing cutting-edge and innovative computer-based research on environmental and urban systems, that privileges the geospatial perspective. The journal welcomes original high quality scholarship of a theoretical, applied or technological nature, and provides a stimulating presentation of perspectives, research developments, overviews of important new technologies and uses of major computational, information-based, and visualization innovations. Applied and theoretical contributions demonstrate the scope of computer-based analysis fostering a better understanding of environmental and urban systems, their spatial scope and their dynamics.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信