Heng Lyu, Shun'an Zhou, Ze Wang, Guangtao Fu, Chi Zhang
{"title":"评估城市洪水深度估算的大型多模态模型","authors":"Heng Lyu, Shun'an Zhou, Ze Wang, Guangtao Fu, Chi Zhang","doi":"10.1029/2024wr039494","DOIUrl":null,"url":null,"abstract":"Urban flood monitoring is crucial for understanding flood processes and implementing management strategies. However, current monitoring systems cannot comprehensively capture urban flooding dynamics. Here we explore the use of cutting-edge Large Multimodal Models (LMMs) to estimate floodwater depth from ground-level images, as alternative observational approaches. Evaluated on two urban flood image data sets, LMMs generate estimations exhibiting acceptable concordance to ground truth, with GPT-4 achieving the highest accuracy of 0.65 and a Spearman correlation coefficient of 0.88. Furthermore, a combined effect of image complexity and textual prompt is found to influence LMMs' performance. Our study systematically demonstrates, for the first time, that LMMs can be effective tools for imaging-based urban flood monitoring, enlarging the data for flood forecasting and model calibration.","PeriodicalId":23799,"journal":{"name":"Water Resources Research","volume":"17 1","pages":""},"PeriodicalIF":4.6000,"publicationDate":"2025-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Assessing Large Multimodal Models for Urban Floodwater Depth Estimation\",\"authors\":\"Heng Lyu, Shun'an Zhou, Ze Wang, Guangtao Fu, Chi Zhang\",\"doi\":\"10.1029/2024wr039494\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Urban flood monitoring is crucial for understanding flood processes and implementing management strategies. However, current monitoring systems cannot comprehensively capture urban flooding dynamics. Here we explore the use of cutting-edge Large Multimodal Models (LMMs) to estimate floodwater depth from ground-level images, as alternative observational approaches. Evaluated on two urban flood image data sets, LMMs generate estimations exhibiting acceptable concordance to ground truth, with GPT-4 achieving the highest accuracy of 0.65 and a Spearman correlation coefficient of 0.88. Furthermore, a combined effect of image complexity and textual prompt is found to influence LMMs' performance. Our study systematically demonstrates, for the first time, that LMMs can be effective tools for imaging-based urban flood monitoring, enlarging the data for flood forecasting and model calibration.\",\"PeriodicalId\":23799,\"journal\":{\"name\":\"Water Resources Research\",\"volume\":\"17 1\",\"pages\":\"\"},\"PeriodicalIF\":4.6000,\"publicationDate\":\"2025-04-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Water Resources Research\",\"FirstCategoryId\":\"89\",\"ListUrlMain\":\"https://doi.org/10.1029/2024wr039494\",\"RegionNum\":1,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENVIRONMENTAL SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Water Resources Research","FirstCategoryId":"89","ListUrlMain":"https://doi.org/10.1029/2024wr039494","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
Assessing Large Multimodal Models for Urban Floodwater Depth Estimation
Urban flood monitoring is crucial for understanding flood processes and implementing management strategies. However, current monitoring systems cannot comprehensively capture urban flooding dynamics. Here we explore the use of cutting-edge Large Multimodal Models (LMMs) to estimate floodwater depth from ground-level images, as alternative observational approaches. Evaluated on two urban flood image data sets, LMMs generate estimations exhibiting acceptable concordance to ground truth, with GPT-4 achieving the highest accuracy of 0.65 and a Spearman correlation coefficient of 0.88. Furthermore, a combined effect of image complexity and textual prompt is found to influence LMMs' performance. Our study systematically demonstrates, for the first time, that LMMs can be effective tools for imaging-based urban flood monitoring, enlarging the data for flood forecasting and model calibration.
期刊介绍:
Water Resources Research (WRR) is an interdisciplinary journal that focuses on hydrology and water resources. It publishes original research in the natural and social sciences of water. It emphasizes the role of water in the Earth system, including physical, chemical, biological, and ecological processes in water resources research and management, including social, policy, and public health implications. It encompasses observational, experimental, theoretical, analytical, numerical, and data-driven approaches that advance the science of water and its management. Submissions are evaluated for their novelty, accuracy, significance, and broader implications of the findings.