评估城市洪水深度估算的大型多模态模型

IF 4.6 1区地球科学 Q2 ENVIRONMENTAL SCIENCES

Water Resources Research Pub Date : 2025-04-10 DOI:10.1029/2024wr039494

Heng Lyu, Shun'an Zhou, Ze Wang, Guangtao Fu, Chi Zhang

{"title":"评估城市洪水深度估算的大型多模态模型","authors":"Heng Lyu, Shun'an Zhou, Ze Wang, Guangtao Fu, Chi Zhang","doi":"10.1029/2024wr039494","DOIUrl":null,"url":null,"abstract":"Urban flood monitoring is crucial for understanding flood processes and implementing management strategies. However, current monitoring systems cannot comprehensively capture urban flooding dynamics. Here we explore the use of cutting-edge Large Multimodal Models (LMMs) to estimate floodwater depth from ground-level images, as alternative observational approaches. Evaluated on two urban flood image data sets, LMMs generate estimations exhibiting acceptable concordance to ground truth, with GPT-4 achieving the highest accuracy of 0.65 and a Spearman correlation coefficient of 0.88. Furthermore, a combined effect of image complexity and textual prompt is found to influence LMMs' performance. Our study systematically demonstrates, for the first time, that LMMs can be effective tools for imaging-based urban flood monitoring, enlarging the data for flood forecasting and model calibration.","PeriodicalId":23799,"journal":{"name":"Water Resources Research","volume":"17 1","pages":""},"PeriodicalIF":4.6000,"publicationDate":"2025-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Assessing Large Multimodal Models for Urban Floodwater Depth Estimation\",\"authors\":\"Heng Lyu, Shun'an Zhou, Ze Wang, Guangtao Fu, Chi Zhang\",\"doi\":\"10.1029/2024wr039494\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Urban flood monitoring is crucial for understanding flood processes and implementing management strategies. However, current monitoring systems cannot comprehensively capture urban flooding dynamics. Here we explore the use of cutting-edge Large Multimodal Models (LMMs) to estimate floodwater depth from ground-level images, as alternative observational approaches. Evaluated on two urban flood image data sets, LMMs generate estimations exhibiting acceptable concordance to ground truth, with GPT-4 achieving the highest accuracy of 0.65 and a Spearman correlation coefficient of 0.88. Furthermore, a combined effect of image complexity and textual prompt is found to influence LMMs' performance. Our study systematically demonstrates, for the first time, that LMMs can be effective tools for imaging-based urban flood monitoring, enlarging the data for flood forecasting and model calibration.\",\"PeriodicalId\":23799,\"journal\":{\"name\":\"Water Resources Research\",\"volume\":\"17 1\",\"pages\":\"\"},\"PeriodicalIF\":4.6000,\"publicationDate\":\"2025-04-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Water Resources Research\",\"FirstCategoryId\":\"89\",\"ListUrlMain\":\"https://doi.org/10.1029/2024wr039494\",\"RegionNum\":1,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENVIRONMENTAL SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Water Resources Research","FirstCategoryId":"89","ListUrlMain":"https://doi.org/10.1029/2024wr039494","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}

引用次数: 0

摘要

城市洪水监测对于了解洪水过程和实施管理策略至关重要。然而，目前的监测系统不能全面捕捉城市洪水动态。在这里，我们探索使用先进的大型多模态模型（lmm）从地面图像估计洪水深度，作为替代观测方法。在两个城市洪水图像数据集上进行评估，lmm生成的估计与地面真实情况具有可接受的一致性，GPT-4的精度最高，为0.65，Spearman相关系数为0.88。此外，还发现图像复杂度和文本提示的共同作用会影响lmm的性能。我们的研究首次系统地证明了lmm可以作为基于图像的城市洪水监测的有效工具，扩大了洪水预报和模型校准的数据。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Assessing Large Multimodal Models for Urban Floodwater Depth Estimation

Urban flood monitoring is crucial for understanding flood processes and implementing management strategies. However, current monitoring systems cannot comprehensively capture urban flooding dynamics. Here we explore the use of cutting-edge Large Multimodal Models (LMMs) to estimate floodwater depth from ground-level images, as alternative observational approaches. Evaluated on two urban flood image data sets, LMMs generate estimations exhibiting acceptable concordance to ground truth, with GPT-4 achieving the highest accuracy of 0.65 and a Spearman correlation coefficient of 0.88. Furthermore, a combined effect of image complexity and textual prompt is found to influence LMMs' performance. Our study systematically demonstrates, for the first time, that LMMs can be effective tools for imaging-based urban flood monitoring, enlarging the data for flood forecasting and model calibration.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Water Resources Research 环境科学-湖沼学

CiteScore

8.80

自引率

13.00%

发文量

599

审稿时长

3.5 months

期刊介绍： Water Resources Research (WRR) is an interdisciplinary journal that focuses on hydrology and water resources. It publishes original research in the natural and social sciences of water. It emphasizes the role of water in the Earth system, including physical, chemical, biological, and ecological processes in water resources research and management, including social, policy, and public health implications. It encompasses observational, experimental, theoretical, analytical, numerical, and data-driven approaches that advance the science of water and its management. Submissions are evaluated for their novelty, accuracy, significance, and broader implications of the findings.