极端洪水预报中数据稀缺性的处理:一种深度生成建模方法

IF 4.2 2区 环境科学与生态学 Q1 WATER RESOURCES
Ali Sattari , Hamid Moradkhani
{"title":"极端洪水预报中数据稀缺性的处理:一种深度生成建模方法","authors":"Ali Sattari ,&nbsp;Hamid Moradkhani","doi":"10.1016/j.advwatres.2025.105063","DOIUrl":null,"url":null,"abstract":"<div><div>Floods are among the most devastating natural disasters, causing substantial economic losses and fatalities worldwide. Enhancing the accuracy of flood forecasting models is crucial for mitigating these impacts and providing early warning systems. However, the performance of these models significantly relies on the length and quality of training data. A lack of sufficient historical flood event data can undermine the ability of these models to provide accurate forecasts. To address this, we employ a Time-Series Generative Adversarial Network (TimeGAN) to synthetically generate flood events, enriching the training dataset both in quantity and quality. TimeGAN is trained using nine features, comprising United States Geological Survey (USGS) discharge and meteorological data from the North American Land Data Assimilation System (NLDAS-2) dataset, to produce synthetic data that includes both the synthetic NLDAS dataset and its corresponding USGS discharge. The augmented dataset, which combines historical and synthetic data, is then used to train a Long Short-Term Memory (LSTM) model to forecast streamflow at various lead times. Additionally, we incorporate wavelet transform (WT) within our model to decompose observed discharge data, identifying trends. The model performance is tested across twenty-four basins in Southeast Texas, focusing on extreme conditions during Hurricane Harvey. Results indicate that data augmentation improves the model's performance, increasing the average Nash–Sutcliffe Efficiency (NSE) over 24 basins by approximately 10 %, 5 %, 19 %, and 38 % for lead times of up to four days. These findings demonstrate the model's robustness and applicability in real-world scenarios, highlighting its potential as an effective tool for decision-makers in risk management during extreme events.</div></div>","PeriodicalId":7614,"journal":{"name":"Advances in Water Resources","volume":"204 ","pages":"Article 105063"},"PeriodicalIF":4.2000,"publicationDate":"2025-07-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Coping with data scarcity in extreme flood forecasting: A deep generative modeling approach\",\"authors\":\"Ali Sattari ,&nbsp;Hamid Moradkhani\",\"doi\":\"10.1016/j.advwatres.2025.105063\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Floods are among the most devastating natural disasters, causing substantial economic losses and fatalities worldwide. Enhancing the accuracy of flood forecasting models is crucial for mitigating these impacts and providing early warning systems. However, the performance of these models significantly relies on the length and quality of training data. A lack of sufficient historical flood event data can undermine the ability of these models to provide accurate forecasts. To address this, we employ a Time-Series Generative Adversarial Network (TimeGAN) to synthetically generate flood events, enriching the training dataset both in quantity and quality. TimeGAN is trained using nine features, comprising United States Geological Survey (USGS) discharge and meteorological data from the North American Land Data Assimilation System (NLDAS-2) dataset, to produce synthetic data that includes both the synthetic NLDAS dataset and its corresponding USGS discharge. The augmented dataset, which combines historical and synthetic data, is then used to train a Long Short-Term Memory (LSTM) model to forecast streamflow at various lead times. Additionally, we incorporate wavelet transform (WT) within our model to decompose observed discharge data, identifying trends. The model performance is tested across twenty-four basins in Southeast Texas, focusing on extreme conditions during Hurricane Harvey. Results indicate that data augmentation improves the model's performance, increasing the average Nash–Sutcliffe Efficiency (NSE) over 24 basins by approximately 10 %, 5 %, 19 %, and 38 % for lead times of up to four days. These findings demonstrate the model's robustness and applicability in real-world scenarios, highlighting its potential as an effective tool for decision-makers in risk management during extreme events.</div></div>\",\"PeriodicalId\":7614,\"journal\":{\"name\":\"Advances in Water Resources\",\"volume\":\"204 \",\"pages\":\"Article 105063\"},\"PeriodicalIF\":4.2000,\"publicationDate\":\"2025-07-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Advances in Water Resources\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0309170825001770\",\"RegionNum\":2,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"WATER RESOURCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advances in Water Resources","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0309170825001770","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"WATER RESOURCES","Score":null,"Total":0}
引用次数: 0

摘要

洪水是最具破坏性的自然灾害之一,在世界范围内造成巨大的经济损失和人员伤亡。提高洪水预报模型的准确性对于减轻这些影响和提供早期预警系统至关重要。然而,这些模型的性能在很大程度上依赖于训练数据的长度和质量。缺乏足够的历史洪水事件数据会削弱这些模型提供准确预报的能力。为了解决这个问题,我们采用时间序列生成对抗网络(TimeGAN)来综合生成洪水事件,在数量和质量上丰富了训练数据集。TimeGAN使用9个特征进行训练,包括美国地质调查局(USGS)排放数据和北美土地数据同化系统(NLDAS-2)数据集的气象数据,以生成包括NLDAS合成数据集及其相应的美国地质调查局排放数据的合成数据。增强的数据集结合了历史数据和合成数据,然后用于训练长短期记忆(LSTM)模型,以预测不同提前期的流量。此外,我们在模型中结合小波变换(WT)来分解观察到的排放数据,确定趋势。该模型的性能在德克萨斯州东南部的24个盆地进行了测试,重点是飓风哈维期间的极端条件。结果表明,数据增强提高了模型的性能,在长达4天的交货时间内,24个盆地的平均纳什-苏特克利夫效率(NSE)分别提高了约10%、5%、19%和38%。这些发现证明了该模型在现实场景中的稳健性和适用性,突出了其作为极端事件风险管理决策者的有效工具的潜力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Coping with data scarcity in extreme flood forecasting: A deep generative modeling approach
Floods are among the most devastating natural disasters, causing substantial economic losses and fatalities worldwide. Enhancing the accuracy of flood forecasting models is crucial for mitigating these impacts and providing early warning systems. However, the performance of these models significantly relies on the length and quality of training data. A lack of sufficient historical flood event data can undermine the ability of these models to provide accurate forecasts. To address this, we employ a Time-Series Generative Adversarial Network (TimeGAN) to synthetically generate flood events, enriching the training dataset both in quantity and quality. TimeGAN is trained using nine features, comprising United States Geological Survey (USGS) discharge and meteorological data from the North American Land Data Assimilation System (NLDAS-2) dataset, to produce synthetic data that includes both the synthetic NLDAS dataset and its corresponding USGS discharge. The augmented dataset, which combines historical and synthetic data, is then used to train a Long Short-Term Memory (LSTM) model to forecast streamflow at various lead times. Additionally, we incorporate wavelet transform (WT) within our model to decompose observed discharge data, identifying trends. The model performance is tested across twenty-four basins in Southeast Texas, focusing on extreme conditions during Hurricane Harvey. Results indicate that data augmentation improves the model's performance, increasing the average Nash–Sutcliffe Efficiency (NSE) over 24 basins by approximately 10 %, 5 %, 19 %, and 38 % for lead times of up to four days. These findings demonstrate the model's robustness and applicability in real-world scenarios, highlighting its potential as an effective tool for decision-makers in risk management during extreme events.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Advances in Water Resources
Advances in Water Resources 环境科学-水资源
CiteScore
9.40
自引率
6.40%
发文量
171
审稿时长
36 days
期刊介绍: Advances in Water Resources provides a forum for the presentation of fundamental scientific advances in the understanding of water resources systems. The scope of Advances in Water Resources includes any combination of theoretical, computational, and experimental approaches used to advance fundamental understanding of surface or subsurface water resources systems or the interaction of these systems with the atmosphere, geosphere, biosphere, and human societies. Manuscripts involving case studies that do not attempt to reach broader conclusions, research on engineering design, applied hydraulics, or water quality and treatment, as well as applications of existing knowledge that do not advance fundamental understanding of hydrological processes, are not appropriate for Advances in Water Resources. Examples of appropriate topical areas that will be considered include the following: • Surface and subsurface hydrology • Hydrometeorology • Environmental fluid dynamics • Ecohydrology and ecohydrodynamics • Multiphase transport phenomena in porous media • Fluid flow and species transport and reaction processes
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信