{"title":"极端洪水预报中数据稀缺性的处理:一种深度生成建模方法","authors":"Ali Sattari , Hamid Moradkhani","doi":"10.1016/j.advwatres.2025.105063","DOIUrl":null,"url":null,"abstract":"<div><div>Floods are among the most devastating natural disasters, causing substantial economic losses and fatalities worldwide. Enhancing the accuracy of flood forecasting models is crucial for mitigating these impacts and providing early warning systems. However, the performance of these models significantly relies on the length and quality of training data. A lack of sufficient historical flood event data can undermine the ability of these models to provide accurate forecasts. To address this, we employ a Time-Series Generative Adversarial Network (TimeGAN) to synthetically generate flood events, enriching the training dataset both in quantity and quality. TimeGAN is trained using nine features, comprising United States Geological Survey (USGS) discharge and meteorological data from the North American Land Data Assimilation System (NLDAS-2) dataset, to produce synthetic data that includes both the synthetic NLDAS dataset and its corresponding USGS discharge. The augmented dataset, which combines historical and synthetic data, is then used to train a Long Short-Term Memory (LSTM) model to forecast streamflow at various lead times. Additionally, we incorporate wavelet transform (WT) within our model to decompose observed discharge data, identifying trends. The model performance is tested across twenty-four basins in Southeast Texas, focusing on extreme conditions during Hurricane Harvey. Results indicate that data augmentation improves the model's performance, increasing the average Nash–Sutcliffe Efficiency (NSE) over 24 basins by approximately 10 %, 5 %, 19 %, and 38 % for lead times of up to four days. These findings demonstrate the model's robustness and applicability in real-world scenarios, highlighting its potential as an effective tool for decision-makers in risk management during extreme events.</div></div>","PeriodicalId":7614,"journal":{"name":"Advances in Water Resources","volume":"204 ","pages":"Article 105063"},"PeriodicalIF":4.2000,"publicationDate":"2025-07-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Coping with data scarcity in extreme flood forecasting: A deep generative modeling approach\",\"authors\":\"Ali Sattari , Hamid Moradkhani\",\"doi\":\"10.1016/j.advwatres.2025.105063\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Floods are among the most devastating natural disasters, causing substantial economic losses and fatalities worldwide. Enhancing the accuracy of flood forecasting models is crucial for mitigating these impacts and providing early warning systems. However, the performance of these models significantly relies on the length and quality of training data. A lack of sufficient historical flood event data can undermine the ability of these models to provide accurate forecasts. To address this, we employ a Time-Series Generative Adversarial Network (TimeGAN) to synthetically generate flood events, enriching the training dataset both in quantity and quality. TimeGAN is trained using nine features, comprising United States Geological Survey (USGS) discharge and meteorological data from the North American Land Data Assimilation System (NLDAS-2) dataset, to produce synthetic data that includes both the synthetic NLDAS dataset and its corresponding USGS discharge. The augmented dataset, which combines historical and synthetic data, is then used to train a Long Short-Term Memory (LSTM) model to forecast streamflow at various lead times. Additionally, we incorporate wavelet transform (WT) within our model to decompose observed discharge data, identifying trends. The model performance is tested across twenty-four basins in Southeast Texas, focusing on extreme conditions during Hurricane Harvey. Results indicate that data augmentation improves the model's performance, increasing the average Nash–Sutcliffe Efficiency (NSE) over 24 basins by approximately 10 %, 5 %, 19 %, and 38 % for lead times of up to four days. These findings demonstrate the model's robustness and applicability in real-world scenarios, highlighting its potential as an effective tool for decision-makers in risk management during extreme events.</div></div>\",\"PeriodicalId\":7614,\"journal\":{\"name\":\"Advances in Water Resources\",\"volume\":\"204 \",\"pages\":\"Article 105063\"},\"PeriodicalIF\":4.2000,\"publicationDate\":\"2025-07-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Advances in Water Resources\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0309170825001770\",\"RegionNum\":2,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"WATER RESOURCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advances in Water Resources","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0309170825001770","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"WATER RESOURCES","Score":null,"Total":0}
Coping with data scarcity in extreme flood forecasting: A deep generative modeling approach
Floods are among the most devastating natural disasters, causing substantial economic losses and fatalities worldwide. Enhancing the accuracy of flood forecasting models is crucial for mitigating these impacts and providing early warning systems. However, the performance of these models significantly relies on the length and quality of training data. A lack of sufficient historical flood event data can undermine the ability of these models to provide accurate forecasts. To address this, we employ a Time-Series Generative Adversarial Network (TimeGAN) to synthetically generate flood events, enriching the training dataset both in quantity and quality. TimeGAN is trained using nine features, comprising United States Geological Survey (USGS) discharge and meteorological data from the North American Land Data Assimilation System (NLDAS-2) dataset, to produce synthetic data that includes both the synthetic NLDAS dataset and its corresponding USGS discharge. The augmented dataset, which combines historical and synthetic data, is then used to train a Long Short-Term Memory (LSTM) model to forecast streamflow at various lead times. Additionally, we incorporate wavelet transform (WT) within our model to decompose observed discharge data, identifying trends. The model performance is tested across twenty-four basins in Southeast Texas, focusing on extreme conditions during Hurricane Harvey. Results indicate that data augmentation improves the model's performance, increasing the average Nash–Sutcliffe Efficiency (NSE) over 24 basins by approximately 10 %, 5 %, 19 %, and 38 % for lead times of up to four days. These findings demonstrate the model's robustness and applicability in real-world scenarios, highlighting its potential as an effective tool for decision-makers in risk management during extreme events.
期刊介绍:
Advances in Water Resources provides a forum for the presentation of fundamental scientific advances in the understanding of water resources systems. The scope of Advances in Water Resources includes any combination of theoretical, computational, and experimental approaches used to advance fundamental understanding of surface or subsurface water resources systems or the interaction of these systems with the atmosphere, geosphere, biosphere, and human societies. Manuscripts involving case studies that do not attempt to reach broader conclusions, research on engineering design, applied hydraulics, or water quality and treatment, as well as applications of existing knowledge that do not advance fundamental understanding of hydrological processes, are not appropriate for Advances in Water Resources.
Examples of appropriate topical areas that will be considered include the following:
• Surface and subsurface hydrology
• Hydrometeorology
• Environmental fluid dynamics
• Ecohydrology and ecohydrodynamics
• Multiphase transport phenomena in porous media
• Fluid flow and species transport and reaction processes