{"title":"Transfer learning using the global Caravan dataset for developing a local river streamflow prediction model","authors":"Almas Alzhanov , Aliya Nugumanova , Vsevolod Moreido","doi":"10.1016/j.envsoft.2025.106691","DOIUrl":null,"url":null,"abstract":"<div><div>Effective water resource and flood risk management depends on reliable streamflow forecasting. However, the accuracy of such forecasts is often limited by sparse monitoring networks and insufficient historical data. To address this issue, we explore the potential of a multi-basin training approach using the global Caravan hydrological dataset to improve local streamflow forecasting. As a case study, we focus on the Uba River basin in East Kazakhstan. The developed models are evaluated against two baselines: GR4J hydrological model and an LSTM model trained exclusively on local data. Results indicate that our approach enhances forecasting accuracy and outperforms the baseline models, with the best model achieving Nash-Sutcliffe efficiency value of 0.8187 compared to 0.72 of GR4J and 0.7602 of LSTM trained exclusively on local data. These findings indicate that multi-basin training with global datasets can enhance local streamflow forecasting in data-scarce regions.</div></div>","PeriodicalId":310,"journal":{"name":"Environmental Modelling & Software","volume":"194 ","pages":"Article 106691"},"PeriodicalIF":4.6000,"publicationDate":"2025-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental Modelling & Software","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1364815225003755","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
Effective water resource and flood risk management depends on reliable streamflow forecasting. However, the accuracy of such forecasts is often limited by sparse monitoring networks and insufficient historical data. To address this issue, we explore the potential of a multi-basin training approach using the global Caravan hydrological dataset to improve local streamflow forecasting. As a case study, we focus on the Uba River basin in East Kazakhstan. The developed models are evaluated against two baselines: GR4J hydrological model and an LSTM model trained exclusively on local data. Results indicate that our approach enhances forecasting accuracy and outperforms the baseline models, with the best model achieving Nash-Sutcliffe efficiency value of 0.8187 compared to 0.72 of GR4J and 0.7602 of LSTM trained exclusively on local data. These findings indicate that multi-basin training with global datasets can enhance local streamflow forecasting in data-scarce regions.
期刊介绍:
Environmental Modelling & Software publishes contributions, in the form of research articles, reviews and short communications, on recent advances in environmental modelling and/or software. The aim is to improve our capacity to represent, understand, predict or manage the behaviour of environmental systems at all practical scales, and to communicate those improvements to a wide scientific and professional audience.