Title: Enhancing Streamflow Prediction in Ungauged Basins Using a Nonlinear Knowledge-Based Framework and Deep Learning
Authors: Parnian Ghaneei, Ehsan Foroumandi, Hamid Moradkhani
Journal: Water Resources Research (Impact Factor 4.6; JCR Q2, Environmental Sciences)
Publication type: Journal Article
Publication date: 2024-10-30
DOI: 10.1029/2024wr037152 (https://doi.org/10.1029/2024wr037152)
Citations: 0
Abstract
In hydrology, a fundamental task is to improve a model's predictive power in ungauged basins by transferring information on physical attributes and hydroclimate dynamics from gauged basins. This study develops an integrated nonlinear clustering framework that augments predictive performance in basins where direct measurements are sparse or absent. In this framework, uniform manifold approximation and projection (UMAP) is used as a nonlinear method to extract the essential features embedded in hydro-climatological attributes and physical properties. Then, the Growing Neural Gas (GNG) clustering model is used to find basins that potentially share similar hydro-climatological behaviors. Besides UMAP-GNG, the integration of Principal Component Analysis (PCA), a linear dimensionality-reduction method, with common clustering methods is also assessed to serve as a benchmark. The results reveal that combining clustering algorithms with PCA may lead to loss of information, while the nonlinear method (UMAP) can extract more informative features. The efficacy of the proposed framework is assessed across the Contiguous United States (CONUS) by training a single Base Model using long short-term memory (LSTM) on the centroids of all clusters and then fine-tuning the model on the centroids of each cluster separately to create a regional model. The results indicate that using the information extracted by the UMAP-GNG method to guide a Base Model can significantly improve accuracy in most clusters and enhance the median prediction accuracy within different clusters from 0.04 to 0.37 of Kling-Gupta efficiency (KGE) in ungauged basins.
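The abstract reports accuracy in terms of KGE without defining it. As a minimal sketch, assuming the original Kling-Gupta efficiency formulation, KGE = 1 - sqrt((r - 1)^2 + (alpha - 1)^2 + (beta - 1)^2), the score could be computed as below. Here r is the linear correlation between simulated and observed flow, alpha is the ratio of their standard deviations, and beta is the ratio of their means. The function name `kling_gupta_efficiency` is hypothetical, and the paper may use a different KGE variant.

```python
import math


def kling_gupta_efficiency(simulated, observed):
    """Illustrative KGE (original formulation) for two equal-length series.

    Assumption: this is the standard 2009 form; it is not taken from the paper.
    """
    if len(simulated) != len(observed) or len(observed) < 2:
        raise ValueError("series must have equal length of at least 2")

    n = len(observed)
    mean_sim = sum(simulated) / n
    mean_obs = sum(observed) / n

    # Population variances and covariance of the two series.
    var_sim = sum((s - mean_sim) ** 2 for s in simulated) / n
    var_obs = sum((o - mean_obs) ** 2 for o in observed) / n
    cov = sum((s - mean_sim) * (o - mean_obs)
              for s, o in zip(simulated, observed)) / n

    if var_sim == 0 or var_obs == 0 or mean_obs == 0:
        raise ValueError("KGE is undefined for constant series or zero-mean observations")

    r = cov / math.sqrt(var_sim * var_obs)  # linear correlation
    alpha = math.sqrt(var_sim / var_obs)    # variability ratio (sim / obs)
    beta = mean_sim / mean_obs              # bias ratio (sim / obs)

    # Euclidean distance from the ideal point (r, alpha, beta) = (1, 1, 1).
    return 1.0 - math.sqrt((r - 1) ** 2 + (alpha - 1) ** 2 + (beta - 1) ** 2)
```

Under this definition a perfect simulation scores 1, and lower values indicate poorer agreement in correlation, variability, or bias.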
About the journal:
Water Resources Research (WRR) is an interdisciplinary journal that focuses on hydrology and water resources. It publishes original research in the natural and social sciences of water. It emphasizes the role of water in the Earth system, covering physical, chemical, biological, and ecological processes in water resources research and management, as well as social, policy, and public health implications. It encompasses observational, experimental, theoretical, analytical, numerical, and data-driven approaches that advance the science of water and its management. Submissions are evaluated for the novelty, accuracy, significance, and broader implications of their findings.