Rajitha Athukorala, Joshua A. Simmons, Sally Cripps, R. Vervoort
{"title":"Disentangling hydrological mixtures","authors":"Rajitha Athukorala, Joshua A. Simmons, Sally Cripps, R. Vervoort","doi":"10.36334/modsim.2023.athukorala550","DOIUrl":null,"url":null,"abstract":": Streamflow timeseries in different parts of the world have their own unique features, posing challenges for hydrological modelling. In Australia, more than 70% of the rivers are non-perennial (i.e. rivers which have no flow for at least part of the year) (Shanafield et al., 2020) and the transition from zero or low flows to high flows is often rapid, resulting in flash floods. Therefore, any forecasting model for the mean and variability of Australian streamflow needs to account for these unique features. To account for rivers which have no streamflow for part of the year we propose a Bayesian Hierarchical Mixture of Experts (BHME) model where the streamflow distribution has two components. The first component of the mixture is a point mass at zero for zero flows, and the second is a Gamma distribution for non-zero flows. As in all hydrological modelling, streamflow data is derived from river height observations via a rating curve which maps river heights to discharge rates. In this paper we take a Bayesian approach and use the posterior mean of stream discharge given river height data for this mapping. To identify zero streamflow, we take the lowest recorded river height, compute the expected value of stream flow given this height, and its corresponding 95% credible interval. If a streamflow observation is lower than the lower limit of this credible interval, then we categorise it as zero. The probability of streamflow at any given day, belonging to either the zero or non-zero flow component is modelled using a logistic regression. The logistic regression model as well as the parameters of the Gamma distribution are parameterized to depend on upstream streamflow and rainfall from the previous day. The second approach is another BHME model with two components to model sudden changes in non-zero flow regimes common to Australian rivers. The two components in this approach are both Gamma densities which are parameterized to depend on upstream streamflow and rainfall from the previous day. The mixture weights in this approach depends on the same set of covariates through a logit link function as in the first approach. The models are estimated in a Bayesian framework using Hamiltonian Monte Carlo with the No-U-Turn Sampler (NUTS) (Homan and Gelman, 2014), to perform the required multidimensional integration and generate samples from the posterior distribution of the quantities of interest. These approaches provide a statistically robust method to model zero observations as well as sudden changes in streamflow. The logistic regression in the first approach provides useful information regarding the transition from flow to no flow (and vice versa) which a single component model cannot provide. The two-component model in the second approach provides better fit to the data and better predictive densities compared to a single component model. The transitions from one component to another in the second approach provides useful information in sudden changes in flow regimes commonly seen in Australia.","PeriodicalId":390064,"journal":{"name":"MODSIM2023, 25th International Congress on Modelling and Simulation.","volume":"159 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"MODSIM2023, 25th International Congress on Modelling and Simulation.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.36334/modsim.2023.athukorala550","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
: Streamflow timeseries in different parts of the world have their own unique features, posing challenges for hydrological modelling. In Australia, more than 70% of the rivers are non-perennial (i.e. rivers which have no flow for at least part of the year) (Shanafield et al., 2020) and the transition from zero or low flows to high flows is often rapid, resulting in flash floods. Therefore, any forecasting model for the mean and variability of Australian streamflow needs to account for these unique features. To account for rivers which have no streamflow for part of the year we propose a Bayesian Hierarchical Mixture of Experts (BHME) model where the streamflow distribution has two components. The first component of the mixture is a point mass at zero for zero flows, and the second is a Gamma distribution for non-zero flows. As in all hydrological modelling, streamflow data is derived from river height observations via a rating curve which maps river heights to discharge rates. In this paper we take a Bayesian approach and use the posterior mean of stream discharge given river height data for this mapping. To identify zero streamflow, we take the lowest recorded river height, compute the expected value of stream flow given this height, and its corresponding 95% credible interval. If a streamflow observation is lower than the lower limit of this credible interval, then we categorise it as zero. The probability of streamflow at any given day, belonging to either the zero or non-zero flow component is modelled using a logistic regression. The logistic regression model as well as the parameters of the Gamma distribution are parameterized to depend on upstream streamflow and rainfall from the previous day. The second approach is another BHME model with two components to model sudden changes in non-zero flow regimes common to Australian rivers. The two components in this approach are both Gamma densities which are parameterized to depend on upstream streamflow and rainfall from the previous day. The mixture weights in this approach depends on the same set of covariates through a logit link function as in the first approach. The models are estimated in a Bayesian framework using Hamiltonian Monte Carlo with the No-U-Turn Sampler (NUTS) (Homan and Gelman, 2014), to perform the required multidimensional integration and generate samples from the posterior distribution of the quantities of interest. These approaches provide a statistically robust method to model zero observations as well as sudden changes in streamflow. The logistic regression in the first approach provides useful information regarding the transition from flow to no flow (and vice versa) which a single component model cannot provide. The two-component model in the second approach provides better fit to the data and better predictive densities compared to a single component model. The transitions from one component to another in the second approach provides useful information in sudden changes in flow regimes commonly seen in Australia.