{"title":"Time-dependent shrinkage of time-varying parameter regression models","authors":"Zhongfang He","doi":"10.1080/07474938.2023.2237274","DOIUrl":null,"url":null,"abstract":"AbstractThis article studies the time-varying parameter (TVP) regression model in which the regression coefficients are random walk latent states with time-dependent conditional variances. This TVP model is flexible to accommodate a wide variety of time variation patterns but requires effective shrinkage on the state variances to avoid over-fitting. A Bayesian shrinkage prior is proposed based on reparameterization that translates the variance shrinkage problem into a variable shrinkage one in a conditionally linear regression with fixed coefficients. The proposed prior allows strong shrinkage for the state variances while maintaining the flexibility to accommodate local signals. A Bayesian estimation method is developed that employs the ancilarity-sufficiency interweaving strategy to boost sampling efficiency. Simulation study and an empirical application to forecast inflation rate illustrate the benefits of the proposed approach.KEYWORDS: ASISBayesian shrinkagehorseshoeMCMCTVPJEL Classification: C01C11C22E37 AcknowledgmentsI would like to thank Professor Esfandiar Maasoumi (the editor), an AE and two referees for many invaluable comments that have greatly improved the article. All remaining errors are my own. The views in this article are solely the author’s responsibility and are not related to the company the author works in. The author reports that there are no competing interests to declare.Notes1 Other recent examples of shrinkage TVP models with homoskedastic latent states include Cadonna et al. (Citation2020), Chan et al. (Citation2020) etc.2 Another strand of the literature allows heteroskedastic latent states by applying time-dependent spike-and-slab mixture priors for state variances (e.g. Giordani and Kohn (Citation2008), Chan et al. (Citation2012), Hauzenberger (Citation2021), Rockova and McAlinn (Citation2021)) but faces the computational hurdle due to the combinatorial complexity of sampling the mixture indicators of the spike-and-slab priors.3 See Hauzenberger et al. (Citation2020) for similar strategies for versions of TVP models where latent states follow independent Gaussian distributions rather than random walks.4 To see this, let βt* = βt - β0. The TVP model can be rewritten as yt = xt′β0 + xt′βt* + ϵt, βt* = βt−1* + ηt and β0* = 0.5 Alternative shrinkage priors for linear regressions include the spike-and-slab one (George and McCulloch (Citation1993), Ishwaran and Rao (Citation2005)) and the normal-gamma one (Griffin and Brown (Citation2010)) etc. A comprehensive comparison of the various shrinkage priors in the current TVP context is left for future research.6 The density of an inverted beta distribution IB(a, b) is p(x)=xa−1(1+x)−a−bB(a,b)I{x>0} where B(·,·) is the beta function and a and b are positive real numbers. If x∼IB(0.5,0.5), then x∼C+(0,1) and vice versa, where C+(0,1) is a standard half-Cauchy distribution with the density p(z)=2π(1+z2)I{z>0}.7 If ±x∼N(0,a), then x∼G(0.5,2a) and vice versa, where the gamma distribution G(α,β) has the density 1Γ(α)βαxα−1 exp (−xβ).8 See Johndrow et al. (Citation2020) and Hauzenberger et al. (Citation2020) for methods that use approximations to the exact algorithm of Bhattacharya et al. (Citation2016).9 Alternative approaches to simulate the latent states from a linear Gaussian state space system include Fruhwirth-Schnatter (Citation1994), Rue (Citation2001) and McCausland et al. (Citation2011) etc.10 In experiments, another two data generating processes are studied where the ratio of σ2 to the variance of the dependent variable is 0.5 and 0.8 respectively. The estimation results are qualitatively similar.11 Generating 1,000 posterior draws from the GHS and DGHS prior specifications takes about 20 and 26 seconds respectively on a standard desktop computer with a 3.0 GHz Intel Core i5 CPU, running in MATLAB R2020b.12 The ESS is computed by the initial monotone sequence method of Geyer (Citation1992) and is normalized by dividing by the number of posterior draws.13 The data source is the FRED database of the U.S. federal reserve bank of St. Louis. The series names are CPILFESL, TB3MS and UNRATE for consumer price index, 3-month treasury bill rate and unemployment rate respectively. Quarterly average is computed as the average monthly values within each quarter.14 Sampling from the GIG distribution is by adapting the Matlab function gigrnd written by Enes Makalic and Daniel Schimidt that implements an algorithm from Devroye (Citation2014).15 A Gamma distribution G(0.5,2) is equivalent to a χ2(1) distribution.16 Sampling from the polya-gamma distribution is by the Matlab function pgdraw written by Enes Makalic and Daniel Schimidt that implements an algorithm from Windle (Citation2013).","PeriodicalId":11438,"journal":{"name":"Econometric Reviews","volume":"2 1","pages":"0"},"PeriodicalIF":0.8000,"publicationDate":"2023-08-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Econometric Reviews","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/07474938.2023.2237274","RegionNum":4,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ECONOMICS","Score":null,"Total":0}
引用次数: 0
Abstract
AbstractThis article studies the time-varying parameter (TVP) regression model in which the regression coefficients are random walk latent states with time-dependent conditional variances. This TVP model is flexible to accommodate a wide variety of time variation patterns but requires effective shrinkage on the state variances to avoid over-fitting. A Bayesian shrinkage prior is proposed based on reparameterization that translates the variance shrinkage problem into a variable shrinkage one in a conditionally linear regression with fixed coefficients. The proposed prior allows strong shrinkage for the state variances while maintaining the flexibility to accommodate local signals. A Bayesian estimation method is developed that employs the ancilarity-sufficiency interweaving strategy to boost sampling efficiency. Simulation study and an empirical application to forecast inflation rate illustrate the benefits of the proposed approach.KEYWORDS: ASISBayesian shrinkagehorseshoeMCMCTVPJEL Classification: C01C11C22E37 AcknowledgmentsI would like to thank Professor Esfandiar Maasoumi (the editor), an AE and two referees for many invaluable comments that have greatly improved the article. All remaining errors are my own. The views in this article are solely the author’s responsibility and are not related to the company the author works in. The author reports that there are no competing interests to declare.Notes1 Other recent examples of shrinkage TVP models with homoskedastic latent states include Cadonna et al. (Citation2020), Chan et al. (Citation2020) etc.2 Another strand of the literature allows heteroskedastic latent states by applying time-dependent spike-and-slab mixture priors for state variances (e.g. Giordani and Kohn (Citation2008), Chan et al. (Citation2012), Hauzenberger (Citation2021), Rockova and McAlinn (Citation2021)) but faces the computational hurdle due to the combinatorial complexity of sampling the mixture indicators of the spike-and-slab priors.3 See Hauzenberger et al. (Citation2020) for similar strategies for versions of TVP models where latent states follow independent Gaussian distributions rather than random walks.4 To see this, let βt* = βt - β0. The TVP model can be rewritten as yt = xt′β0 + xt′βt* + ϵt, βt* = βt−1* + ηt and β0* = 0.5 Alternative shrinkage priors for linear regressions include the spike-and-slab one (George and McCulloch (Citation1993), Ishwaran and Rao (Citation2005)) and the normal-gamma one (Griffin and Brown (Citation2010)) etc. A comprehensive comparison of the various shrinkage priors in the current TVP context is left for future research.6 The density of an inverted beta distribution IB(a, b) is p(x)=xa−1(1+x)−a−bB(a,b)I{x>0} where B(·,·) is the beta function and a and b are positive real numbers. If x∼IB(0.5,0.5), then x∼C+(0,1) and vice versa, where C+(0,1) is a standard half-Cauchy distribution with the density p(z)=2π(1+z2)I{z>0}.7 If ±x∼N(0,a), then x∼G(0.5,2a) and vice versa, where the gamma distribution G(α,β) has the density 1Γ(α)βαxα−1 exp (−xβ).8 See Johndrow et al. (Citation2020) and Hauzenberger et al. (Citation2020) for methods that use approximations to the exact algorithm of Bhattacharya et al. (Citation2016).9 Alternative approaches to simulate the latent states from a linear Gaussian state space system include Fruhwirth-Schnatter (Citation1994), Rue (Citation2001) and McCausland et al. (Citation2011) etc.10 In experiments, another two data generating processes are studied where the ratio of σ2 to the variance of the dependent variable is 0.5 and 0.8 respectively. The estimation results are qualitatively similar.11 Generating 1,000 posterior draws from the GHS and DGHS prior specifications takes about 20 and 26 seconds respectively on a standard desktop computer with a 3.0 GHz Intel Core i5 CPU, running in MATLAB R2020b.12 The ESS is computed by the initial monotone sequence method of Geyer (Citation1992) and is normalized by dividing by the number of posterior draws.13 The data source is the FRED database of the U.S. federal reserve bank of St. Louis. The series names are CPILFESL, TB3MS and UNRATE for consumer price index, 3-month treasury bill rate and unemployment rate respectively. Quarterly average is computed as the average monthly values within each quarter.14 Sampling from the GIG distribution is by adapting the Matlab function gigrnd written by Enes Makalic and Daniel Schimidt that implements an algorithm from Devroye (Citation2014).15 A Gamma distribution G(0.5,2) is equivalent to a χ2(1) distribution.16 Sampling from the polya-gamma distribution is by the Matlab function pgdraw written by Enes Makalic and Daniel Schimidt that implements an algorithm from Windle (Citation2013).
期刊介绍:
Econometric Reviews is widely regarded as one of the top 5 core journals in econometrics. It probes the limits of econometric knowledge, featuring regular, state-of-the-art single blind refereed articles and book reviews. ER has been consistently the leader and innovator in its acclaimed retrospective and critical surveys and interchanges on current or developing topics. Special issues of the journal are developed by a world-renowned editorial board. These bring together leading experts from econometrics and beyond. Reviews of books and software are also within the scope of the journal. Its content is expressly intended to reach beyond econometrics and advanced empirical economics, to statistics and other social sciences.