Time-dependent shrinkage of time-varying parameter regression models

IF 0.8 4区 经济学 Q3 ECONOMICS
Zhongfang He
{"title":"Time-dependent shrinkage of time-varying parameter regression models","authors":"Zhongfang He","doi":"10.1080/07474938.2023.2237274","DOIUrl":null,"url":null,"abstract":"AbstractThis article studies the time-varying parameter (TVP) regression model in which the regression coefficients are random walk latent states with time-dependent conditional variances. This TVP model is flexible to accommodate a wide variety of time variation patterns but requires effective shrinkage on the state variances to avoid over-fitting. A Bayesian shrinkage prior is proposed based on reparameterization that translates the variance shrinkage problem into a variable shrinkage one in a conditionally linear regression with fixed coefficients. The proposed prior allows strong shrinkage for the state variances while maintaining the flexibility to accommodate local signals. A Bayesian estimation method is developed that employs the ancilarity-sufficiency interweaving strategy to boost sampling efficiency. Simulation study and an empirical application to forecast inflation rate illustrate the benefits of the proposed approach.KEYWORDS: ASISBayesian shrinkagehorseshoeMCMCTVPJEL Classification: C01C11C22E37 AcknowledgmentsI would like to thank Professor Esfandiar Maasoumi (the editor), an AE and two referees for many invaluable comments that have greatly improved the article. All remaining errors are my own. The views in this article are solely the author’s responsibility and are not related to the company the author works in. The author reports that there are no competing interests to declare.Notes1 Other recent examples of shrinkage TVP models with homoskedastic latent states include Cadonna et al. (Citation2020), Chan et al. (Citation2020) etc.2 Another strand of the literature allows heteroskedastic latent states by applying time-dependent spike-and-slab mixture priors for state variances (e.g. Giordani and Kohn (Citation2008), Chan et al. (Citation2012), Hauzenberger (Citation2021), Rockova and McAlinn (Citation2021)) but faces the computational hurdle due to the combinatorial complexity of sampling the mixture indicators of the spike-and-slab priors.3 See Hauzenberger et al. (Citation2020) for similar strategies for versions of TVP models where latent states follow independent Gaussian distributions rather than random walks.4 To see this, let βt* = βt - β0. The TVP model can be rewritten as yt = xt′β0 + xt′βt* + ϵt, βt* = βt−1* + ηt and β0* = 0.5 Alternative shrinkage priors for linear regressions include the spike-and-slab one (George and McCulloch (Citation1993), Ishwaran and Rao (Citation2005)) and the normal-gamma one (Griffin and Brown (Citation2010)) etc. A comprehensive comparison of the various shrinkage priors in the current TVP context is left for future research.6 The density of an inverted beta distribution IB(a, b) is p(x)=xa−1(1+x)−a−bB(a,b)I{x>0} where B(·,·) is the beta function and a and b are positive real numbers. If x∼IB(0.5,0.5), then x∼C+(0,1) and vice versa, where C+(0,1) is a standard half-Cauchy distribution with the density p(z)=2π(1+z2)I{z>0}.7 If ±x∼N(0,a), then x∼G(0.5,2a) and vice versa, where the gamma distribution G(α,β) has the density 1Γ(α)βαxα−1 exp (−xβ).8 See Johndrow et al. (Citation2020) and Hauzenberger et al. (Citation2020) for methods that use approximations to the exact algorithm of Bhattacharya et al. (Citation2016).9 Alternative approaches to simulate the latent states from a linear Gaussian state space system include Fruhwirth-Schnatter (Citation1994), Rue (Citation2001) and McCausland et al. (Citation2011) etc.10 In experiments, another two data generating processes are studied where the ratio of σ2 to the variance of the dependent variable is 0.5 and 0.8 respectively. The estimation results are qualitatively similar.11 Generating 1,000 posterior draws from the GHS and DGHS prior specifications takes about 20 and 26 seconds respectively on a standard desktop computer with a 3.0 GHz Intel Core i5 CPU, running in MATLAB R2020b.12 The ESS is computed by the initial monotone sequence method of Geyer (Citation1992) and is normalized by dividing by the number of posterior draws.13 The data source is the FRED database of the U.S. federal reserve bank of St. Louis. The series names are CPILFESL, TB3MS and UNRATE for consumer price index, 3-month treasury bill rate and unemployment rate respectively. Quarterly average is computed as the average monthly values within each quarter.14 Sampling from the GIG distribution is by adapting the Matlab function gigrnd written by Enes Makalic and Daniel Schimidt that implements an algorithm from Devroye (Citation2014).15 A Gamma distribution G(0.5,2) is equivalent to a χ2(1) distribution.16 Sampling from the polya-gamma distribution is by the Matlab function pgdraw written by Enes Makalic and Daniel Schimidt that implements an algorithm from Windle (Citation2013).","PeriodicalId":11438,"journal":{"name":"Econometric Reviews","volume":"2 1","pages":"0"},"PeriodicalIF":0.8000,"publicationDate":"2023-08-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Econometric Reviews","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/07474938.2023.2237274","RegionNum":4,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ECONOMICS","Score":null,"Total":0}
引用次数: 0

Abstract

AbstractThis article studies the time-varying parameter (TVP) regression model in which the regression coefficients are random walk latent states with time-dependent conditional variances. This TVP model is flexible to accommodate a wide variety of time variation patterns but requires effective shrinkage on the state variances to avoid over-fitting. A Bayesian shrinkage prior is proposed based on reparameterization that translates the variance shrinkage problem into a variable shrinkage one in a conditionally linear regression with fixed coefficients. The proposed prior allows strong shrinkage for the state variances while maintaining the flexibility to accommodate local signals. A Bayesian estimation method is developed that employs the ancilarity-sufficiency interweaving strategy to boost sampling efficiency. Simulation study and an empirical application to forecast inflation rate illustrate the benefits of the proposed approach.KEYWORDS: ASISBayesian shrinkagehorseshoeMCMCTVPJEL Classification: C01C11C22E37 AcknowledgmentsI would like to thank Professor Esfandiar Maasoumi (the editor), an AE and two referees for many invaluable comments that have greatly improved the article. All remaining errors are my own. The views in this article are solely the author’s responsibility and are not related to the company the author works in. The author reports that there are no competing interests to declare.Notes1 Other recent examples of shrinkage TVP models with homoskedastic latent states include Cadonna et al. (Citation2020), Chan et al. (Citation2020) etc.2 Another strand of the literature allows heteroskedastic latent states by applying time-dependent spike-and-slab mixture priors for state variances (e.g. Giordani and Kohn (Citation2008), Chan et al. (Citation2012), Hauzenberger (Citation2021), Rockova and McAlinn (Citation2021)) but faces the computational hurdle due to the combinatorial complexity of sampling the mixture indicators of the spike-and-slab priors.3 See Hauzenberger et al. (Citation2020) for similar strategies for versions of TVP models where latent states follow independent Gaussian distributions rather than random walks.4 To see this, let βt* = βt - β0. The TVP model can be rewritten as yt = xt′β0 + xt′βt* + ϵt, βt* = βt−1* + ηt and β0* = 0.5 Alternative shrinkage priors for linear regressions include the spike-and-slab one (George and McCulloch (Citation1993), Ishwaran and Rao (Citation2005)) and the normal-gamma one (Griffin and Brown (Citation2010)) etc. A comprehensive comparison of the various shrinkage priors in the current TVP context is left for future research.6 The density of an inverted beta distribution IB(a, b) is p(x)=xa−1(1+x)−a−bB(a,b)I{x>0} where B(·,·) is the beta function and a and b are positive real numbers. If x∼IB(0.5,0.5), then x∼C+(0,1) and vice versa, where C+(0,1) is a standard half-Cauchy distribution with the density p(z)=2π(1+z2)I{z>0}.7 If ±x∼N(0,a), then x∼G(0.5,2a) and vice versa, where the gamma distribution G(α,β) has the density 1Γ(α)βαxα−1 exp (−xβ).8 See Johndrow et al. (Citation2020) and Hauzenberger et al. (Citation2020) for methods that use approximations to the exact algorithm of Bhattacharya et al. (Citation2016).9 Alternative approaches to simulate the latent states from a linear Gaussian state space system include Fruhwirth-Schnatter (Citation1994), Rue (Citation2001) and McCausland et al. (Citation2011) etc.10 In experiments, another two data generating processes are studied where the ratio of σ2 to the variance of the dependent variable is 0.5 and 0.8 respectively. The estimation results are qualitatively similar.11 Generating 1,000 posterior draws from the GHS and DGHS prior specifications takes about 20 and 26 seconds respectively on a standard desktop computer with a 3.0 GHz Intel Core i5 CPU, running in MATLAB R2020b.12 The ESS is computed by the initial monotone sequence method of Geyer (Citation1992) and is normalized by dividing by the number of posterior draws.13 The data source is the FRED database of the U.S. federal reserve bank of St. Louis. The series names are CPILFESL, TB3MS and UNRATE for consumer price index, 3-month treasury bill rate and unemployment rate respectively. Quarterly average is computed as the average monthly values within each quarter.14 Sampling from the GIG distribution is by adapting the Matlab function gigrnd written by Enes Makalic and Daniel Schimidt that implements an algorithm from Devroye (Citation2014).15 A Gamma distribution G(0.5,2) is equivalent to a χ2(1) distribution.16 Sampling from the polya-gamma distribution is by the Matlab function pgdraw written by Enes Makalic and Daniel Schimidt that implements an algorithm from Windle (Citation2013).
时变参数回归模型的时变收缩
摘要本文研究了时变参数(TVP)回归模型,该模型的回归系数为随时间变化的条件方差的随机游走潜态。该TVP模型可以灵活地适应各种时间变化模式,但需要有效地缩小状态方差以避免过度拟合。提出了一种基于重参数化的贝叶斯收缩先验,将方差收缩问题转化为固定系数条件线性回归中的可变收缩问题。所提出的先验允许对状态方差进行强收缩,同时保持适应本地信号的灵活性。提出了一种贝叶斯估计方法,该方法采用辅助充分交织策略来提高采样效率。仿真研究和对通货膨胀率预测的实证应用表明了该方法的有效性。我要感谢Esfandiar Maasoumi教授(编辑)、一位AE和两位审稿人的宝贵意见,他们极大地改进了本文。所有剩下的错误都是我自己的。本文中的观点完全是作者的责任,与作者工作的公司无关。作者报告说,没有相互竞争的利益需要申报。注1其他具有同方差潜在状态的缩窄TVP模型的最新例子包括Cadonna等人(Citation2020), Chan等人(Citation2020)等。2另一种文献通过对状态方差应用时间相关的峰值-板混合先验来允许异方差潜在状态(例如Giordani和Kohn (Citation2008), Chan等人(Citation2012), Hauzenberger (Citation2021),Rockova和McAlinn (Citation2021)),但由于对尖刺和板状先验的混合指标进行采样的组合复杂性,面临计算障碍参见Hauzenberger等人(Citation2020)了解潜在状态遵循独立高斯分布而不是随机游动的TVP模型版本的类似策略让βt* = βt - β0。TVP模型可以改写为yt = xt ' β0 + xt ' βt* + ϵt, βt* = βt - 1* + ηt和β0* = 0.5。线性回归的收缩先验包括尖钉-板模型(George and McCulloch (Citation1993), Ishwaran and Rao (Citation2005))和正伽马模型(Griffin and Brown (Citation2010))等。在目前的TVP背景下,各种收缩先验的综合比较将留给未来的研究倒β分布IB(a, b)的密度为p(x)=xa−1(1+x)−a−bB(a,b)I{x>0},其中b(·,·)为β函数,a和b为正实数。如果x ~ IB(0.5,0.5),则x ~ C+(0,1),反之亦然,其中C+(0,1)是密度p(z)=2π(1+z2)I{z>}.7的标准半柯西分布如果±x ~ N(0,a),则x ~ G(0.5,2a),反之亦然,其中伽马分布G(α,β)的密度为1Γ(α)βαxα−1 exp(−xβ) 8参见Johndrow等人(Citation2020)和Hauzenberger等人(Citation2020)了解使用近似Bhattacharya等人(Citation2016)的精确算法的方法从线性高斯状态空间系统模拟潜在状态的替代方法包括Fruhwirth-Schnatter (Citation1994), Rue (Citation2001)和McCausland等人(Citation2011)等。10在实验中,研究了另外两个数据生成过程,其中σ2与因变量方差的比值分别为0.5和0.8。估计结果在质量上是相似的在具有3.0 GHz Intel Core i5 CPU的标准台式计算机上,在MATLAB R2020b.12中运行,根据GHS和DGHS先验规范生成1000张后验图分别需要大约20秒和26秒ESS由Geyer (Citation1992)的初始单调序列方法计算,并通过除以后验图的数量进行归一化数据来源为美国圣路易斯联邦储备银行FRED数据库。消费者物价指数、3个月国库券利率和失业率的系列名称分别为CPILFESL、TB3MS和UNRATE。季度平均值按每个季度内每月的平均值计算从GIG分布中采样是通过采用由Enes Makalic和Daniel Schimidt编写的Matlab函数gigrnd,该函数实现了Devroye (Citation2014)的算法Gamma分布G(0.5,2)等价于χ2(1)分布通过Enes Makalic和Daniel Schimidt编写的Matlab函数pgdraw对polya-gamma分布进行采样,该函数实现了来自Windle (Citation2013)的算法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Econometric Reviews
Econometric Reviews 管理科学-数学跨学科应用
CiteScore
1.70
自引率
0.00%
发文量
27
审稿时长
>12 weeks
期刊介绍: Econometric Reviews is widely regarded as one of the top 5 core journals in econometrics. It probes the limits of econometric knowledge, featuring regular, state-of-the-art single blind refereed articles and book reviews. ER has been consistently the leader and innovator in its acclaimed retrospective and critical surveys and interchanges on current or developing topics. Special issues of the journal are developed by a world-renowned editorial board. These bring together leading experts from econometrics and beyond. Reviews of books and software are also within the scope of the journal. Its content is expressly intended to reach beyond econometrics and advanced empirical economics, to statistics and other social sciences.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信