Posterior Means and Precisions of the Coefficients in Linear Models with Highly Collinear Regressors

M. Pesaran, Ron P. Smith
{"title":"Posterior Means and Precisions of the Coefficients in Linear Models with Highly Collinear Regressors","authors":"M. Pesaran, Ron P. Smith","doi":"10.2139/ssrn.3076052","DOIUrl":null,"url":null,"abstract":"When there is exact collinearity between regressors, their individual coefficients are not identified, but given an informative prior their Bayesian posterior means are well defined. The case of high but not exact collinearity is more complicated but similar results follow. Just as exact collinearity causes non-identification of the parameters, high collinearity can be viewed as weak identification of the parameters, which we represent, in line with the weak instrument literature, by the correlation matrix being of full rank for a finite sample size T, but converging to a rank defficient matrix as T goes to infinity. This paper examines the asymptotic behavior of the posterior mean and precision of the parameters of a linear regression model for both the cases of exactly and highly collinear regressors. We show that in both cases the posterior mean remains sensitive to the choice of prior means even if the sample size is sufficiently large, and that the precision rises at a slower rate than the sample size. In the highly collinear case, the posterior means converge to normally distributed random variables whose mean and variance depend on the priors for coefficients and precision. The distribution degenerates to fixed points for either exact collinearity or strong identification. The analysis also suggests a diagnostic statistic for the highly collinear case, which is illustrated with an empirical example.","PeriodicalId":320844,"journal":{"name":"PSN: Econometrics","volume":"60 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"PSN: Econometrics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2139/ssrn.3076052","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

When there is exact collinearity between regressors, their individual coefficients are not identified, but given an informative prior their Bayesian posterior means are well defined. The case of high but not exact collinearity is more complicated but similar results follow. Just as exact collinearity causes non-identification of the parameters, high collinearity can be viewed as weak identification of the parameters, which we represent, in line with the weak instrument literature, by the correlation matrix being of full rank for a finite sample size T, but converging to a rank defficient matrix as T goes to infinity. This paper examines the asymptotic behavior of the posterior mean and precision of the parameters of a linear regression model for both the cases of exactly and highly collinear regressors. We show that in both cases the posterior mean remains sensitive to the choice of prior means even if the sample size is sufficiently large, and that the precision rises at a slower rate than the sample size. In the highly collinear case, the posterior means converge to normally distributed random variables whose mean and variance depend on the priors for coefficients and precision. The distribution degenerates to fixed points for either exact collinearity or strong identification. The analysis also suggests a diagnostic statistic for the highly collinear case, which is illustrated with an empirical example.
具有高度共线性回归量的线性模型系数的后验均值和精度
当回归量之间存在确切的共线性时,它们的个别系数不被识别,但给定一个信息先验,它们的贝叶斯后验均值被很好地定义。高但不精确共线性的情况更复杂,但结果相似。正如精确共线性导致参数无法识别一样,高共线性可以被视为参数的弱识别,根据弱仪器文献,我们表示,对于有限样本量T,相关矩阵是满秩的,但当T趋于无穷时收敛为秩不足矩阵。本文研究了完全共线和高度共线两种情况下线性回归模型的后验均值的渐近行为和参数的精度。我们表明,在这两种情况下,即使样本量足够大,后验均值仍然对先验均值的选择敏感,并且精度的上升速度低于样本量的增长速度。在高度共线性的情况下,后验均值收敛于正态分布的随机变量,其均值和方差依赖于系数和精度的先验。对于精确共线性或强辨识,分布退化为不动点。分析还提出了一个诊断统计高度共线的情况下,这是一个实证例子说明。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信