高度共线性回归模型的贝叶斯分析

USC Dornsife Institute for New Economic Thinking Research Paper Series Pub Date : 2018-10-06 DOI:10.2139/ssrn.3264842

M. Pesaran, Ron P. Smith

{"title":"高度共线性回归模型的贝叶斯分析","authors":"M. Pesaran, Ron P. Smith","doi":"10.2139/ssrn.3264842","DOIUrl":null,"url":null,"abstract":"Exact collinearity between regressors makes their individual coefficients not identified. But, given an informative prior, their Bayesian posterior means are well defined. Just as exact collinearity causes non-identification of the parameters, high collinearity can be viewed as weak identification of the parameters, which is represented, in line with the weak instrument literature, by the correlation matrix being of full rank for a finite sample size T, but converging to a rank deficient matrix as T goes to infinity. The asymptotic behaviour of the posterior mean and precision of the parameters of a linear regression model are examined in the cases of exactly and highly collinear regressors. In both cases the posterior mean remains sensitive to the choice of prior means even if the sample size is sufficiently large, and that the precision rises at a slower rate than the sample size. In the highly collinear case, the posterior means converge to normally distributed random variables whose mean and variance depend on the prior means and prior precisions. The distribution degenerates to fixed points for either exact collinearity or strong identification. The analysis also suggests a diagnostic statistic for the highly collinear case. Monte Carlo simulations and an empirical example are used to illustrate the main findings.","PeriodicalId":150248,"journal":{"name":"USC Dornsife Institute for New Economic Thinking Research Paper Series","volume":"171 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-10-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"12","resultStr":"{\"title\":\"A Bayesian Analysis of Linear Regression Models with Highly Collinear Regressors\",\"authors\":\"M. Pesaran, Ron P. Smith\",\"doi\":\"10.2139/ssrn.3264842\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Exact collinearity between regressors makes their individual coefficients not identified. But, given an informative prior, their Bayesian posterior means are well defined. Just as exact collinearity causes non-identification of the parameters, high collinearity can be viewed as weak identification of the parameters, which is represented, in line with the weak instrument literature, by the correlation matrix being of full rank for a finite sample size T, but converging to a rank deficient matrix as T goes to infinity. The asymptotic behaviour of the posterior mean and precision of the parameters of a linear regression model are examined in the cases of exactly and highly collinear regressors. In both cases the posterior mean remains sensitive to the choice of prior means even if the sample size is sufficiently large, and that the precision rises at a slower rate than the sample size. In the highly collinear case, the posterior means converge to normally distributed random variables whose mean and variance depend on the prior means and prior precisions. The distribution degenerates to fixed points for either exact collinearity or strong identification. The analysis also suggests a diagnostic statistic for the highly collinear case. Monte Carlo simulations and an empirical example are used to illustrate the main findings.\",\"PeriodicalId\":150248,\"journal\":{\"name\":\"USC Dornsife Institute for New Economic Thinking Research Paper Series\",\"volume\":\"171 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-10-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"12\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"USC Dornsife Institute for New Economic Thinking Research Paper Series\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.2139/ssrn.3264842\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"USC Dornsife Institute for New Economic Thinking Research Paper Series","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2139/ssrn.3264842","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 12

摘要

回归量之间的确切共线性使得它们的个别系数无法确定。但是，给定一个信息先验，它们的贝叶斯后验均值被很好地定义。正如精确共线性导致参数无法识别一样，高共线性可以被视为参数的弱识别，根据弱仪器文献，在有限样本量T下，相关矩阵为满秩，但在T趋于无穷时收敛为秩不足矩阵。在完全共线性和高度共线性回归的情况下，检验了线性回归模型的后验均值的渐近行为和参数的精度。在这两种情况下，即使样本量足够大，后验均值仍然对先验均值的选择敏感，而且精度的上升速度低于样本量的增长速度。在高度共线性的情况下，后验均值收敛于正态分布的随机变量，其均值和方差取决于先验均值和先验精度。对于精确共线性或强辨识，分布退化为不动点。分析还提出了高度共线病例的诊断统计。蒙特卡罗模拟和一个经验例子被用来说明主要发现。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A Bayesian Analysis of Linear Regression Models with Highly Collinear Regressors

Exact collinearity between regressors makes their individual coefficients not identified. But, given an informative prior, their Bayesian posterior means are well defined. Just as exact collinearity causes non-identification of the parameters, high collinearity can be viewed as weak identification of the parameters, which is represented, in line with the weak instrument literature, by the correlation matrix being of full rank for a finite sample size T, but converging to a rank deficient matrix as T goes to infinity. The asymptotic behaviour of the posterior mean and precision of the parameters of a linear regression model are examined in the cases of exactly and highly collinear regressors. In both cases the posterior mean remains sensitive to the choice of prior means even if the sample size is sufficiently large, and that the precision rises at a slower rate than the sample size. In the highly collinear case, the posterior means converge to normally distributed random variables whose mean and variance depend on the prior means and prior precisions. The distribution degenerates to fixed points for either exact collinearity or strong identification. The analysis also suggests a diagnostic statistic for the highly collinear case. Monte Carlo simulations and an empirical example are used to illustrate the main findings.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

USC Dornsife Institute for New Economic Thinking Research Paper Series

自引率

0.00%

发文量