{"title":"高度共线性回归模型的贝叶斯分析","authors":"M. Pesaran, Ron P. Smith","doi":"10.2139/ssrn.3264842","DOIUrl":null,"url":null,"abstract":"Exact collinearity between regressors makes their individual coefficients not identified. But, given an informative prior, their Bayesian posterior means are well defined. Just as exact collinearity causes non-identification of the parameters, high collinearity can be viewed as weak identification of the parameters, which is represented, in line with the weak instrument literature, by the correlation matrix being of full rank for a finite sample size T, but converging to a rank deficient matrix as T goes to infinity. The asymptotic behaviour of the posterior mean and precision of the parameters of a linear regression model are examined in the cases of exactly and highly collinear regressors. In both cases the posterior mean remains sensitive to the choice of prior means even if the sample size is sufficiently large, and that the precision rises at a slower rate than the sample size. In the highly collinear case, the posterior means converge to normally distributed random variables whose mean and variance depend on the prior means and prior precisions. The distribution degenerates to fixed points for either exact collinearity or strong identification. The analysis also suggests a diagnostic statistic for the highly collinear case. Monte Carlo simulations and an empirical example are used to illustrate the main findings.","PeriodicalId":150248,"journal":{"name":"USC Dornsife Institute for New Economic Thinking Research Paper Series","volume":"171 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-10-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"12","resultStr":"{\"title\":\"A Bayesian Analysis of Linear Regression Models with Highly Collinear Regressors\",\"authors\":\"M. Pesaran, Ron P. Smith\",\"doi\":\"10.2139/ssrn.3264842\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Exact collinearity between regressors makes their individual coefficients not identified. But, given an informative prior, their Bayesian posterior means are well defined. Just as exact collinearity causes non-identification of the parameters, high collinearity can be viewed as weak identification of the parameters, which is represented, in line with the weak instrument literature, by the correlation matrix being of full rank for a finite sample size T, but converging to a rank deficient matrix as T goes to infinity. The asymptotic behaviour of the posterior mean and precision of the parameters of a linear regression model are examined in the cases of exactly and highly collinear regressors. In both cases the posterior mean remains sensitive to the choice of prior means even if the sample size is sufficiently large, and that the precision rises at a slower rate than the sample size. In the highly collinear case, the posterior means converge to normally distributed random variables whose mean and variance depend on the prior means and prior precisions. The distribution degenerates to fixed points for either exact collinearity or strong identification. The analysis also suggests a diagnostic statistic for the highly collinear case. Monte Carlo simulations and an empirical example are used to illustrate the main findings.\",\"PeriodicalId\":150248,\"journal\":{\"name\":\"USC Dornsife Institute for New Economic Thinking Research Paper Series\",\"volume\":\"171 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-10-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"12\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"USC Dornsife Institute for New Economic Thinking Research Paper Series\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.2139/ssrn.3264842\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"USC Dornsife Institute for New Economic Thinking Research Paper Series","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2139/ssrn.3264842","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Bayesian Analysis of Linear Regression Models with Highly Collinear Regressors
Exact collinearity between regressors makes their individual coefficients not identified. But, given an informative prior, their Bayesian posterior means are well defined. Just as exact collinearity causes non-identification of the parameters, high collinearity can be viewed as weak identification of the parameters, which is represented, in line with the weak instrument literature, by the correlation matrix being of full rank for a finite sample size T, but converging to a rank deficient matrix as T goes to infinity. The asymptotic behaviour of the posterior mean and precision of the parameters of a linear regression model are examined in the cases of exactly and highly collinear regressors. In both cases the posterior mean remains sensitive to the choice of prior means even if the sample size is sufficiently large, and that the precision rises at a slower rate than the sample size. In the highly collinear case, the posterior means converge to normally distributed random variables whose mean and variance depend on the prior means and prior precisions. The distribution degenerates to fixed points for either exact collinearity or strong identification. The analysis also suggests a diagnostic statistic for the highly collinear case. Monte Carlo simulations and an empirical example are used to illustrate the main findings.