{"title":"基于稀疏负载矩阵的贝叶斯因子分析识别新方法","authors":"M. Pape","doi":"10.2139/ssrn.2399368","DOIUrl":null,"url":null,"abstract":"Sparse factor analysis comprises aspects of exploratory and confirmatory factor analysis, seeking to establish a parsimonious structure in the loadings matrix of the model. This task is related to the issue of determining the number of factors required for model representation, the question of which variables are useful and which ones can be excluded from the analysis, and the problem whether some variables are driven by a subset of all factors only. Whereas sparsity analysis focuses mainly on the third of these questions, it can provide helpful hints to tackle the first two questions as well. I use multivariate highest posterior density (HPD) intervals calculated for the posterior densities derived from the weighted orthogonal Procrustes (WOP) ex-post identification approach to find a sparse loadings structure. In a simulation study, this method is used to identify different sparse structures, including those with excess variables, and to determine the number of factors in the model, where all three tasks are well achieved. Eventually, I apply the approach on a data set of intelligence test results to determine the number of factors, the required variables and the sparsity structure, where it yields results not only well-comprehensible, but also very similar to those found in former studies analyzing the data set.","PeriodicalId":384078,"journal":{"name":"ERN: Other Econometrics: Data Collection & Data Estimation Methodology (Topic)","volume":"45 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-02-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Novel Identification Approach to Bayesian Factor Analysis with Sparse Loadings Matrices\",\"authors\":\"M. Pape\",\"doi\":\"10.2139/ssrn.2399368\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Sparse factor analysis comprises aspects of exploratory and confirmatory factor analysis, seeking to establish a parsimonious structure in the loadings matrix of the model. This task is related to the issue of determining the number of factors required for model representation, the question of which variables are useful and which ones can be excluded from the analysis, and the problem whether some variables are driven by a subset of all factors only. Whereas sparsity analysis focuses mainly on the third of these questions, it can provide helpful hints to tackle the first two questions as well. I use multivariate highest posterior density (HPD) intervals calculated for the posterior densities derived from the weighted orthogonal Procrustes (WOP) ex-post identification approach to find a sparse loadings structure. In a simulation study, this method is used to identify different sparse structures, including those with excess variables, and to determine the number of factors in the model, where all three tasks are well achieved. Eventually, I apply the approach on a data set of intelligence test results to determine the number of factors, the required variables and the sparsity structure, where it yields results not only well-comprehensible, but also very similar to those found in former studies analyzing the data set.\",\"PeriodicalId\":384078,\"journal\":{\"name\":\"ERN: Other Econometrics: Data Collection & Data Estimation Methodology (Topic)\",\"volume\":\"45 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-02-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ERN: Other Econometrics: Data Collection & Data Estimation Methodology (Topic)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.2139/ssrn.2399368\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ERN: Other Econometrics: Data Collection & Data Estimation Methodology (Topic)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2139/ssrn.2399368","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Novel Identification Approach to Bayesian Factor Analysis with Sparse Loadings Matrices
Sparse factor analysis comprises aspects of exploratory and confirmatory factor analysis, seeking to establish a parsimonious structure in the loadings matrix of the model. This task is related to the issue of determining the number of factors required for model representation, the question of which variables are useful and which ones can be excluded from the analysis, and the problem whether some variables are driven by a subset of all factors only. Whereas sparsity analysis focuses mainly on the third of these questions, it can provide helpful hints to tackle the first two questions as well. I use multivariate highest posterior density (HPD) intervals calculated for the posterior densities derived from the weighted orthogonal Procrustes (WOP) ex-post identification approach to find a sparse loadings structure. In a simulation study, this method is used to identify different sparse structures, including those with excess variables, and to determine the number of factors in the model, where all three tasks are well achieved. Eventually, I apply the approach on a data set of intelligence test results to determine the number of factors, the required variables and the sparsity structure, where it yields results not only well-comprehensible, but also very similar to those found in former studies analyzing the data set.