{"title":"Generalized Matrix Factor Model","authors":"Xinbing Kong, Tong Zhang","doi":"arxiv-2409.10001","DOIUrl":null,"url":null,"abstract":"This article introduces a nonlinear generalized matrix factor model (GMFM)\nthat allows for mixed-type variables, extending the scope of linear matrix\nfactor models (LMFM) that are so far limited to handling continuous variables.\nWe introduce a novel augmented Lagrange multiplier method, equivalent to the\nconstraint maximum likelihood estimation, and carefully tailored to be locally\nconcave around the true factor and loading parameters. This statistically\nguarantees the local convexity of the negative Hessian matrix around the true\nparameters of the factors and loadings, which is nontrivial in the matrix\nfactor modeling and leads to feasible central limit theorems of the estimated\nfactors and loadings. We also theoretically establish the convergence rates of\nthe estimated factor and loading matrices for the GMFM under general conditions\nthat allow for correlations across samples, rows, and columns. Moreover, we\nprovide a model selection criterion to determine the numbers of row and column\nfactors consistently. To numerically compute the constraint maximum likelihood\nestimator, we provide two algorithms: two-stage alternating maximization and\nminorization maximization. Extensive simulation studies demonstrate GMFM's\nsuperiority in handling discrete and mixed-type variables. An empirical data\nanalysis of the company's operating performance shows that GMFM does clustering\nand reconstruction well in the presence of discontinuous entries in the data\nmatrix.","PeriodicalId":501425,"journal":{"name":"arXiv - STAT - Methodology","volume":"29 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - STAT - Methodology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.10001","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
This article introduces a nonlinear generalized matrix factor model (GMFM)
that allows for mixed-type variables, extending the scope of linear matrix
factor models (LMFM) that are so far limited to handling continuous variables.
We introduce a novel augmented Lagrange multiplier method, equivalent to the
constraint maximum likelihood estimation, and carefully tailored to be locally
concave around the true factor and loading parameters. This statistically
guarantees the local convexity of the negative Hessian matrix around the true
parameters of the factors and loadings, which is nontrivial in the matrix
factor modeling and leads to feasible central limit theorems of the estimated
factors and loadings. We also theoretically establish the convergence rates of
the estimated factor and loading matrices for the GMFM under general conditions
that allow for correlations across samples, rows, and columns. Moreover, we
provide a model selection criterion to determine the numbers of row and column
factors consistently. To numerically compute the constraint maximum likelihood
estimator, we provide two algorithms: two-stage alternating maximization and
minorization maximization. Extensive simulation studies demonstrate GMFM's
superiority in handling discrete and mixed-type variables. An empirical data
analysis of the company's operating performance shows that GMFM does clustering
and reconstruction well in the presence of discontinuous entries in the data
matrix.