{"title":"Robust factorization for high-dimensional matrix-variate observations","authors":"Yalin Wang , Long Yu","doi":"10.1016/j.jmva.2025.105467","DOIUrl":null,"url":null,"abstract":"<div><div>Large-dimensional matrix-variate observations have been ubiquitous in the big data era, while unsupervised low-rank approximation technique would help reveal their hidden patterns and structures. In this paper, we study a hierarchical Canonical Polyadic (CP) product matrix factor model under the elliptical framework, which essentially assumes that the matrix-variate observations are from a matrix elliptical distribution. The proposed model not only incorporates the row-wise and column-wise interrelated information, but also adapts to the tail properties of the matrix-variate observations. We resort to the matrix Kendall’s tau introduced in the recent literature to recover the loading spaces, and minimize the square loss function to estimate the factor scores. We also propose an eigenvalue-ratio method to estimate the pair of factor numbers. Thorough theories for the model estimation, including statistical consistency and rates of convergence, are established under regular conditions. It is worth highlighting that the proposed method exhibits superior performance compared to other methods for estimating the signal part, particularly in the heavy-tailed cases. This superiority has been thoroughly validated through extensive simulations. The effectiveness in matrix reconstruction of the proposed method is demonstrated by applying it to a macroeconomic dataset of China.</div></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"210 ","pages":"Article 105467"},"PeriodicalIF":1.4000,"publicationDate":"2025-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Multivariate Analysis","FirstCategoryId":"100","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0047259X25000624","RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}
引用次数: 0
Abstract
Large-dimensional matrix-variate observations have been ubiquitous in the big data era, while unsupervised low-rank approximation technique would help reveal their hidden patterns and structures. In this paper, we study a hierarchical Canonical Polyadic (CP) product matrix factor model under the elliptical framework, which essentially assumes that the matrix-variate observations are from a matrix elliptical distribution. The proposed model not only incorporates the row-wise and column-wise interrelated information, but also adapts to the tail properties of the matrix-variate observations. We resort to the matrix Kendall’s tau introduced in the recent literature to recover the loading spaces, and minimize the square loss function to estimate the factor scores. We also propose an eigenvalue-ratio method to estimate the pair of factor numbers. Thorough theories for the model estimation, including statistical consistency and rates of convergence, are established under regular conditions. It is worth highlighting that the proposed method exhibits superior performance compared to other methods for estimating the signal part, particularly in the heavy-tailed cases. This superiority has been thoroughly validated through extensive simulations. The effectiveness in matrix reconstruction of the proposed method is demonstrated by applying it to a macroeconomic dataset of China.
期刊介绍:
Founded in 1971, the Journal of Multivariate Analysis (JMVA) is the central venue for the publication of new, relevant methodology and particularly innovative applications pertaining to the analysis and interpretation of multidimensional data.
The journal welcomes contributions to all aspects of multivariate data analysis and modeling, including cluster analysis, discriminant analysis, factor analysis, and multidimensional continuous or discrete distribution theory. Topics of current interest include, but are not limited to, inferential aspects of
Copula modeling
Functional data analysis
Graphical modeling
High-dimensional data analysis
Image analysis
Multivariate extreme-value theory
Sparse modeling
Spatial statistics.