{"title":"High frequency factor analysis with partially observable factors","authors":"Dachuan Chen , Wenqi Lu , Siyu Xie","doi":"10.1016/j.jeconom.2025.106058","DOIUrl":null,"url":null,"abstract":"<div><div>This paper considers a novel factor structure – <em>Partially Observable Factor Model</em> – where both observable factors and latent factors exist in the model simultaneously. Such factor structure can make sure both interpretability and goodness-of-fit at the same time. Necessary estimation methodologies for this partially observable factor model are developed in this paper for the high frequency data. The proposed estimation methodology is robust to jumps, microstructure noise and asynchronous observation times simultaneously.</div><div>When the observable factors are exogenous, we provide the estimation theory for the integrated eigenvalues of the residual covariance matrix, which including the bias-corrected estimator, central limit theorem and asymptotic variance estimator. As a result, the asymptotic normality of the bias-corrected estimator can be applied to test the existence of the latent factors.</div><div>When the observable factors are endogenous, we propose a novel framework of high frequency unsupervised exogenous component learning (HF-UECL), which can help people quantify the contributions of the observable factors into the latent factors. This is the first work on high frequency instrumental variables, and it can be regard as a necessary and non-trivial extension of the Projected-PCA in the world of continuous-time model. Statistical inferences have been established for the loadings of the observable factors onto the latent factors.</div><div>Monte Carlo simulation demonstrates the validity of our estimation methodologies. Empirical study demonstrates that (i) in the exogenous setting, the latent factors significantly exist in the residual process of the high frequency regression; (ii) in the endogenous setting, the correlations between the observable factors and latent factors do exist significantly.</div></div>","PeriodicalId":15629,"journal":{"name":"Journal of Econometrics","volume":"251 ","pages":"Article 106058"},"PeriodicalIF":9.9000,"publicationDate":"2025-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Econometrics","FirstCategoryId":"96","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0304407625001125","RegionNum":3,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ECONOMICS","Score":null,"Total":0}
引用次数: 0
Abstract
This paper considers a novel factor structure – Partially Observable Factor Model – where both observable factors and latent factors exist in the model simultaneously. Such factor structure can make sure both interpretability and goodness-of-fit at the same time. Necessary estimation methodologies for this partially observable factor model are developed in this paper for the high frequency data. The proposed estimation methodology is robust to jumps, microstructure noise and asynchronous observation times simultaneously.
When the observable factors are exogenous, we provide the estimation theory for the integrated eigenvalues of the residual covariance matrix, which including the bias-corrected estimator, central limit theorem and asymptotic variance estimator. As a result, the asymptotic normality of the bias-corrected estimator can be applied to test the existence of the latent factors.
When the observable factors are endogenous, we propose a novel framework of high frequency unsupervised exogenous component learning (HF-UECL), which can help people quantify the contributions of the observable factors into the latent factors. This is the first work on high frequency instrumental variables, and it can be regard as a necessary and non-trivial extension of the Projected-PCA in the world of continuous-time model. Statistical inferences have been established for the loadings of the observable factors onto the latent factors.
Monte Carlo simulation demonstrates the validity of our estimation methodologies. Empirical study demonstrates that (i) in the exogenous setting, the latent factors significantly exist in the residual process of the high frequency regression; (ii) in the endogenous setting, the correlations between the observable factors and latent factors do exist significantly.
期刊介绍:
The Journal of Econometrics serves as an outlet for important, high quality, new research in both theoretical and applied econometrics. The scope of the Journal includes papers dealing with identification, estimation, testing, decision, and prediction issues encountered in economic research. Classical Bayesian statistics, and machine learning methods, are decidedly within the range of the Journal''s interests. The Annals of Econometrics is a supplement to the Journal of Econometrics.