{"title":"A Unified Framework for Principal Subspace Analysis from the Hamiltonian Viewpoint","authors":"K. Yu","doi":"10.11159/icsta22.143","DOIUrl":null,"url":null,"abstract":"Extended Abstract In this work, we develop a general and unified framework for principal component analysis (PCA) applicable to Riemannian, sub-Riemannian and symplectic manifold-valued data and functional data. Almost all existing statistical methods for manifold data rely on the tangent bundle of the manifold, with the purpose of transforming the nonlinear manifold to the linear tangent spaces. However, such methods become invalid when the tangent vectors are constrained to lie in a subspace of the tangent space since the exponential map will no longer be a local diffeomorphism. This scenario, known as the sub-Riemannian geometry, has attracted considerable attention in recent years. We propose to shift the tangent space viewpoint and move towards the dual spaces of the tangent spaces, i.e., the cotangent spaces, and build subspaces based on initial covectors. More generally, motivated by the Arnold-Liouville theorem we propose the anchor-compatible identification for subspaces with first integrals (ACISFI), which constructs a properly nested sequence of subspaces as the fibres of a carefully chosen set of functionally independent functions defined on the cotangent bundle, i.e., the first integrals of the Hamiltonian system, generalising the ideas of obtaining subspaces from linearly independent tangent vectors [1] or from affinely independent points [2]. There are several advantages of the ACISFI over the existing PCA on manifolds. First, the subspaces can be learnt from sample points in a completely data-driven way. We do not impose a particular form for the Hamiltonian, e.g., the Hamiltonian which induces the geodesic flow, but it can be chosen as any smooth function on the manifold, and there is not any a priori restriction on the form of first integrals as well. Second, the submanifolds can be","PeriodicalId":325859,"journal":{"name":"Proceedings of the 4th International Conference on Statistics: Theory and Applications","volume":"58 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 4th International Conference on Statistics: Theory and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.11159/icsta22.143","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Extended Abstract In this work, we develop a general and unified framework for principal component analysis (PCA) applicable to Riemannian, sub-Riemannian and symplectic manifold-valued data and functional data. Almost all existing statistical methods for manifold data rely on the tangent bundle of the manifold, with the purpose of transforming the nonlinear manifold to the linear tangent spaces. However, such methods become invalid when the tangent vectors are constrained to lie in a subspace of the tangent space since the exponential map will no longer be a local diffeomorphism. This scenario, known as the sub-Riemannian geometry, has attracted considerable attention in recent years. We propose to shift the tangent space viewpoint and move towards the dual spaces of the tangent spaces, i.e., the cotangent spaces, and build subspaces based on initial covectors. More generally, motivated by the Arnold-Liouville theorem we propose the anchor-compatible identification for subspaces with first integrals (ACISFI), which constructs a properly nested sequence of subspaces as the fibres of a carefully chosen set of functionally independent functions defined on the cotangent bundle, i.e., the first integrals of the Hamiltonian system, generalising the ideas of obtaining subspaces from linearly independent tangent vectors [1] or from affinely independent points [2]. There are several advantages of the ACISFI over the existing PCA on manifolds. First, the subspaces can be learnt from sample points in a completely data-driven way. We do not impose a particular form for the Hamiltonian, e.g., the Hamiltonian which induces the geodesic flow, but it can be chosen as any smooth function on the manifold, and there is not any a priori restriction on the form of first integrals as well. Second, the submanifolds can be