Trevor C. Vannoy, Jacob J. Senecal, Veronika Strnadová-Neeley
{"title":"Improved Subspace K-Means Performance via a Randomized Matrix Decomposition","authors":"Trevor C. Vannoy, Jacob J. Senecal, Veronika Strnadová-Neeley","doi":"10.1109/GlobalSIP45357.2019.8969298","DOIUrl":null,"url":null,"abstract":"Subspace clustering algorithms provide the capability to project a dataset onto bases that facilitate clustering. Proposed in 2017, the subspace k-means algorithm simultaneously performs clustering and dimensionality reduction with the goal of finding the optimal subspace for the cluster structure; this is accomplished by incorporating a trade-off between cluster and noise subspaces in the objective function. In this study, we improve subspace k-means by estimating a critical transformation matrix via a randomized eigenvalue decomposition. Our modification results in an order of magnitude runtime improvement on high dimensional data, while retaining the simplicity, interpretable subspace projections, and convergence guarantees of the original algorithm.","PeriodicalId":221378,"journal":{"name":"2019 IEEE Global Conference on Signal and Information Processing (GlobalSIP)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE Global Conference on Signal and Information Processing (GlobalSIP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/GlobalSIP45357.2019.8969298","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Subspace clustering algorithms provide the capability to project a dataset onto bases that facilitate clustering. Proposed in 2017, the subspace k-means algorithm simultaneously performs clustering and dimensionality reduction with the goal of finding the optimal subspace for the cluster structure; this is accomplished by incorporating a trade-off between cluster and noise subspaces in the objective function. In this study, we improve subspace k-means by estimating a critical transformation matrix via a randomized eigenvalue decomposition. Our modification results in an order of magnitude runtime improvement on high dimensional data, while retaining the simplicity, interpretable subspace projections, and convergence guarantees of the original algorithm.