Improved Subspace K-Means Performance via a Randomized Matrix Decomposition

2019 IEEE Global Conference on Signal and Information Processing (GlobalSIP) Pub Date : 2019-11-01 DOI:10.1109/GlobalSIP45357.2019.8969298

Trevor C. Vannoy, Jacob J. Senecal, Veronika Strnadová-Neeley

引用次数: 0

Abstract

Subspace clustering algorithms provide the capability to project a dataset onto bases that facilitate clustering. Proposed in 2017, the subspace k-means algorithm simultaneously performs clustering and dimensionality reduction with the goal of finding the optimal subspace for the cluster structure; this is accomplished by incorporating a trade-off between cluster and noise subspaces in the objective function. In this study, we improve subspace k-means by estimating a critical transformation matrix via a randomized eigenvalue decomposition. Our modification results in an order of magnitude runtime improvement on high dimensional data, while retaining the simplicity, interpretable subspace projections, and convergence guarantees of the original algorithm.

查看原文本刊更多论文

基于随机矩阵分解的改进子空间k -均值性能

子空间聚类算法提供了将数据集投影到便于聚类的基上的能力。2017年提出的子空间k-means算法同时进行聚类和降维，目标是为聚类结构找到最优子空间;这是通过在目标函数中结合聚类和噪声子空间之间的权衡来实现的。在本研究中，我们通过随机特征值分解估计临界变换矩阵来改进子空间k-means。我们的改进在高维数据上的运行时间提高了一个数量级，同时保留了原始算法的简单性、可解释的子空间投影和收敛性保证。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2019 IEEE Global Conference on Signal and Information Processing (GlobalSIP)

自引率

0.00%

发文量