The effect of cluster location and dataset size on 2-stage k-means algorithm

2011 10th International Workshop on Electronics, Control, Measurement and Signals Pub Date : 2011-06-01 DOI:10.1109/IWECMS.2011.5952377

R. Salman, V. Kecman

引用次数: 2

Abstract

Paper introduces the 2-stage k-means algorithm which is faster than the standard 1-stage k-means algorithm. The main idea of the 2-stages is to move, in the first stage (fast), the centers of the clusters closer to their final locations. This will be done by using a small part of the data to achieve faster calculation. The next stage (slow) stage will start from the centers found during the first stage (fast). Different initial locations of the clusters have been used while testing the algorithms here. With bigger datasets, it is shown that the 2-stage clustering method achieves better speed-up.

查看原文本刊更多论文

聚类位置和数据集大小对两阶段k-means算法的影响

本文介绍了比标准的1阶段k-means算法更快的2阶段k-means算法。这两个阶段的主要思想是在第一阶段(快速)移动，使星团的中心更靠近它们的最终位置。这将通过使用一小部分数据来实现更快的计算。下一阶段(慢)将从第一阶段(快)中发现的中心开始。在这里测试算法时，使用了不同的集群初始位置。对于较大的数据集，两阶段聚类方法获得了更好的提速效果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2011 10th International Workshop on Electronics, Control, Measurement and Signals

自引率

0.00%

发文量