A K-Means Clustering Algorithm Based on Double Attributes of Objects

2015 Seventh International Conference on Measuring Technology and Mechatronics Automation Pub Date : 2015-06-13 DOI:10.1109/ICMTMA.2015.12

Tu Linli, D. Yanni, Chu Siyong

引用次数: 3

Abstract

The K-means clustering algorithm have played an important role in the data analysis, pattern recognition, image processing, and market research. Classical K-means algorithm randomly selected initial cluster centers, so that the clustering results unstable. In this paper, through deeply study on classical k-means algorithm, we proposed a new K - means algorithm of Clustering based on double attributes of objects. The algorithm is based on the dissimilarity degree matrix which generated by high density set to construct the Huffman tree, and then according to K value to select initial cluster centers points in the Huffman tree, using this method effectively overcomes the defects of classical K-means algorithm for clustering random selection caused the initial cluster centers result unstable defects. In this paper, the new algorithm uses two UCI data sets to validate. The results of experiment show that the new k-means algorithm can choose the initial cluster center of high quality stable, so as to get better clustering results.

查看原文本刊更多论文

基于对象双属性的K-Means聚类算法

K-means聚类算法在数据分析、模式识别、图像处理和市场研究中发挥了重要作用。经典K-means算法随机选择初始聚类中心，使得聚类结果不稳定。本文通过对经典K -means算法的深入研究，提出了一种新的基于对象双属性的K -means聚类算法。该算法基于高密度集生成的不相似度矩阵构造Huffman树，然后根据K值在Huffman树中选择初始聚类中心点，利用该方法有效克服了经典K-means算法进行聚类随机选择导致初始聚类中心结果不稳定的缺陷。本文采用两个UCI数据集对新算法进行验证。实验结果表明，新的k-means算法可以选择质量稳定的初始聚类中心，从而获得较好的聚类结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2015 Seventh International Conference on Measuring Technology and Mechatronics Automation

自引率

0.00%

发文量