A K-Means Clustering Algorithm Based on Double Attributes of Objects

Tu Linli, D. Yanni, Chu Siyong
Published in: 2015 Seventh International Conference on Measuring Technology and Mechatronics Automation
Publication date: 2015-06-13
DOI: 10.1109/ICMTMA.2015.12 (https://doi.org/10.1109/ICMTMA.2015.12)
Citations: 3

Abstract

The K-means clustering algorithm plays an important role in data analysis, pattern recognition, image processing, and market research. The classical K-means algorithm selects its initial cluster centers at random, which makes the clustering results unstable. In this paper, after an in-depth study of the classical K-means algorithm, we propose a new K-means clustering algorithm based on double attributes of objects. The algorithm constructs a Huffman tree from the dissimilarity matrix generated from the high-density set, and then selects the initial cluster centers from the Huffman tree according to the value of K. This overcomes the main defect of the classical K-means algorithm, namely the unstable results caused by random selection of the initial cluster centers. The new algorithm is validated on two UCI data sets. The experimental results show that it stably selects high-quality initial cluster centers and thereby obtains better clustering results.
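The abstract does not spell out the procedure, so the following is only a minimal sketch of the general idea under stated assumptions, not the authors' implementation. It assumes Euclidean distance as the dissimilarity measure, uses mean distance to all other points as a stand-in for density, and builds the Huffman-style tree by repeatedly merging the two least dissimilar nodes until K nodes remain. The function name `select_initial_centers` and the parameter `density_quantile` are hypothetical.

```python
import numpy as np


def select_initial_centers(points, k, density_quantile=0.5):
    """Hypothetical sketch: choose k initial centers by Huffman-style merging."""
    points = np.asarray(points, dtype=float)

    # Assumption: Euclidean distance is the dissimilarity measure.
    diffs = points[:, None, :] - points[None, :, :]
    dist = np.sqrt((diffs ** 2).sum(axis=2))

    # Assumption: a point counts as "high density" when its mean distance
    # to all other points is at or below the chosen quantile.
    mean_dist = dist.sum(axis=1) / (len(points) - 1)
    threshold = np.quantile(mean_dist, density_quantile)
    dense = points[mean_dist <= threshold]
    if len(dense) < k:
        raise ValueError("high-density set has fewer than k points")

    # Each tree node is (centroid, number of points merged into it).
    nodes = [(p, 1) for p in dense]

    # Merge the two least dissimilar nodes until k nodes remain. Stopping
    # here is equivalent to building the full tree and cutting its last
    # k - 1 merges.
    while len(nodes) > k:
        centroids = np.array([c for c, _ in nodes])
        d = np.sqrt(((centroids[:, None] - centroids[None]) ** 2).sum(axis=2))
        np.fill_diagonal(d, np.inf)  # ignore self-distances
        i, j = np.unravel_index(np.argmin(d), d.shape)
        (ci, wi), (cj, wj) = nodes[i], nodes[j]
        merged = ((ci * wi + cj * wj) / (wi + wj), wi + wj)
        nodes = [n for idx, n in enumerate(nodes) if idx not in (i, j)]
        nodes.append(merged)

    return np.array([c for c, _ in nodes])
```

Because no random numbers are involved, the same data always yields the same initial centers, which is the stability property the abstract emphasizes. The returned array could then be used as the starting centers of any standard K-means implementation.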