K-means Optimization Algorithm for Solving Clustering Problem

2009 Second International Workshop on Knowledge Discovery and Data Mining Pub Date : 2009-01-23 DOI:10.1109/WKDD.2009.85

Jinxin Dong, Min-yong Qi

引用次数: 24

Abstract

The basic K-means is sensitive to the initial centre and easy to get stuck at local optimal value. To solve such problems, a new clustering algorithm is proposed based on simulated annealing. The algorithm views the clustering as optimization problem, the bisecting K-means splits the dataset into k clusters at first, and then run simulated annealing algorithm using the sum of distances between each pattern and its centre based on bisecting K-means as the aim function. To avoid the shortcomings of simulated annealing such as long computation time and low efficiency, a new data structure named sequence list is given. The experiment result shows the feasibility and validity of the proposed algorithm.

查看原文本刊更多论文

聚类问题的K-means优化算法

基本k均值对初始中心很敏感，容易卡在局部最优值。为了解决这类问题，提出了一种基于模拟退火的聚类算法。该算法将聚类问题视为优化问题，采用k均值平分法首先将数据集分成k个聚类，然后以基于k均值平分法的每个模式与其中心之间的距离之和作为目标函数，运行模拟退火算法。为了避免模拟退火算法计算时间长、效率低的缺点，提出了一种新的数据结构——序列表。实验结果表明了该算法的可行性和有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2009 Second International Workshop on Knowledge Discovery and Data Mining

自引率

0.00%

发文量