{"title":"优化 K-means 算法:蚁群优化","authors":"T. Reddy, K. P. Supreethi","doi":"10.1109/ICCMC.2017.8282522","DOIUrl":null,"url":null,"abstract":"Significance of a versatile and simple clustering algorithm is becoming indispensable with the huge data growth in recent years. K-Means clustering is one such clustering algorithm which is simple yet elegant. But K-Means Algorithm has its disadvantages, dependence on the initial cluster centers and the algorithm tends to converge at a local minima. To overcome these disadvantages, ant colony optimization is applied to improve the traditional K-Means clustering algorithm. Two methods of using ants in K-Means are presented in the paper. In the first method the ant is allowed to go for a random walk and picks a data item. Pick and Drop probabilities of that particular data item are calculated. These values determine whether a data item remains in the same cluster or is moved to another cluster. In the second method instead of letting the ant pick up a data item randomly we calculate the pick and drop and let the ant walk to the data item which has the highest probability to be moved to another cluster. Entropy and F-measure are considered as quality measures.","PeriodicalId":163288,"journal":{"name":"2017 International Conference on Computing Methodologies and Communication (ICCMC)","volume":"38 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":"{\"title\":\"Optimization of K-means algorithm: Ant colony optimization\",\"authors\":\"T. Reddy, K. P. Supreethi\",\"doi\":\"10.1109/ICCMC.2017.8282522\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Significance of a versatile and simple clustering algorithm is becoming indispensable with the huge data growth in recent years. K-Means clustering is one such clustering algorithm which is simple yet elegant. But K-Means Algorithm has its disadvantages, dependence on the initial cluster centers and the algorithm tends to converge at a local minima. To overcome these disadvantages, ant colony optimization is applied to improve the traditional K-Means clustering algorithm. Two methods of using ants in K-Means are presented in the paper. In the first method the ant is allowed to go for a random walk and picks a data item. Pick and Drop probabilities of that particular data item are calculated. These values determine whether a data item remains in the same cluster or is moved to another cluster. In the second method instead of letting the ant pick up a data item randomly we calculate the pick and drop and let the ant walk to the data item which has the highest probability to be moved to another cluster. Entropy and F-measure are considered as quality measures.\",\"PeriodicalId\":163288,\"journal\":{\"name\":\"2017 International Conference on Computing Methodologies and Communication (ICCMC)\",\"volume\":\"38 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-07-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"11\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 International Conference on Computing Methodologies and Communication (ICCMC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCMC.2017.8282522\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 International Conference on Computing Methodologies and Communication (ICCMC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCMC.2017.8282522","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 11
摘要
近年来,随着大量数据的增长,一种通用而简单的聚类算法变得不可或缺。K-Means 聚类算法就是这样一种简单而优雅的聚类算法。但是,K-Means 算法也有其缺点,那就是对初始聚类中心的依赖性,而且该算法往往会收敛于局部最小值。为了克服这些缺点,蚁群优化被用来改进传统的 K-Means 聚类算法。本文介绍了在 K-Means 中使用蚂蚁的两种方法。第一种方法允许蚂蚁随机行走,并选取一个数据项。计算该特定数据项的选取和删除概率。这些值决定了数据项是留在同一聚类中还是被移动到另一个聚类中。在第二种方法中,我们不是让蚂蚁随机拾取一个数据项,而是计算拾取和丢弃概率,然后让蚂蚁走到被移动到另一个群集的概率最高的数据项处。熵(Entropy)和 F 值(F-measure)被视为质量度量。
Optimization of K-means algorithm: Ant colony optimization
Significance of a versatile and simple clustering algorithm is becoming indispensable with the huge data growth in recent years. K-Means clustering is one such clustering algorithm which is simple yet elegant. But K-Means Algorithm has its disadvantages, dependence on the initial cluster centers and the algorithm tends to converge at a local minima. To overcome these disadvantages, ant colony optimization is applied to improve the traditional K-Means clustering algorithm. Two methods of using ants in K-Means are presented in the paper. In the first method the ant is allowed to go for a random walk and picks a data item. Pick and Drop probabilities of that particular data item are calculated. These values determine whether a data item remains in the same cluster or is moved to another cluster. In the second method instead of letting the ant pick up a data item randomly we calculate the pick and drop and let the ant walk to the data item which has the highest probability to be moved to another cluster. Entropy and F-measure are considered as quality measures.