An Effective Approach for Selecting Cluster Centroids for the k-means Algorithm using IABC Approach

2019 IEEE 18th International Conference on Cognitive Informatics & Cognitive Computing (ICCI*CC) Pub Date : 2019-07-01 DOI:10.1109/ICCICC46617.2019.9146077

M. Batchanaboyina, Nagaraju Devarakonda

{"title":"An Effective Approach for Selecting Cluster Centroids for the k-means Algorithm using IABC Approach","authors":"M. Batchanaboyina, Nagaraju Devarakonda","doi":"10.1109/ICCICC46617.2019.9146077","DOIUrl":null,"url":null,"abstract":"K-means is a popular grouping technique for unsupervised data. Though the technique is simple and potential it suffers from the ambiguity in the selection of k (number of clusters). The selection or initialization of k-cluster centroids significantly impacts the performance of the k-means. The distribution of data with the possibility of outliers may degrade the clustering strength. If the initialization of clustering is best fitted then the rest of the k-means process will save the execution time and can offer better clustering. A searching procedure is needed that learn the data under hand and decide the cluster initialization accordingly. ABC is a good technique for optimized search. In this paper, a study of different variants of ABC is undertaken in order to propose a new methodology named IABC. The methodology made use of bee intelligence for search optimization along with k-means and k-nearest neighborhood algorithms. The proposed approach adds the strength to the k-means approach in terms of optimum centroid initialization. Experiments are done on two popular datasets to compare the proposed approach with the existing techniques. The comparisons are made between the existing and proposed methods. Better and encouraging results are found. The proposed process is able to save a significant amount of time and can offer better and accurate clustering.","PeriodicalId":294902,"journal":{"name":"2019 IEEE 18th International Conference on Cognitive Informatics & Cognitive Computing (ICCI*CC)","volume":"421 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE 18th International Conference on Cognitive Informatics & Cognitive Computing (ICCI*CC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCICC46617.2019.9146077","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

Abstract

K-means is a popular grouping technique for unsupervised data. Though the technique is simple and potential it suffers from the ambiguity in the selection of k (number of clusters). The selection or initialization of k-cluster centroids significantly impacts the performance of the k-means. The distribution of data with the possibility of outliers may degrade the clustering strength. If the initialization of clustering is best fitted then the rest of the k-means process will save the execution time and can offer better clustering. A searching procedure is needed that learn the data under hand and decide the cluster initialization accordingly. ABC is a good technique for optimized search. In this paper, a study of different variants of ABC is undertaken in order to propose a new methodology named IABC. The methodology made use of bee intelligence for search optimization along with k-means and k-nearest neighborhood algorithms. The proposed approach adds the strength to the k-means approach in terms of optimum centroid initialization. Experiments are done on two popular datasets to compare the proposed approach with the existing techniques. The comparisons are made between the existing and proposed methods. Better and encouraging results are found. The proposed process is able to save a significant amount of time and can offer better and accurate clustering.

查看原文本刊更多论文

基于IABC方法的k-means聚类质心选择方法

K-means是一种流行的无监督数据分组技术。虽然该技术简单而有潜力，但它在k(簇数)的选择上存在模糊性。k-聚类质心的选择或初始化会显著影响k-均值的性能。存在异常值的数据分布可能会降低聚类强度。如果聚类的初始化是最好的拟合，那么剩下的k-means过程将节省执行时间，并可以提供更好的聚类。需要一个搜索过程来学习手头的数据，并据此决定簇的初始化。ABC是一种很好的优化搜索技术。本文对ABC的不同变体进行了研究，并提出了一种名为IABC的新方法。该方法利用蜜蜂智能进行搜索优化以及k-均值和k-最近邻算法。该方法在最佳质心初始化方面增加了k-means方法的强度。在两个流行的数据集上进行了实验，将所提出的方法与现有技术进行了比较。对现有方法和所提方法进行了比较。发现了更好和令人鼓舞的结果。所提出的过程能够节省大量的时间，并且能够提供更好和更准确的聚类。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2019 IEEE 18th International Conference on Cognitive Informatics & Cognitive Computing (ICCI*CC)

自引率

0.00%

发文量