{"title":"Modified K-means algorithm for automatic stimation of number of clusters using advanced visual assessment of cluster tendency","authors":"D. Sharmilarani, N. Kousika, G Komarasamy","doi":"10.1109/ISCO.2014.7103951","DOIUrl":null,"url":null,"abstract":"One of the major problems in cluster analysis is the determination of the number of clusters in unlabeled data, which is a basic input for most clustering algorithms. In our paper, we investigate a new method for automatically estimating the number of clusters in unlabeled data sets, which is based on an existing algorithm for Spectral Visual Assessment of Cluster Tendency (SpecVAT) of a data set, using several common image and signal processing techniques. Its basic steps include 1) generating a VAT image of an input dissimilarity matrix, 2) Constructing Laplacian matrix 3) Normalize the rows and 4) Apply SpecVAT. Our new method is nearly “automatic,” depending on just one easy-to-set parameter. In this paper we propose direct visual validation method and divergence matrix for finding the automatic clustering. The experimental result shows that the proposed algorithm is much better than the other algorithms.","PeriodicalId":119329,"journal":{"name":"2014 IEEE 8th International Conference on Intelligent Systems and Control (ISCO)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 IEEE 8th International Conference on Intelligent Systems and Control (ISCO)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISCO.2014.7103951","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
One of the major problems in cluster analysis is the determination of the number of clusters in unlabeled data, which is a basic input for most clustering algorithms. In our paper, we investigate a new method for automatically estimating the number of clusters in unlabeled data sets, which is based on an existing algorithm for Spectral Visual Assessment of Cluster Tendency (SpecVAT) of a data set, using several common image and signal processing techniques. Its basic steps include 1) generating a VAT image of an input dissimilarity matrix, 2) Constructing Laplacian matrix 3) Normalize the rows and 4) Apply SpecVAT. Our new method is nearly “automatic,” depending on just one easy-to-set parameter. In this paper we propose direct visual validation method and divergence matrix for finding the automatic clustering. The experimental result shows that the proposed algorithm is much better than the other algorithms.