{"title":"Metric Based Performance Analysis of Clustering Algorithms for High Dimensional Data","authors":"Smita Chormunge, S. Jena","doi":"10.1109/CSNT.2015.127","DOIUrl":null,"url":null,"abstract":"Cluster analysis is a main task of exploratory data mining and plays important role in many applications. There are numerous of clustering techniques in data mining works efficiently for low dimensional data and fails to handle high dimensional data. In this paper we evaluated the performance efficiency of K-means and Agglomerative hierarchical clustering methods based on Euclidean and Manhattan distance functions for high dimensional data by varying cluster values. Efficiency concerns the computational time required to build up datasets. Based on experimental work we examined that in both case of distance functions Agglomerative clustering method is efficient in time than K-means clustering algorithm on dataset, which we use for empirical study.","PeriodicalId":334733,"journal":{"name":"2015 Fifth International Conference on Communication Systems and Network Technologies","volume":"15 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-04-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 Fifth International Conference on Communication Systems and Network Technologies","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CSNT.2015.127","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
Cluster analysis is a main task of exploratory data mining and plays important role in many applications. There are numerous of clustering techniques in data mining works efficiently for low dimensional data and fails to handle high dimensional data. In this paper we evaluated the performance efficiency of K-means and Agglomerative hierarchical clustering methods based on Euclidean and Manhattan distance functions for high dimensional data by varying cluster values. Efficiency concerns the computational time required to build up datasets. Based on experimental work we examined that in both case of distance functions Agglomerative clustering method is efficient in time than K-means clustering algorithm on dataset, which we use for empirical study.