{"title":"Evaluating the density parameter in density peak based clustering","authors":"Jian Hou, Wei-Xue Liu","doi":"10.1109/ICICIP.2016.7885878","DOIUrl":null,"url":null,"abstract":"The density peak based clustering algorithm is a simple yet effective clustering approach. This algorithm firstly calculates the local density of each data and the distance to the nearest neighbor with higher density. Based on the assumption that cluster centers are density peaks and they are relatively far from each other, this algorithm isolates the candidates of cluster centers from the non-center data. After the cluster centers are identified, the other data are assigned labels equaling to those of their nearest neighbors with higher density. In this way the clustering can be accomplished efficiently and clusters of arbitrary shapes can be obtained. The key of the density peak based clustering algorithm lies in the density calculation method. In this paper we study the influence of the data amount used in density calculation on the clustering results of the density peak based algorithm. As a result, we arrive at some conclusions on the selection of the data amount, which can be useful in applying this algorithm in real tasks.","PeriodicalId":226381,"journal":{"name":"2016 Seventh International Conference on Intelligent Control and Information Processing (ICICIP)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 Seventh International Conference on Intelligent Control and Information Processing (ICICIP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICICIP.2016.7885878","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
The density peak based clustering algorithm is a simple yet effective clustering approach. This algorithm firstly calculates the local density of each data and the distance to the nearest neighbor with higher density. Based on the assumption that cluster centers are density peaks and they are relatively far from each other, this algorithm isolates the candidates of cluster centers from the non-center data. After the cluster centers are identified, the other data are assigned labels equaling to those of their nearest neighbors with higher density. In this way the clustering can be accomplished efficiently and clusters of arbitrary shapes can be obtained. The key of the density peak based clustering algorithm lies in the density calculation method. In this paper we study the influence of the data amount used in density calculation on the clustering results of the density peak based algorithm. As a result, we arrive at some conclusions on the selection of the data amount, which can be useful in applying this algorithm in real tasks.