Spectral Averagely-dense Clustering Based on Dynamic Shared Nearest Neighbors

2020 5th International Conference on Computational Intelligence and Applications (ICCIA) Pub Date : 2020-06-01 DOI:10.1109/ICCIA49625.2020.00034

C. Yuan, L. Zhang

{"title":"Spectral Averagely-dense Clustering Based on Dynamic Shared Nearest Neighbors","authors":"C. Yuan, L. Zhang","doi":"10.1109/ICCIA49625.2020.00034","DOIUrl":null,"url":null,"abstract":"Spectral averagely-dense clustering is a clustering algorithm based on density, but it has the problem of being sensitive to the parameter ε. Aiming at the above problems, a spectral averagely-dense clustering based on dynamic shared nearest neighbors is put forward. Firstly, a similarity measures is constructed by combining self-tunning distance and shared nearest neighbors. Self-tunning distance can handle clusters of different density, and shared nearest neighbors can draw closer to the data in the same cluster and alienate the data in different clusters. Secondly, based on the sample distribution function, a method capable of self-adaptively determining the k-value of the shared nearest neighbors is proposed without setting the parameter k. Finally, the constructed similarity measure is used as the similarity measure of the fully connected graph. The ε-neighberhood graph of spectral averagely-dense clustering is replaced with the fully connected graph, which avoid setting the parameter ε. Through the experiments on artificial datasets and UCI datasets, the proposed algorithm is compared with the spectral averagelydense clustering and the standard spectral clustering. The experimental results show that the proposed algorithm not only avoids the problem of difficult selection of ε-neighberhood graph parameters, but also has better performance on the datasets.","PeriodicalId":237536,"journal":{"name":"2020 5th International Conference on Computational Intelligence and Applications (ICCIA)","volume":"69 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 5th International Conference on Computational Intelligence and Applications (ICCIA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCIA49625.2020.00034","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

Spectral averagely-dense clustering is a clustering algorithm based on density, but it has the problem of being sensitive to the parameter ε. Aiming at the above problems, a spectral averagely-dense clustering based on dynamic shared nearest neighbors is put forward. Firstly, a similarity measures is constructed by combining self-tunning distance and shared nearest neighbors. Self-tunning distance can handle clusters of different density, and shared nearest neighbors can draw closer to the data in the same cluster and alienate the data in different clusters. Secondly, based on the sample distribution function, a method capable of self-adaptively determining the k-value of the shared nearest neighbors is proposed without setting the parameter k. Finally, the constructed similarity measure is used as the similarity measure of the fully connected graph. The ε-neighberhood graph of spectral averagely-dense clustering is replaced with the fully connected graph, which avoid setting the parameter ε. Through the experiments on artificial datasets and UCI datasets, the proposed algorithm is compared with the spectral averagelydense clustering and the standard spectral clustering. The experimental results show that the proposed algorithm not only avoids the problem of difficult selection of ε-neighberhood graph parameters, but also has better performance on the datasets.

查看原文本刊更多论文

基于动态共享近邻的谱平均密集聚类

谱平均密集聚类是一种基于密度的聚类算法，但存在对参数ε敏感的问题。针对上述问题，提出了一种基于动态共享近邻的谱平均密集聚类方法。首先，结合自调谐距离和共享近邻构造相似度度量;距离自调优可以处理不同密度的聚类，共享近邻可以拉近同一聚类中的数据，疏远不同聚类中的数据。其次，基于样本分布函数，在不设置参数k的情况下，提出了一种自适应确定共享近邻k值的方法。最后，将构造的相似测度作为全连通图的相似测度。将谱平均密集聚类的ε-邻域图替换为完全连通图，避免了参数ε的设置。通过在人工数据集和UCI数据集上的实验，将该算法与光谱平均密集聚类和标准光谱聚类进行了比较。实验结果表明，该算法不仅避免了ε-邻域图参数选择困难的问题，而且在数据集上具有较好的性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2020 5th International Conference on Computational Intelligence and Applications (ICCIA)

自引率

0.00%

发文量