Spectral Averagely-dense Clustering Based on Dynamic Shared Nearest Neighbors

C. Yuan, L. Zhang
{"title":"Spectral Averagely-dense Clustering Based on Dynamic Shared Nearest Neighbors","authors":"C. Yuan, L. Zhang","doi":"10.1109/ICCIA49625.2020.00034","DOIUrl":null,"url":null,"abstract":"Spectral averagely-dense clustering is a clustering algorithm based on density, but it has the problem of being sensitive to the parameter ε. Aiming at the above problems, a spectral averagely-dense clustering based on dynamic shared nearest neighbors is put forward. Firstly, a similarity measures is constructed by combining self-tunning distance and shared nearest neighbors. Self-tunning distance can handle clusters of different density, and shared nearest neighbors can draw closer to the data in the same cluster and alienate the data in different clusters. Secondly, based on the sample distribution function, a method capable of self-adaptively determining the k-value of the shared nearest neighbors is proposed without setting the parameter k. Finally, the constructed similarity measure is used as the similarity measure of the fully connected graph. The ε-neighberhood graph of spectral averagely-dense clustering is replaced with the fully connected graph, which avoid setting the parameter ε. Through the experiments on artificial datasets and UCI datasets, the proposed algorithm is compared with the spectral averagelydense clustering and the standard spectral clustering. The experimental results show that the proposed algorithm not only avoids the problem of difficult selection of ε-neighberhood graph parameters, but also has better performance on the datasets.","PeriodicalId":237536,"journal":{"name":"2020 5th International Conference on Computational Intelligence and Applications (ICCIA)","volume":"69 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 5th International Conference on Computational Intelligence and Applications (ICCIA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCIA49625.2020.00034","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Spectral averagely-dense clustering is a clustering algorithm based on density, but it has the problem of being sensitive to the parameter ε. Aiming at the above problems, a spectral averagely-dense clustering based on dynamic shared nearest neighbors is put forward. Firstly, a similarity measures is constructed by combining self-tunning distance and shared nearest neighbors. Self-tunning distance can handle clusters of different density, and shared nearest neighbors can draw closer to the data in the same cluster and alienate the data in different clusters. Secondly, based on the sample distribution function, a method capable of self-adaptively determining the k-value of the shared nearest neighbors is proposed without setting the parameter k. Finally, the constructed similarity measure is used as the similarity measure of the fully connected graph. The ε-neighberhood graph of spectral averagely-dense clustering is replaced with the fully connected graph, which avoid setting the parameter ε. Through the experiments on artificial datasets and UCI datasets, the proposed algorithm is compared with the spectral averagelydense clustering and the standard spectral clustering. The experimental results show that the proposed algorithm not only avoids the problem of difficult selection of ε-neighberhood graph parameters, but also has better performance on the datasets.
基于动态共享近邻的谱平均密集聚类
谱平均密集聚类是一种基于密度的聚类算法,但存在对参数ε敏感的问题。针对上述问题,提出了一种基于动态共享近邻的谱平均密集聚类方法。首先,结合自调谐距离和共享近邻构造相似度度量;距离自调优可以处理不同密度的聚类,共享近邻可以拉近同一聚类中的数据,疏远不同聚类中的数据。其次,基于样本分布函数,在不设置参数k的情况下,提出了一种自适应确定共享近邻k值的方法。最后,将构造的相似测度作为全连通图的相似测度。将谱平均密集聚类的ε-邻域图替换为完全连通图,避免了参数ε的设置。通过在人工数据集和UCI数据集上的实验,将该算法与光谱平均密集聚类和标准光谱聚类进行了比较。实验结果表明,该算法不仅避免了ε-邻域图参数选择困难的问题,而且在数据集上具有较好的性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信