Liping Sun;Fan Huang;Xiaoyao Zheng;Liangmin Guo;Qingying Yu;Zhenghua Chen;Yonglong Luo
{"title":"Density Peaks Clustering Based on Label Propagation and K-Mutual-Nearest Neighbors","authors":"Liping Sun;Fan Huang;Xiaoyao Zheng;Liangmin Guo;Qingying Yu;Zhenghua Chen;Yonglong Luo","doi":"10.1109/TETCI.2024.3452687","DOIUrl":null,"url":null,"abstract":"The density peaks clustering algorithm is one of the density-based clustering algorithms. This algorithm has several advantages, including not requiring a preset number of clusters, requiring fewer parameters, and being able to achieve clustering of any shape. However, it also has limitations, such as poor clustering performance on datasets with uneven density, the need to manually select cluster centers on the decision graph, and a chain reaction that can lead to a large number of point misallocations due to incorrect allocation of individual points. To overcome the shortcomings of the density peaks clustering algorithm, we propose a density peaks clustering algorithm based on label propagation and k-mutual-nearest neighbors. First, the local density and the distance are defined by incorporating the concept of k-mutual-nearest neighbors to enhance clustering performance on datasets with uneven-density clusters. Second, an adaptive method for selecting cluster centers is proposed to avoid the manual selection of cluster centers. Third, an improved label propagation algorithm is used to assign all remaining points to solve the chain reaction problem. The experimental results show that our algorithm can accurately identify cluster centers and obtain high-quality clustering results on synthetic datasets with different characteristics, including datasets with uneven cluster density, convex datasets, manifold datasets, and datasets with inter-cluster contact. On different types of UCI datasets, including small datasets, high-dimensional datasets, and large datasets, our algorithm outperforms other comparative algorithms.","PeriodicalId":13135,"journal":{"name":"IEEE Transactions on Emerging Topics in Computational Intelligence","volume":"9 2","pages":"1830-1842"},"PeriodicalIF":5.3000,"publicationDate":"2024-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Emerging Topics in Computational Intelligence","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10670066/","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
The density peaks clustering algorithm is one of the density-based clustering algorithms. This algorithm has several advantages, including not requiring a preset number of clusters, requiring fewer parameters, and being able to achieve clustering of any shape. However, it also has limitations, such as poor clustering performance on datasets with uneven density, the need to manually select cluster centers on the decision graph, and a chain reaction that can lead to a large number of point misallocations due to incorrect allocation of individual points. To overcome the shortcomings of the density peaks clustering algorithm, we propose a density peaks clustering algorithm based on label propagation and k-mutual-nearest neighbors. First, the local density and the distance are defined by incorporating the concept of k-mutual-nearest neighbors to enhance clustering performance on datasets with uneven-density clusters. Second, an adaptive method for selecting cluster centers is proposed to avoid the manual selection of cluster centers. Third, an improved label propagation algorithm is used to assign all remaining points to solve the chain reaction problem. The experimental results show that our algorithm can accurately identify cluster centers and obtain high-quality clustering results on synthetic datasets with different characteristics, including datasets with uneven cluster density, convex datasets, manifold datasets, and datasets with inter-cluster contact. On different types of UCI datasets, including small datasets, high-dimensional datasets, and large datasets, our algorithm outperforms other comparative algorithms.
期刊介绍:
The IEEE Transactions on Emerging Topics in Computational Intelligence (TETCI) publishes original articles on emerging aspects of computational intelligence, including theory, applications, and surveys.
TETCI is an electronics only publication. TETCI publishes six issues per year.
Authors are encouraged to submit manuscripts in any emerging topic in computational intelligence, especially nature-inspired computing topics not covered by other IEEE Computational Intelligence Society journals. A few such illustrative examples are glial cell networks, computational neuroscience, Brain Computer Interface, ambient intelligence, non-fuzzy computing with words, artificial life, cultural learning, artificial endocrine networks, social reasoning, artificial hormone networks, computational intelligence for the IoT and Smart-X technologies.