Zitong Zhang , Xiaojun Chen , Chen Wang , Ruili Wang , Wei Song , Feiping Nie
{"title":"用于集合聚类的结构化双方图学习法","authors":"Zitong Zhang , Xiaojun Chen , Chen Wang , Ruili Wang , Wei Song , Feiping Nie","doi":"10.1016/j.patcog.2024.111133","DOIUrl":null,"url":null,"abstract":"<div><div>Given a set of base clustering results, conventional bipartite graph-based ensemble clustering methods typically require computing a sample-cluster similarity matrix from each base clustering result. These matrices are then either concatenated or averaged to form a bipartite weight matrix, which is used to create a bipartite graph. Graph-based partition techniques are subsequently applied to this graph to obtain the final clustering result. However, these methods often suffer from unreliable base clustering results, making it challenging to identify a clear cluster structure due to the variations in cluster structures across the base results. In this paper, we propose a novel Structured Bipartite Graph Learning (SBGL) method. Our approach begins by computing a sample-cluster similarity matrix from each base clustering result and constructing a base bipartite graph from each of these matrices. We assume these base bipartite graphs contain a set of latent clusters and project them into a set of sample-latent-cluster bipartite graphs. These new graphs are then ensembled into a bipartite graph with a distinct cluster structure, from which the final set of clusters is derived. Our method allows for different numbers of clusters across base clusterings, leading to improved performance. Experimental results on both synthetic and real-world datasets demonstrate the superior performance of our new method.</div></div>","PeriodicalId":49713,"journal":{"name":"Pattern Recognition","volume":"160 ","pages":"Article 111133"},"PeriodicalIF":7.5000,"publicationDate":"2024-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Structured Bipartite Graph Learning method for ensemble clustering\",\"authors\":\"Zitong Zhang , Xiaojun Chen , Chen Wang , Ruili Wang , Wei Song , Feiping Nie\",\"doi\":\"10.1016/j.patcog.2024.111133\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Given a set of base clustering results, conventional bipartite graph-based ensemble clustering methods typically require computing a sample-cluster similarity matrix from each base clustering result. These matrices are then either concatenated or averaged to form a bipartite weight matrix, which is used to create a bipartite graph. Graph-based partition techniques are subsequently applied to this graph to obtain the final clustering result. However, these methods often suffer from unreliable base clustering results, making it challenging to identify a clear cluster structure due to the variations in cluster structures across the base results. In this paper, we propose a novel Structured Bipartite Graph Learning (SBGL) method. Our approach begins by computing a sample-cluster similarity matrix from each base clustering result and constructing a base bipartite graph from each of these matrices. We assume these base bipartite graphs contain a set of latent clusters and project them into a set of sample-latent-cluster bipartite graphs. These new graphs are then ensembled into a bipartite graph with a distinct cluster structure, from which the final set of clusters is derived. Our method allows for different numbers of clusters across base clusterings, leading to improved performance. Experimental results on both synthetic and real-world datasets demonstrate the superior performance of our new method.</div></div>\",\"PeriodicalId\":49713,\"journal\":{\"name\":\"Pattern Recognition\",\"volume\":\"160 \",\"pages\":\"Article 111133\"},\"PeriodicalIF\":7.5000,\"publicationDate\":\"2024-11-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Pattern Recognition\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0031320324008847\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pattern Recognition","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0031320324008847","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
A Structured Bipartite Graph Learning method for ensemble clustering
Given a set of base clustering results, conventional bipartite graph-based ensemble clustering methods typically require computing a sample-cluster similarity matrix from each base clustering result. These matrices are then either concatenated or averaged to form a bipartite weight matrix, which is used to create a bipartite graph. Graph-based partition techniques are subsequently applied to this graph to obtain the final clustering result. However, these methods often suffer from unreliable base clustering results, making it challenging to identify a clear cluster structure due to the variations in cluster structures across the base results. In this paper, we propose a novel Structured Bipartite Graph Learning (SBGL) method. Our approach begins by computing a sample-cluster similarity matrix from each base clustering result and constructing a base bipartite graph from each of these matrices. We assume these base bipartite graphs contain a set of latent clusters and project them into a set of sample-latent-cluster bipartite graphs. These new graphs are then ensembled into a bipartite graph with a distinct cluster structure, from which the final set of clusters is derived. Our method allows for different numbers of clusters across base clusterings, leading to improved performance. Experimental results on both synthetic and real-world datasets demonstrate the superior performance of our new method.
期刊介绍:
The field of Pattern Recognition is both mature and rapidly evolving, playing a crucial role in various related fields such as computer vision, image processing, text analysis, and neural networks. It closely intersects with machine learning and is being applied in emerging areas like biometrics, bioinformatics, multimedia data analysis, and data science. The journal Pattern Recognition, established half a century ago during the early days of computer science, has since grown significantly in scope and influence.