具有样本外扩展的归一化切共聚类

IF 7.6 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Pattern Recognition Pub Date : 2025-06-04 DOI:10.1016/j.patcog.2025.111881

Jingyu Wang , Mingqing Liu , Feiping Nie , Xuelong Li

{"title":"具有样本外扩展的归一化切共聚类","authors":"Jingyu Wang , Mingqing Liu , Feiping Nie , Xuelong Li","doi":"10.1016/j.patcog.2025.111881","DOIUrl":null,"url":null,"abstract":"<div><div>Co-clustering is a critical data mining technology in various real-world applications, where anchor-based methods reveal the dual relationships between samples and anchors. Due to the information loss caused by relaxation and post-processing, classical anchor-based methods suffer from potential performance degradation. To overcome this disadvantage, we propose a Normalized Cut Co-Clustering (<span><math><msup><mrow><mi>NC</mi></mrow><mrow><mn>3</mn></mrow></msup></math></span>) model, which assigns clusters for samples and anchors by alternatively updating the discrete label matrices. Different from traditional anchor-based co-clustering methods, our model solves the original discrete normalized cut problem on the bipartite graph directly. To address the discrete cut problem, an iterative coordinate ascent algorithm is presented, which can speed up the clustering process. Through optimization on the label matrices of samples and anchors, the clusters can be obtained without relaxation–discretization operation. Furthermore, the proposed <span><math><msup><mrow><mi>NC</mi></mrow><mrow><mn>3</mn></mrow></msup></math></span> model can tackle the out-of-sample clustering issue based on labels of anchors. Through extensive experiments, we validate the effectiveness of our model, achieving competitive results compared to state-of-the-art approaches.</div></div>","PeriodicalId":49713,"journal":{"name":"Pattern Recognition","volume":"169 ","pages":"Article 111881"},"PeriodicalIF":7.6000,"publicationDate":"2025-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Normalized cut co-clustering with out-of-sample extension\",\"authors\":\"Jingyu Wang , Mingqing Liu , Feiping Nie , Xuelong Li\",\"doi\":\"10.1016/j.patcog.2025.111881\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Co-clustering is a critical data mining technology in various real-world applications, where anchor-based methods reveal the dual relationships between samples and anchors. Due to the information loss caused by relaxation and post-processing, classical anchor-based methods suffer from potential performance degradation. To overcome this disadvantage, we propose a Normalized Cut Co-Clustering (<span><math><msup><mrow><mi>NC</mi></mrow><mrow><mn>3</mn></mrow></msup></math></span>) model, which assigns clusters for samples and anchors by alternatively updating the discrete label matrices. Different from traditional anchor-based co-clustering methods, our model solves the original discrete normalized cut problem on the bipartite graph directly. To address the discrete cut problem, an iterative coordinate ascent algorithm is presented, which can speed up the clustering process. Through optimization on the label matrices of samples and anchors, the clusters can be obtained without relaxation–discretization operation. Furthermore, the proposed <span><math><msup><mrow><mi>NC</mi></mrow><mrow><mn>3</mn></mrow></msup></math></span> model can tackle the out-of-sample clustering issue based on labels of anchors. Through extensive experiments, we validate the effectiveness of our model, achieving competitive results compared to state-of-the-art approaches.</div></div>\",\"PeriodicalId\":49713,\"journal\":{\"name\":\"Pattern Recognition\",\"volume\":\"169 \",\"pages\":\"Article 111881\"},\"PeriodicalIF\":7.6000,\"publicationDate\":\"2025-06-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Pattern Recognition\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0031320325005412\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pattern Recognition","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0031320325005412","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

摘要

在各种实际应用中，共聚类是一种关键的数据挖掘技术，其中基于锚点的方法揭示了样本和锚点之间的双重关系。由于松弛和后处理导致的信息丢失，经典的基于锚点的方法存在潜在的性能下降。为了克服这一缺点，我们提出了一种归一化切共聚类（NC3）模型，该模型通过交替更新离散标签矩阵来为样本和锚点分配聚类。与传统的基于锚点的共聚类方法不同，该模型直接解决了二部图上原有的离散归一化切问题。针对离散切割问题，提出了一种迭代坐标上升算法，提高了聚类过程的速度。通过对样本和锚点的标记矩阵进行优化，无需进行松弛离散化操作即可得到聚类。此外，本文提出的NC3模型可以解决基于锚点标签的样本外聚类问题。通过大量的实验，我们验证了我们模型的有效性，与最先进的方法相比，取得了具有竞争力的结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Normalized cut co-clustering with out-of-sample extension

Co-clustering is a critical data mining technology in various real-world applications, where anchor-based methods reveal the dual relationships between samples and anchors. Due to the information loss caused by relaxation and post-processing, classical anchor-based methods suffer from potential performance degradation. To overcome this disadvantage, we propose a Normalized Cut Co-Clustering (

{NC}^{3}

) model, which assigns clusters for samples and anchors by alternatively updating the discrete label matrices. Different from traditional anchor-based co-clustering methods, our model solves the original discrete normalized cut problem on the bipartite graph directly. To address the discrete cut problem, an iterative coordinate ascent algorithm is presented, which can speed up the clustering process. Through optimization on the label matrices of samples and anchors, the clusters can be obtained without relaxation–discretization operation. Furthermore, the proposed

{NC}^{3}

model can tackle the out-of-sample clustering issue based on labels of anchors. Through extensive experiments, we validate the effectiveness of our model, achieving competitive results compared to state-of-the-art approaches.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Pattern Recognition 工程技术-工程：电子与电气

CiteScore

14.40

自引率

16.20%

发文量

683

审稿时长

5.6 months

期刊介绍： The field of Pattern Recognition is both mature and rapidly evolving, playing a crucial role in various related fields such as computer vision, image processing, text analysis, and neural networks. It closely intersects with machine learning and is being applied in emerging areas like biometrics, bioinformatics, multimedia data analysis, and data science. The journal Pattern Recognition, established half a century ago during the early days of computer science, has since grown significantly in scope and influence.