具有样本外扩展的归一化切共聚类

IF 7.6 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Jingyu Wang , Mingqing Liu , Feiping Nie , Xuelong Li
{"title":"具有样本外扩展的归一化切共聚类","authors":"Jingyu Wang ,&nbsp;Mingqing Liu ,&nbsp;Feiping Nie ,&nbsp;Xuelong Li","doi":"10.1016/j.patcog.2025.111881","DOIUrl":null,"url":null,"abstract":"<div><div>Co-clustering is a critical data mining technology in various real-world applications, where anchor-based methods reveal the dual relationships between samples and anchors. Due to the information loss caused by relaxation and post-processing, classical anchor-based methods suffer from potential performance degradation. To overcome this disadvantage, we propose a Normalized Cut Co-Clustering (<span><math><msup><mrow><mi>NC</mi></mrow><mrow><mn>3</mn></mrow></msup></math></span>) model, which assigns clusters for samples and anchors by alternatively updating the discrete label matrices. Different from traditional anchor-based co-clustering methods, our model solves the original discrete normalized cut problem on the bipartite graph directly. To address the discrete cut problem, an iterative coordinate ascent algorithm is presented, which can speed up the clustering process. Through optimization on the label matrices of samples and anchors, the clusters can be obtained without relaxation–discretization operation. Furthermore, the proposed <span><math><msup><mrow><mi>NC</mi></mrow><mrow><mn>3</mn></mrow></msup></math></span> model can tackle the out-of-sample clustering issue based on labels of anchors. Through extensive experiments, we validate the effectiveness of our model, achieving competitive results compared to state-of-the-art approaches.</div></div>","PeriodicalId":49713,"journal":{"name":"Pattern Recognition","volume":"169 ","pages":"Article 111881"},"PeriodicalIF":7.6000,"publicationDate":"2025-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Normalized cut co-clustering with out-of-sample extension\",\"authors\":\"Jingyu Wang ,&nbsp;Mingqing Liu ,&nbsp;Feiping Nie ,&nbsp;Xuelong Li\",\"doi\":\"10.1016/j.patcog.2025.111881\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Co-clustering is a critical data mining technology in various real-world applications, where anchor-based methods reveal the dual relationships between samples and anchors. Due to the information loss caused by relaxation and post-processing, classical anchor-based methods suffer from potential performance degradation. To overcome this disadvantage, we propose a Normalized Cut Co-Clustering (<span><math><msup><mrow><mi>NC</mi></mrow><mrow><mn>3</mn></mrow></msup></math></span>) model, which assigns clusters for samples and anchors by alternatively updating the discrete label matrices. Different from traditional anchor-based co-clustering methods, our model solves the original discrete normalized cut problem on the bipartite graph directly. To address the discrete cut problem, an iterative coordinate ascent algorithm is presented, which can speed up the clustering process. Through optimization on the label matrices of samples and anchors, the clusters can be obtained without relaxation–discretization operation. Furthermore, the proposed <span><math><msup><mrow><mi>NC</mi></mrow><mrow><mn>3</mn></mrow></msup></math></span> model can tackle the out-of-sample clustering issue based on labels of anchors. Through extensive experiments, we validate the effectiveness of our model, achieving competitive results compared to state-of-the-art approaches.</div></div>\",\"PeriodicalId\":49713,\"journal\":{\"name\":\"Pattern Recognition\",\"volume\":\"169 \",\"pages\":\"Article 111881\"},\"PeriodicalIF\":7.6000,\"publicationDate\":\"2025-06-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Pattern Recognition\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0031320325005412\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pattern Recognition","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0031320325005412","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

摘要

在各种实际应用中,共聚类是一种关键的数据挖掘技术,其中基于锚点的方法揭示了样本和锚点之间的双重关系。由于松弛和后处理导致的信息丢失,经典的基于锚点的方法存在潜在的性能下降。为了克服这一缺点,我们提出了一种归一化切共聚类(NC3)模型,该模型通过交替更新离散标签矩阵来为样本和锚点分配聚类。与传统的基于锚点的共聚类方法不同,该模型直接解决了二部图上原有的离散归一化切问题。针对离散切割问题,提出了一种迭代坐标上升算法,提高了聚类过程的速度。通过对样本和锚点的标记矩阵进行优化,无需进行松弛离散化操作即可得到聚类。此外,本文提出的NC3模型可以解决基于锚点标签的样本外聚类问题。通过大量的实验,我们验证了我们模型的有效性,与最先进的方法相比,取得了具有竞争力的结果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Normalized cut co-clustering with out-of-sample extension
Co-clustering is a critical data mining technology in various real-world applications, where anchor-based methods reveal the dual relationships between samples and anchors. Due to the information loss caused by relaxation and post-processing, classical anchor-based methods suffer from potential performance degradation. To overcome this disadvantage, we propose a Normalized Cut Co-Clustering (NC3) model, which assigns clusters for samples and anchors by alternatively updating the discrete label matrices. Different from traditional anchor-based co-clustering methods, our model solves the original discrete normalized cut problem on the bipartite graph directly. To address the discrete cut problem, an iterative coordinate ascent algorithm is presented, which can speed up the clustering process. Through optimization on the label matrices of samples and anchors, the clusters can be obtained without relaxation–discretization operation. Furthermore, the proposed NC3 model can tackle the out-of-sample clustering issue based on labels of anchors. Through extensive experiments, we validate the effectiveness of our model, achieving competitive results compared to state-of-the-art approaches.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Pattern Recognition
Pattern Recognition 工程技术-工程:电子与电气
CiteScore
14.40
自引率
16.20%
发文量
683
审稿时长
5.6 months
期刊介绍: The field of Pattern Recognition is both mature and rapidly evolving, playing a crucial role in various related fields such as computer vision, image processing, text analysis, and neural networks. It closely intersects with machine learning and is being applied in emerging areas like biometrics, bioinformatics, multimedia data analysis, and data science. The journal Pattern Recognition, established half a century ago during the early days of computer science, has since grown significantly in scope and influence.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信