具有忠实语义的张量推导大规模多视图子空间聚类技术

IF 3 3区 计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC
Sujia Huang;Shide Du;Lele Fu;Zhihao Wu;Shiping Wang
{"title":"具有忠实语义的张量推导大规模多视图子空间聚类技术","authors":"Sujia Huang;Shide Du;Lele Fu;Zhihao Wu;Shiping Wang","doi":"10.1109/TSIPN.2024.3414134","DOIUrl":null,"url":null,"abstract":"Multi-view subspace clustering is extensively investigated for its ability to extract essential information from multiple data. However, tensor-based methods often encounter several limitations: 1) They suffer from high computational complexity due to the construction of a global affinity matrix; 2) The sophisticated semantic information among samples remains under-explored. To address these issues, we propose a comprehensive framework called tensor-derived large-scale multi-view subspace clustering with faithful semantics, which replaces the original graph with a trustworthy anchor graph. In particular, a graph-optimization-based anchor selection strategy is designed to obtain salient points, and thus the anchor graph is computed to decrease the computational complexity of constructing the representation matrix. Subsequently, a refinement approach is designed to flexibly extract essential semantics between nodes by dividing the graph into significant components and undesired connections. These matrices preserving important information are fused into a tensor that is constrained by a nuclear norm to retain its low-rank property. Meanwhile, the undesired links should be eliminated to avoid confusing the clustering results. Finally, the spectral embedding is employed to directly guide the learning of anchors and graphs. The proposed model achieves a remarkable improvement of 3.3% and 13.1% of ACC on the NoisyMNIST and Prokaryotic datasets while reducing high computational complexity compared to other subspace-based clustering approaches.","PeriodicalId":56268,"journal":{"name":"IEEE Transactions on Signal and Information Processing over Networks","volume":"10 ","pages":"584-598"},"PeriodicalIF":3.0000,"publicationDate":"2024-06-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Tensor-Derived Large-Scale Multi-View Subspace Clustering With Faithful Semantics\",\"authors\":\"Sujia Huang;Shide Du;Lele Fu;Zhihao Wu;Shiping Wang\",\"doi\":\"10.1109/TSIPN.2024.3414134\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Multi-view subspace clustering is extensively investigated for its ability to extract essential information from multiple data. However, tensor-based methods often encounter several limitations: 1) They suffer from high computational complexity due to the construction of a global affinity matrix; 2) The sophisticated semantic information among samples remains under-explored. To address these issues, we propose a comprehensive framework called tensor-derived large-scale multi-view subspace clustering with faithful semantics, which replaces the original graph with a trustworthy anchor graph. In particular, a graph-optimization-based anchor selection strategy is designed to obtain salient points, and thus the anchor graph is computed to decrease the computational complexity of constructing the representation matrix. Subsequently, a refinement approach is designed to flexibly extract essential semantics between nodes by dividing the graph into significant components and undesired connections. These matrices preserving important information are fused into a tensor that is constrained by a nuclear norm to retain its low-rank property. Meanwhile, the undesired links should be eliminated to avoid confusing the clustering results. Finally, the spectral embedding is employed to directly guide the learning of anchors and graphs. The proposed model achieves a remarkable improvement of 3.3% and 13.1% of ACC on the NoisyMNIST and Prokaryotic datasets while reducing high computational complexity compared to other subspace-based clustering approaches.\",\"PeriodicalId\":56268,\"journal\":{\"name\":\"IEEE Transactions on Signal and Information Processing over Networks\",\"volume\":\"10 \",\"pages\":\"584-598\"},\"PeriodicalIF\":3.0000,\"publicationDate\":\"2024-06-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Signal and Information Processing over Networks\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10556808/\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Signal and Information Processing over Networks","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10556808/","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

摘要

多视角子空间聚类因其能够从多个数据中提取重要信息而受到广泛研究。然而,基于张量的方法往往会遇到一些限制:1) 由于需要构建全局亲和矩阵,因此计算复杂度较高;2) 样本间复杂的语义信息仍未得到充分挖掘。为了解决这些问题,我们提出了一个名为 "具有忠实语义的张量衍生大规模多视角子空间聚类 "的综合框架,用可信的锚图取代原始图。具体来说,我们设计了一种基于图优化的锚选择策略来获取突出点,从而计算出锚图以降低构建表示矩阵的计算复杂度。随后,设计了一种细化方法,通过将图划分为重要组成部分和不需要的连接,灵活提取节点之间的重要语义。这些保留重要信息的矩阵被融合成一个张量,该张量受核规范约束,以保持其低秩属性。同时,为了避免聚类结果混淆,应剔除不需要的连接。最后,利用谱嵌入直接指导锚点和图的学习。与其他基于子空间的聚类方法相比,所提出的模型在 NoisyMNIST 和 Prokaryotic 数据集上的 ACC 分别提高了 3.3% 和 13.1%,同时降低了较高的计算复杂度。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Tensor-Derived Large-Scale Multi-View Subspace Clustering With Faithful Semantics
Multi-view subspace clustering is extensively investigated for its ability to extract essential information from multiple data. However, tensor-based methods often encounter several limitations: 1) They suffer from high computational complexity due to the construction of a global affinity matrix; 2) The sophisticated semantic information among samples remains under-explored. To address these issues, we propose a comprehensive framework called tensor-derived large-scale multi-view subspace clustering with faithful semantics, which replaces the original graph with a trustworthy anchor graph. In particular, a graph-optimization-based anchor selection strategy is designed to obtain salient points, and thus the anchor graph is computed to decrease the computational complexity of constructing the representation matrix. Subsequently, a refinement approach is designed to flexibly extract essential semantics between nodes by dividing the graph into significant components and undesired connections. These matrices preserving important information are fused into a tensor that is constrained by a nuclear norm to retain its low-rank property. Meanwhile, the undesired links should be eliminated to avoid confusing the clustering results. Finally, the spectral embedding is employed to directly guide the learning of anchors and graphs. The proposed model achieves a remarkable improvement of 3.3% and 13.1% of ACC on the NoisyMNIST and Prokaryotic datasets while reducing high computational complexity compared to other subspace-based clustering approaches.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
IEEE Transactions on Signal and Information Processing over Networks
IEEE Transactions on Signal and Information Processing over Networks Computer Science-Computer Networks and Communications
CiteScore
5.80
自引率
12.50%
发文量
56
期刊介绍: The IEEE Transactions on Signal and Information Processing over Networks publishes high-quality papers that extend the classical notions of processing of signals defined over vector spaces (e.g. time and space) to processing of signals and information (data) defined over networks, potentially dynamically varying. In signal processing over networks, the topology of the network may define structural relationships in the data, or may constrain processing of the data. Topics include distributed algorithms for filtering, detection, estimation, adaptation and learning, model selection, data fusion, and diffusion or evolution of information over such networks, and applications of distributed signal processing.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信