Sujia Huang;Shide Du;Lele Fu;Zhihao Wu;Shiping Wang
{"title":"具有忠实语义的张量推导大规模多视图子空间聚类技术","authors":"Sujia Huang;Shide Du;Lele Fu;Zhihao Wu;Shiping Wang","doi":"10.1109/TSIPN.2024.3414134","DOIUrl":null,"url":null,"abstract":"Multi-view subspace clustering is extensively investigated for its ability to extract essential information from multiple data. However, tensor-based methods often encounter several limitations: 1) They suffer from high computational complexity due to the construction of a global affinity matrix; 2) The sophisticated semantic information among samples remains under-explored. To address these issues, we propose a comprehensive framework called tensor-derived large-scale multi-view subspace clustering with faithful semantics, which replaces the original graph with a trustworthy anchor graph. In particular, a graph-optimization-based anchor selection strategy is designed to obtain salient points, and thus the anchor graph is computed to decrease the computational complexity of constructing the representation matrix. Subsequently, a refinement approach is designed to flexibly extract essential semantics between nodes by dividing the graph into significant components and undesired connections. These matrices preserving important information are fused into a tensor that is constrained by a nuclear norm to retain its low-rank property. Meanwhile, the undesired links should be eliminated to avoid confusing the clustering results. Finally, the spectral embedding is employed to directly guide the learning of anchors and graphs. The proposed model achieves a remarkable improvement of 3.3% and 13.1% of ACC on the NoisyMNIST and Prokaryotic datasets while reducing high computational complexity compared to other subspace-based clustering approaches.","PeriodicalId":56268,"journal":{"name":"IEEE Transactions on Signal and Information Processing over Networks","volume":"10 ","pages":"584-598"},"PeriodicalIF":3.0000,"publicationDate":"2024-06-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Tensor-Derived Large-Scale Multi-View Subspace Clustering With Faithful Semantics\",\"authors\":\"Sujia Huang;Shide Du;Lele Fu;Zhihao Wu;Shiping Wang\",\"doi\":\"10.1109/TSIPN.2024.3414134\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Multi-view subspace clustering is extensively investigated for its ability to extract essential information from multiple data. However, tensor-based methods often encounter several limitations: 1) They suffer from high computational complexity due to the construction of a global affinity matrix; 2) The sophisticated semantic information among samples remains under-explored. To address these issues, we propose a comprehensive framework called tensor-derived large-scale multi-view subspace clustering with faithful semantics, which replaces the original graph with a trustworthy anchor graph. In particular, a graph-optimization-based anchor selection strategy is designed to obtain salient points, and thus the anchor graph is computed to decrease the computational complexity of constructing the representation matrix. Subsequently, a refinement approach is designed to flexibly extract essential semantics between nodes by dividing the graph into significant components and undesired connections. These matrices preserving important information are fused into a tensor that is constrained by a nuclear norm to retain its low-rank property. Meanwhile, the undesired links should be eliminated to avoid confusing the clustering results. Finally, the spectral embedding is employed to directly guide the learning of anchors and graphs. The proposed model achieves a remarkable improvement of 3.3% and 13.1% of ACC on the NoisyMNIST and Prokaryotic datasets while reducing high computational complexity compared to other subspace-based clustering approaches.\",\"PeriodicalId\":56268,\"journal\":{\"name\":\"IEEE Transactions on Signal and Information Processing over Networks\",\"volume\":\"10 \",\"pages\":\"584-598\"},\"PeriodicalIF\":3.0000,\"publicationDate\":\"2024-06-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Signal and Information Processing over Networks\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10556808/\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Signal and Information Processing over Networks","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10556808/","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
Tensor-Derived Large-Scale Multi-View Subspace Clustering With Faithful Semantics
Multi-view subspace clustering is extensively investigated for its ability to extract essential information from multiple data. However, tensor-based methods often encounter several limitations: 1) They suffer from high computational complexity due to the construction of a global affinity matrix; 2) The sophisticated semantic information among samples remains under-explored. To address these issues, we propose a comprehensive framework called tensor-derived large-scale multi-view subspace clustering with faithful semantics, which replaces the original graph with a trustworthy anchor graph. In particular, a graph-optimization-based anchor selection strategy is designed to obtain salient points, and thus the anchor graph is computed to decrease the computational complexity of constructing the representation matrix. Subsequently, a refinement approach is designed to flexibly extract essential semantics between nodes by dividing the graph into significant components and undesired connections. These matrices preserving important information are fused into a tensor that is constrained by a nuclear norm to retain its low-rank property. Meanwhile, the undesired links should be eliminated to avoid confusing the clustering results. Finally, the spectral embedding is employed to directly guide the learning of anchors and graphs. The proposed model achieves a remarkable improvement of 3.3% and 13.1% of ACC on the NoisyMNIST and Prokaryotic datasets while reducing high computational complexity compared to other subspace-based clustering approaches.
期刊介绍:
The IEEE Transactions on Signal and Information Processing over Networks publishes high-quality papers that extend the classical notions of processing of signals defined over vector spaces (e.g. time and space) to processing of signals and information (data) defined over networks, potentially dynamically varying. In signal processing over networks, the topology of the network may define structural relationships in the data, or may constrain processing of the data. Topics include distributed algorithms for filtering, detection, estimation, adaptation and learning, model selection, data fusion, and diffusion or evolution of information over such networks, and applications of distributed signal processing.