Siamese network with squeeze-attention for incomplete multi-view multi-label classification

IF 4.6 2区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Complex & Intelligent Systems Pub Date : 2025-05-12 DOI:10.1007/s40747-025-01909-6

Mengqing Wang, Jiarui Chen, Lian Zhao, Yinghao Ye, Xiaohuan Lu

{"title":"Siamese network with squeeze-attention for incomplete multi-view multi-label classification","authors":"Mengqing Wang, Jiarui Chen, Lian Zhao, Yinghao Ye, Xiaohuan Lu","doi":"10.1007/s40747-025-01909-6","DOIUrl":null,"url":null,"abstract":"<p>Multi-view multi-label classification (MvMLC) has garnered significant interest because of its ability to handle complex datasets. However, the inherent complexity of real-world data often results in incomplete views and missing labels, which limit the richness of data and hinder the accurate association of features with their corresponding categories. Additionally, the MvMLC task is intricate due to the need for diverse views to coherently represent the same entity, thus demanding the creation of stable and consistent multi-view representations that can ensure a reliable feature alignment process across heterogeneous perspectives. To address these challenges, we propose a model based on a Siamese network with squeeze attention (SSA) for incomplete multi-view multi-label classification (iMvMLC). Specifically, to capture the shared semantic information across different views, we combine cross-view collaborative synthesis (CCS) and viewwise representation calibration (VRC) mechanisms. CCS enhances the semantic interaction between views by introducing directive blocks and stacked autoencoders on top of the Siamese network, thereby improving the ability to extract shared semantic representations. The VRC mechanism uses contrastive learning with positive and negative sample pairs to refine the shared semantic space, ensuring higher feature consistency and better alignment across views. Furthermore, considering the task-specific importance variation exhibited by each view, we apply the squeeze attention-weighted fusion (SWF) strategy, which performs feature dimensionality reduction to amplify the key characteristics from each view and enables the model to flexibly adjust the influence of each perspective. Extensive evaluations conducted across five datasets demonstrate that the SSA method outperforms many existing approaches.</p>","PeriodicalId":10524,"journal":{"name":"Complex & Intelligent Systems","volume":"13 1","pages":""},"PeriodicalIF":4.6000,"publicationDate":"2025-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Complex & Intelligent Systems","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s40747-025-01909-6","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Multi-view multi-label classification (MvMLC) has garnered significant interest because of its ability to handle complex datasets. However, the inherent complexity of real-world data often results in incomplete views and missing labels, which limit the richness of data and hinder the accurate association of features with their corresponding categories. Additionally, the MvMLC task is intricate due to the need for diverse views to coherently represent the same entity, thus demanding the creation of stable and consistent multi-view representations that can ensure a reliable feature alignment process across heterogeneous perspectives. To address these challenges, we propose a model based on a Siamese network with squeeze attention (SSA) for incomplete multi-view multi-label classification (iMvMLC). Specifically, to capture the shared semantic information across different views, we combine cross-view collaborative synthesis (CCS) and viewwise representation calibration (VRC) mechanisms. CCS enhances the semantic interaction between views by introducing directive blocks and stacked autoencoders on top of the Siamese network, thereby improving the ability to extract shared semantic representations. The VRC mechanism uses contrastive learning with positive and negative sample pairs to refine the shared semantic space, ensuring higher feature consistency and better alignment across views. Furthermore, considering the task-specific importance variation exhibited by each view, we apply the squeeze attention-weighted fusion (SWF) strategy, which performs feature dimensionality reduction to amplify the key characteristics from each view and enables the model to flexibly adjust the influence of each perspective. Extensive evaluations conducted across five datasets demonstrate that the SSA method outperforms many existing approaches.

查看原文本刊更多论文

不完全多视图多标签分类的Siamese网络挤压注意

多视图多标签分类（MvMLC）因其处理复杂数据集的能力而引起了人们的极大兴趣。然而，现实世界数据固有的复杂性往往导致不完整的视图和缺失的标签，这限制了数据的丰富性，阻碍了特征与其相应类别的准确关联。此外，MvMLC任务是复杂的，因为需要不同的视图来一致地表示相同的实体，因此要求创建稳定和一致的多视图表示，以确保跨异构透视图的可靠特征对齐过程。为了解决这些挑战，我们提出了一种基于Siamese网络的不完全多视图多标签分类（iMvMLC）模型。具体来说，为了捕获跨不同视图的共享语义信息，我们结合了跨视图协同合成（CCS）和视图表示校准（VRC）机制。CCS通过在Siamese网络上引入指令块和堆叠自编码器来增强视图之间的语义交互，从而提高了提取共享语义表示的能力。VRC机制使用正负样本对的对比学习来细化共享语义空间，确保更高的特征一致性和更好的跨视图对齐。此外，考虑到每个视图所表现出的任务特定重要性变化，我们采用挤压注意加权融合（SWF）策略，该策略通过特征降维来放大每个视图的关键特征，使模型能够灵活调整每个视图的影响。在五个数据集上进行的广泛评估表明，SSA方法优于许多现有方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Complex & Intelligent Systems COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE-

CiteScore

9.60

自引率

10.30%

发文量

297

期刊介绍： Complex & Intelligent Systems aims to provide a forum for presenting and discussing novel approaches, tools and techniques meant for attaining a cross-fertilization between the broad fields of complex systems, computational simulation, and intelligent analytics and visualization. The transdisciplinary research that the journal focuses on will expand the boundaries of our understanding by investigating the principles and processes that underlie many of the most profound problems facing society today.