Linear projection fused graph-based semi-supervised learning on multi-view data

IF 13.9 2区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Artificial Intelligence Review Pub Date : 2025-07-12 DOI:10.1007/s10462-025-11313-8

Jingjun Bi, Fadi Dornaika, Jinan Charafeddine

{"title":"Linear projection fused graph-based semi-supervised learning on multi-view data","authors":"Jingjun Bi, Fadi Dornaika, Jinan Charafeddine","doi":"10.1007/s10462-025-11313-8","DOIUrl":null,"url":null,"abstract":"<div><p>In recent years, the surge in data-driven applications across various domains has spurred heightened interest in semi-supervised learning applied to graphs. This surge is attributed to the ubiquitous presence of graph data structures in real-world contexts, such as social networks’ interpersonal relationships, recommender systems’ user behavior graphs, and bioinformatics’ molecular interaction networks. However, for certain data types like images, not only is there a dearth of explicit graph structure, but also the existence of multiple view description methods complicates matters further. The intricacies of multi-view data pose challenges in directly applying traditional semi-supervised learning techniques to graphs. Consequently, researchers have begun exploring the fusion of semi-supervised learning with deep learning to leverage its wealth of information and enhance model efficacy. Effectively amalgamating graph structures with multi-view data remains a challenging problem necessitating further research. This paper introduces the Linear projection Fused Graph-based Semi-supervised Classification (LFGSC) method tailored for multi-view data, building upon the Graph Convolutional Network (GCN) architecture. Firstly, for each view, we leverage a semi-supervised approach that provides the concurrent estimation of the corresponding graph and the flexible linear data representations in a low-dimensional feature space. Subsequently, an adaptive and unified graph is generated, followed by the utilization of a fully connected network to fuse the projected features further and reduce dimensionality. Finally, the fused features and graph are inputted into a GCN to conduct semi-supervised classification. During training, the model incorporates cross-entropy loss, manifold regularization loss, graph auto-encoder loss, and supervised contrastive loss. Leveraging linear transformation significantly diminishes the input feature dimensions for GCN, thereby achieving high accuracy while substantially reducing computational overhead. Furthermore, experimental results conducted on various bench-marked multi-view image datasets demonstrate the superiority of LFGSC over existing semi-supervised learning methods for multi-view scenarios. (Source code: https://github.com/BiJingjun/LFGSC.)</p></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"58 10","pages":""},"PeriodicalIF":13.9000,"publicationDate":"2025-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-025-11313-8.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Artificial Intelligence Review","FirstCategoryId":"94","ListUrlMain":"https://link.springer.com/article/10.1007/s10462-025-11313-8","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

In recent years, the surge in data-driven applications across various domains has spurred heightened interest in semi-supervised learning applied to graphs. This surge is attributed to the ubiquitous presence of graph data structures in real-world contexts, such as social networks’ interpersonal relationships, recommender systems’ user behavior graphs, and bioinformatics’ molecular interaction networks. However, for certain data types like images, not only is there a dearth of explicit graph structure, but also the existence of multiple view description methods complicates matters further. The intricacies of multi-view data pose challenges in directly applying traditional semi-supervised learning techniques to graphs. Consequently, researchers have begun exploring the fusion of semi-supervised learning with deep learning to leverage its wealth of information and enhance model efficacy. Effectively amalgamating graph structures with multi-view data remains a challenging problem necessitating further research. This paper introduces the Linear projection Fused Graph-based Semi-supervised Classification (LFGSC) method tailored for multi-view data, building upon the Graph Convolutional Network (GCN) architecture. Firstly, for each view, we leverage a semi-supervised approach that provides the concurrent estimation of the corresponding graph and the flexible linear data representations in a low-dimensional feature space. Subsequently, an adaptive and unified graph is generated, followed by the utilization of a fully connected network to fuse the projected features further and reduce dimensionality. Finally, the fused features and graph are inputted into a GCN to conduct semi-supervised classification. During training, the model incorporates cross-entropy loss, manifold regularization loss, graph auto-encoder loss, and supervised contrastive loss. Leveraging linear transformation significantly diminishes the input feature dimensions for GCN, thereby achieving high accuracy while substantially reducing computational overhead. Furthermore, experimental results conducted on various bench-marked multi-view image datasets demonstrate the superiority of LFGSC over existing semi-supervised learning methods for multi-view scenarios. (Source code: https://github.com/BiJingjun/LFGSC.)

查看原文本刊更多论文

基于多视图数据的线性投影融合图半监督学习

近年来，数据驱动应用在各个领域的激增激发了人们对半监督学习应用于图的兴趣。这种激增归因于图数据结构在现实世界中无处不在的存在，例如社交网络的人际关系、推荐系统的用户行为图和生物信息学的分子相互作用网络。然而，对于某些数据类型，如图像，不仅缺乏明确的图结构，而且存在多种视图描述方法，使问题进一步复杂化。多视图数据的复杂性给将传统的半监督学习技术直接应用于图提出了挑战。因此，研究人员已经开始探索半监督学习与深度学习的融合，以利用其丰富的信息并提高模型的有效性。图结构与多视图数据的有效融合仍然是一个具有挑战性的问题，需要进一步研究。本文在图卷积网络（GCN）的基础上，提出了一种针对多视图数据的线性投影融合图半监督分类（LFGSC）方法。首先，对于每个视图，我们利用半监督方法，在低维特征空间中提供相应图的并发估计和灵活的线性数据表示。然后，生成自适应的统一图，然后利用全连通网络进一步融合投影特征并降低维数。最后，将融合后的特征和图输入到GCN中进行半监督分类。在训练过程中，该模型结合了交叉熵损失、流形正则化损失、图自编码器损失和监督对比损失。利用线性变换可以显著减少GCN的输入特征维度，从而在大大减少计算开销的同时实现高精度。此外，在各种基准多视图图像数据集上进行的实验结果表明，LFGSC在多视图场景下优于现有的半监督学习方法。（源代码：https://github.com/BiJingjun/LFGSC。）

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Artificial Intelligence Review 工程技术-计算机：人工智能

CiteScore

22.00

自引率

3.30%

发文量

194

审稿时长

5.3 months

期刊介绍： Artificial Intelligence Review, a fully open access journal, publishes cutting-edge research in artificial intelligence and cognitive science. It features critical evaluations of applications, techniques, and algorithms, providing a platform for both researchers and application developers. The journal includes refereed survey and tutorial articles, along with reviews and commentary on significant developments in the field.