HeGCL: Advance Self-Supervised Learning in Heterogeneous Graph-Level Representation

IF 10.2 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

IEEE transactions on neural networks and learning systems Pub Date : 2023-03-25 DOI:10.1109/TNNLS.2023.3273255

Gen Shi;Yifan Zhu;Jian K. Liu;Xuesong Li

{"title":"HeGCL: Advance Self-Supervised Learning in Heterogeneous Graph-Level Representation","authors":"Gen Shi;Yifan Zhu;Jian K. Liu;Xuesong Li","doi":"10.1109/TNNLS.2023.3273255","DOIUrl":null,"url":null,"abstract":"Representation learning in heterogeneous graphs with massive unlabeled data has aroused great interest. The heterogeneity of graphs not only contains rich information, but also raises difficult barriers to designing unsupervised or self-supervised learning (SSL) strategies. Existing methods such as random walk-based approaches are mainly dependent on the proximity information of neighbors and lack the ability to integrate node features into a higher-level representation. Furthermore, previous self-supervised or unsupervised frameworks are usually designed for node-level tasks, which are commonly short of capturing global graph properties and may not perform well in graph-level tasks. Therefore, a label-free framework that can better capture the global properties of heterogeneous graphs is urgently required. In this article, we propose a self-supervised heterogeneous graph neural network (GNN) based on cross-view contrastive learning (HeGCL). The HeGCL presents two views for encoding heterogeneous graphs: the meta-path view and the outline view. Compared with the meta-path view that provides semantic information, the outline view encodes the complex edge relations and captures graph-level properties by using a nonlocal block. Thus, the HeGCL learns node embeddings through maximizing mutual information (MI) between global and semantic representations coming from the outline and meta-path view, respectively. Experiments on both node-level and graph-level tasks show the superiority of the proposed model over other methods, and further exploration studies also show that the introduction of nonlocal block brings a significant contribution to graph-level tasks.","PeriodicalId":13303,"journal":{"name":"IEEE transactions on neural networks and learning systems","volume":"35 10","pages":"13914-13925"},"PeriodicalIF":10.2000,"publicationDate":"2023-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on neural networks and learning systems","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10135109/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Representation learning in heterogeneous graphs with massive unlabeled data has aroused great interest. The heterogeneity of graphs not only contains rich information, but also raises difficult barriers to designing unsupervised or self-supervised learning (SSL) strategies. Existing methods such as random walk-based approaches are mainly dependent on the proximity information of neighbors and lack the ability to integrate node features into a higher-level representation. Furthermore, previous self-supervised or unsupervised frameworks are usually designed for node-level tasks, which are commonly short of capturing global graph properties and may not perform well in graph-level tasks. Therefore, a label-free framework that can better capture the global properties of heterogeneous graphs is urgently required. In this article, we propose a self-supervised heterogeneous graph neural network (GNN) based on cross-view contrastive learning (HeGCL). The HeGCL presents two views for encoding heterogeneous graphs: the meta-path view and the outline view. Compared with the meta-path view that provides semantic information, the outline view encodes the complex edge relations and captures graph-level properties by using a nonlocal block. Thus, the HeGCL learns node embeddings through maximizing mutual information (MI) between global and semantic representations coming from the outline and meta-path view, respectively. Experiments on both node-level and graph-level tasks show the superiority of the proposed model over other methods, and further exploration studies also show that the introduction of nonlocal block brings a significant contribution to graph-level tasks.

查看原文本刊更多论文

HeGCL：异构图层表示中的高级自我监督学习。

在具有大量无标记数据的异构图中进行表征学习引起了人们的极大兴趣。图的异质性不仅包含丰富的信息，也为设计无监督或自监督学习（SSL）策略带来了困难。现有的方法（如基于随机漫步的方法）主要依赖于邻居的邻近信息，缺乏将节点特征整合到更高层次表示中的能力。此外，以往的自监督或无监督框架通常是针对节点级任务设计的，通常无法捕捉全局图属性，在图级任务中可能表现不佳。因此，迫切需要一种能更好地捕捉异构图全局属性的无标签框架。在本文中，我们提出了一种基于跨视图对比学习（HeGCL）的自监督异构图神经网络（GNN）。HeGCL 提出了两种异构图编码视图：元路径视图和轮廓视图。与提供语义信息的元路径视图相比，轮廓视图通过使用非本地块来编码复杂的边缘关系和捕捉图层属性。因此，HeGCL 通过最大化分别来自轮廓视图和元路径视图的全局表征和语义表征之间的互信息（MI）来学习节点嵌入。在节点级和图级任务上的实验表明，所提出的模型优于其他方法，而进一步的探索研究也表明，非本地块的引入为图级任务做出了重大贡献。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEE transactions on neural networks and learning systems COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE-COMPUTER SCIENCE, HARDWARE & ARCHITECTURE

CiteScore

23.80

自引率

9.60%

发文量

2102

审稿时长

3-8 weeks

期刊介绍： The focus of IEEE Transactions on Neural Networks and Learning Systems is to present scholarly articles discussing the theory, design, and applications of neural networks as well as other learning systems. The journal primarily highlights technical and scientific research in this domain.