An Efficient Multi-View Heterogeneous Hypergraph Convolutional Network for Heterogeneous Information Network Representation Learning

IF 7.5 3区计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS

IEEE Transactions on Big Data Pub Date : 2024-08-13 DOI:10.1109/TBDATA.2024.3442549

Rui Bing;Guan Yuan;Yanmei Zhang;Senzhang Wang;Bohan Li;Yong Zhou

{"title":"An Efficient Multi-View Heterogeneous Hypergraph Convolutional Network for Heterogeneous Information Network Representation Learning","authors":"Rui Bing;Guan Yuan;Yanmei Zhang;Senzhang Wang;Bohan Li;Yong Zhou","doi":"10.1109/TBDATA.2024.3442549","DOIUrl":null,"url":null,"abstract":"Heterogeneous hypergraph neural networks are powerful tools to capture complex correlations among various nodes in Heterogeneous Information Networks (HINs). Despite satisfied performances of them, they are still plagued by the following problems: 1) They cannot capture the correlations in structural and semantic view at once, leading to topological information loss. 2) Due to the number of nodes being greater than the number of node types, node-level self-attention they used causes massive parameters and leads to high time consumption. 3) Interactions in meta-paths may be redundant, resulting in the correlations bias. To address the three issues, we propose an efficient Multi-View Heterogeneous Hypergraph Convolutional Network (MVH <inline-formula><tex-math>$^{2}$</tex-math></inline-formula> GCN). It first constructs relational and semantic hypergraphs based on different types of edges and meta-paths respectively, to represent the complex correlations in structural view and semantic view. Meanwhile, the clean semantic hypergraphs are generated by structure learning network to avoid redundancy. Then, an efficient hypergraph convolutional network is designed to learn node embeddings. By doing so, correlations in the two views are captured. Finally, the learned node embeddings from two views are aggregated via a gated embedding fusion module for downstream tasks. Experiment results demonstrate that MVH <inline-formula><tex-math>$^{2}$</tex-math></inline-formula> GCN is effective and efficient.","PeriodicalId":13106,"journal":{"name":"IEEE Transactions on Big Data","volume":"11 3","pages":"1144-1157"},"PeriodicalIF":7.5000,"publicationDate":"2024-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Big Data","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10634788/","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

Heterogeneous hypergraph neural networks are powerful tools to capture complex correlations among various nodes in Heterogeneous Information Networks (HINs). Despite satisfied performances of them, they are still plagued by the following problems: 1) They cannot capture the correlations in structural and semantic view at once, leading to topological information loss. 2) Due to the number of nodes being greater than the number of node types, node-level self-attention they used causes massive parameters and leads to high time consumption. 3) Interactions in meta-paths may be redundant, resulting in the correlations bias. To address the three issues, we propose an efficient Multi-View Heterogeneous Hypergraph Convolutional Network (MVH

$^{2}$

GCN). It first constructs relational and semantic hypergraphs based on different types of edges and meta-paths respectively, to represent the complex correlations in structural view and semantic view. Meanwhile, the clean semantic hypergraphs are generated by structure learning network to avoid redundancy. Then, an efficient hypergraph convolutional network is designed to learn node embeddings. By doing so, correlations in the two views are captured. Finally, the learned node embeddings from two views are aggregated via a gated embedding fusion module for downstream tasks. Experiment results demonstrate that MVH

$^{2}$

GCN is effective and efficient.

查看原文本刊更多论文

面向异构信息网络表示学习的高效多视图异构超图卷积网络

异构超图神经网络是捕获异构信息网络中各节点间复杂关联的有力工具。尽管它们的性能令人满意，但仍然存在以下问题：1)它们不能同时捕获结构和语义视图的相关性，导致拓扑信息丢失。2)由于节点数量大于节点类型数量，使用节点级自关注导致参数量大，耗时高。3)元路径中的交互可能是冗余的，导致相关性偏差。为了解决这三个问题，我们提出了一个高效的多视图异构超图卷积网络（MVH $^{2}$ GCN）。首先分别基于不同类型的边和元路径构建关系超图和语义超图，以表示结构视图和语义视图中的复杂关联。同时，通过结构学习网络生成干净的语义超图，避免冗余。然后，设计了一个高效的超图卷积网络来学习节点嵌入。通过这样做，可以捕获两个视图中的相关性。最后，通过一个门控嵌入融合模块对从两个视图中学习到的节点嵌入进行聚合，用于下游任务。实验结果表明，MVH $^{2}$ GCN算法是有效的。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEE Transactions on Big Data Multiple-

CiteScore

11.80

自引率

2.80%

发文量

114

期刊介绍： The IEEE Transactions on Big Data publishes peer-reviewed articles focusing on big data. These articles present innovative research ideas and application results across disciplines, including novel theories, algorithms, and applications. Research areas cover a wide range, such as big data analytics, visualization, curation, management, semantics, infrastructure, standards, performance analysis, intelligence extraction, scientific discovery, security, privacy, and legal issues specific to big data. The journal also prioritizes applications of big data in fields generating massive datasets.