异构信息网络学习的自监督节点-超边嵌入

IF 7.5 3区计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS

IEEE Transactions on Big Data Pub Date : 2023-03-11 DOI:10.1109/TBDATA.2023.3275374

Mengran Li;Yong Zhang;Wei Zhang;Yi Chu;Yongli Hu;Baocai Yin

{"title":"异构信息网络学习的自监督节点-超边嵌入","authors":"Mengran Li;Yong Zhang;Wei Zhang;Yi Chu;Yongli Hu;Baocai Yin","doi":"10.1109/TBDATA.2023.3275374","DOIUrl":null,"url":null,"abstract":"The exploration of self-supervised information mining of heterogeneous datasets has gained significant traction in recent years. Heterogeneous graph neural networks (HGNNs) have emerged as a highly promising method for handling heterogeneous information networks (HINs) due to their superior performance. These networks leverage aggregation functions to convert pairwise relations-based features from raw heterogeneous graphs into embedding vectors. However, real-world HINs contain valuable higher-order relations that are often overlooked but can provide complementary information. To address this issue, we propose a novel method called \n<bold>S\nelf-supervised \n<bold>N\nodes-\n<bold>H\nyperedges \n<bold>E\nmbedding (SNHE), which leverages hypergraph structures to incorporate higher-order information into the embedding process of HINs. Our method decomposes the raw graph structure into snapshots based on various meta-paths, which are then transformed into hypergraphs to aggregate high-order information within the data and generate embedding representations. Given the complexity of HINs, we develop a dual self-supervised structure that maximizes mutual information in the enhanced graph data space, guides the overall model update, and reduces redundancy and noise. We evaluate our proposed method on various real-world datasets for node classification and clustering tasks, and compare it against state-of-the-art methods. The experimental results demonstrate the efficacy of our method. Our code is available at \n<uri>https://github.com/limengran98/SNHE</uri>\n.","PeriodicalId":13106,"journal":{"name":"IEEE Transactions on Big Data","volume":"9 4","pages":"1210-1224"},"PeriodicalIF":7.5000,"publicationDate":"2023-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Self-Supervised Nodes-Hyperedges Embedding for Heterogeneous Information Network Learning\",\"authors\":\"Mengran Li;Yong Zhang;Wei Zhang;Yi Chu;Yongli Hu;Baocai Yin\",\"doi\":\"10.1109/TBDATA.2023.3275374\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The exploration of self-supervised information mining of heterogeneous datasets has gained significant traction in recent years. Heterogeneous graph neural networks (HGNNs) have emerged as a highly promising method for handling heterogeneous information networks (HINs) due to their superior performance. These networks leverage aggregation functions to convert pairwise relations-based features from raw heterogeneous graphs into embedding vectors. However, real-world HINs contain valuable higher-order relations that are often overlooked but can provide complementary information. To address this issue, we propose a novel method called \\n<bold>S\\nelf-supervised \\n<bold>N\\nodes-\\n<bold>H\\nyperedges \\n<bold>E\\nmbedding (SNHE), which leverages hypergraph structures to incorporate higher-order information into the embedding process of HINs. Our method decomposes the raw graph structure into snapshots based on various meta-paths, which are then transformed into hypergraphs to aggregate high-order information within the data and generate embedding representations. Given the complexity of HINs, we develop a dual self-supervised structure that maximizes mutual information in the enhanced graph data space, guides the overall model update, and reduces redundancy and noise. We evaluate our proposed method on various real-world datasets for node classification and clustering tasks, and compare it against state-of-the-art methods. The experimental results demonstrate the efficacy of our method. Our code is available at \\n<uri>https://github.com/limengran98/SNHE</uri>\\n.\",\"PeriodicalId\":13106,\"journal\":{\"name\":\"IEEE Transactions on Big Data\",\"volume\":\"9 4\",\"pages\":\"1210-1224\"},\"PeriodicalIF\":7.5000,\"publicationDate\":\"2023-03-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Big Data\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10123057/\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Big Data","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10123057/","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 0

摘要

近年来，对异构数据集的自监督信息挖掘的探索得到了极大的关注。异构图神经网络(hgnn)由于其优越的性能而成为处理异构信息网络(HINs)的一种很有前途的方法。这些网络利用聚合函数将基于成对关系的特征从原始异构图转换为嵌入向量。然而，现实世界的HINs包含有价值的高阶关系，这些关系经常被忽视，但可以提供补充信息。为了解决这个问题，我们提出了一种新的方法，称为自监督节点-超边嵌入(SNHE)，它利用超图结构将高阶信息融入到HINs的嵌入过程中。我们的方法将原始图结构分解为基于各种元路径的快照，然后将其转换为超图，以聚合数据中的高阶信息并生成嵌入表示。鉴于HINs的复杂性，我们开发了一种双重自监督结构，该结构在增强的图数据空间中最大化互信息，指导整体模型更新，并减少冗余和噪声。我们在各种真实世界的数据集上评估了我们提出的方法，用于节点分类和聚类任务，并将其与最先进的方法进行比较。实验结果证明了该方法的有效性。我们的代码可在https://github.com/limengran98/SNHE上获得。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Self-Supervised Nodes-Hyperedges Embedding for Heterogeneous Information Network Learning

The exploration of self-supervised information mining of heterogeneous datasets has gained significant traction in recent years. Heterogeneous graph neural networks (HGNNs) have emerged as a highly promising method for handling heterogeneous information networks (HINs) due to their superior performance. These networks leverage aggregation functions to convert pairwise relations-based features from raw heterogeneous graphs into embedding vectors. However, real-world HINs contain valuable higher-order relations that are often overlooked but can provide complementary information. To address this issue, we propose a novel method called S elf-supervised N odes- H yperedges E mbedding (SNHE), which leverages hypergraph structures to incorporate higher-order information into the embedding process of HINs. Our method decomposes the raw graph structure into snapshots based on various meta-paths, which are then transformed into hypergraphs to aggregate high-order information within the data and generate embedding representations. Given the complexity of HINs, we develop a dual self-supervised structure that maximizes mutual information in the enhanced graph data space, guides the overall model update, and reduces redundancy and noise. We evaluate our proposed method on various real-world datasets for node classification and clustering tasks, and compare it against state-of-the-art methods. The experimental results demonstrate the efficacy of our method. Our code is available at https://github.com/limengran98/SNHE .

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

IEEE Transactions on Big Data Multiple-

CiteScore

11.80

自引率

2.80%

发文量

114

期刊介绍： The IEEE Transactions on Big Data publishes peer-reviewed articles focusing on big data. These articles present innovative research ideas and application results across disciplines, including novel theories, algorithms, and applications. Research areas cover a wide range, such as big data analytics, visualization, curation, management, semantics, infrastructure, standards, performance analysis, intelligence extraction, scientific discovery, security, privacy, and legal issues specific to big data. The journal also prioritizes applications of big data in fields generating massive datasets.