Temporal SIR-GN: Efficient and Effective Structural Representation Learning for Temporal Graphs

Proc. VLDB Endow. Pub Date : 2023-05-01 DOI:10.14778/3598581.3598583

Janet Layne, Justin Carpenter, Edoardo Serra, Francesco Gullo

{"title":"Temporal SIR-GN: Efficient and Effective Structural Representation Learning for Temporal Graphs","authors":"Janet Layne, Justin Carpenter, Edoardo Serra, Francesco Gullo","doi":"10.14778/3598581.3598583","DOIUrl":null,"url":null,"abstract":"Node representation learning (NRL) generates numerical vectors (embeddings) for the nodes of a graph. Structural NRL specifically assigns similar node embeddings for those nodes that exhibit similar structural roles. This is in contrast with its proximity-based counterpart, wherein similarity between embeddings reflects spatial proximity among nodes. Structural NRL is useful for tasks such as node classification where nodes of the same class share structural roles, though there may exist a distant, or no path between them.\n Athough structural NRL has been well-studied in static graphs, it has received limited attention in the temporal setting. Here, the embeddings are required to represent the evolution of nodes' structural roles over time. The existing methods are limited in terms of efficiency and effectiveness: they scale poorly to even moderate number of timestamps, or capture structural role only tangentially.\n \n In this work, we present a novel unsupervised approach to structural representation learning for temporal graphs that overcomes these limitations. For each node, our approach clusters then aggregates the embedding of a node's neighbors for each timestamp, followed by a further temporal aggregation of all timestamps. This is repeated for (at most)\n d\n iterations, so as to acquire information from the\n d\n -hop neighborhood of a node. Our approach takes linear time in the number of overall temporal edges, and possesses important theoretical properties that formally demonstrate its effectiveness.\n \n Extensive experiments on synthetic and real datasets show superior performance in node classification and regression tasks, and superior scalability of our approach to large graphs.","PeriodicalId":20467,"journal":{"name":"Proc. VLDB Endow.","volume":"34 1","pages":"2075-2089"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proc. VLDB Endow.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.14778/3598581.3598583","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

Node representation learning (NRL) generates numerical vectors (embeddings) for the nodes of a graph. Structural NRL specifically assigns similar node embeddings for those nodes that exhibit similar structural roles. This is in contrast with its proximity-based counterpart, wherein similarity between embeddings reflects spatial proximity among nodes. Structural NRL is useful for tasks such as node classification where nodes of the same class share structural roles, though there may exist a distant, or no path between them. Athough structural NRL has been well-studied in static graphs, it has received limited attention in the temporal setting. Here, the embeddings are required to represent the evolution of nodes' structural roles over time. The existing methods are limited in terms of efficiency and effectiveness: they scale poorly to even moderate number of timestamps, or capture structural role only tangentially. In this work, we present a novel unsupervised approach to structural representation learning for temporal graphs that overcomes these limitations. For each node, our approach clusters then aggregates the embedding of a node's neighbors for each timestamp, followed by a further temporal aggregation of all timestamps. This is repeated for (at most) d iterations, so as to acquire information from the d -hop neighborhood of a node. Our approach takes linear time in the number of overall temporal edges, and possesses important theoretical properties that formally demonstrate its effectiveness. Extensive experiments on synthetic and real datasets show superior performance in node classification and regression tasks, and superior scalability of our approach to large graphs.

查看原文本刊更多论文

时间图SIR-GN:时间图的高效结构表示学习

节点表示学习(NRL)为图的节点生成数值向量(嵌入)。结构NRL特别为那些表现出相似结构角色的节点分配相似的节点嵌入。这与基于接近度的对应方法形成对比，其中嵌入之间的相似性反映了节点之间的空间接近性。结构化NRL对于节点分类这样的任务很有用，其中相同类的节点共享结构角色，尽管它们之间可能存在距离，或者没有路径。尽管在静态图中已经对结构NRL进行了很好的研究，但在时间设置中却受到了有限的关注。在这里，需要嵌入来表示节点的结构角色随时间的演变。现有的方法在效率和有效性方面是有限的:它们很难扩展到甚至中等数量的时间戳，或者只是切切地捕捉结构作用。在这项工作中，我们提出了一种新的无监督方法来克服这些限制的时间图结构表示学习。对于每个节点，我们的方法集群然后为每个时间戳聚合节点邻居的嵌入，然后对所有时间戳进行进一步的时间聚合。这将重复(最多)d次迭代，以便从节点的d跳邻域获取信息。我们的方法在整体时间边的数量上需要线性时间，并且具有正式证明其有效性的重要理论性质。在合成数据集和真实数据集上进行的大量实验表明，我们的方法在节点分类和回归任务上具有优越的性能，并且在处理大型图时具有优越的可扩展性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proc. VLDB Endow.

自引率

0.00%

发文量