Scalable Graph Neural Network Training

Q3 Computer Science

Operating Systems Review (ACM) Pub Date : 2021-05-05 DOI:10.1145/3469379.3469387

M. Serafini, Hui Guan

引用次数: 19

Abstract

Graph Neural Networks (GNNs) are a new and increasingly popular family of deep neural network architectures to perform learning on graphs. Training them efficiently is challenging due to the irregular nature of graph data. The problem becomes even more challenging when scaling to large graphs that exceed the capacity of single devices. Standard approaches to distributed DNN training, like data and model parallelism, do not directly apply to GNNs. Instead, two different approaches have emerged in the literature: whole-graph and sample-based training. In this paper, we review and compare the two approaches. Scalability is challenging with both approaches, but we make a case that research should focus on sample-based training since it is a more promising approach. Finally, we review recent systems supporting sample-based training.

查看原文本刊更多论文

可伸缩图神经网络训练

图神经网络（GNN）是一种新的、越来越流行的深度神经网络架构家族，用于对图进行学习。由于图形数据的不规则性，有效地训练它们是具有挑战性的。当扩展到超过单个设备容量的大型图形时，这个问题变得更加具有挑战性。分布式DNN训练的标准方法，如数据和模型并行性，并不直接适用于GNN。相反，文献中出现了两种不同的方法：全图和基于样本的训练。在本文中，我们回顾并比较了这两种方法。这两种方法的可扩展性都很有挑战性，但我们认为研究应该集中在基于样本的训练上，因为这是一种更有前景的方法。最后，我们回顾了最近支持基于样本的训练的系统。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Operating Systems Review (ACM) Computer Science-Computer Networks and Communications

CiteScore

2.80

自引率

0.00%

发文量

期刊介绍： Operating Systems Review (OSR) is a publication of the ACM Special Interest Group on Operating Systems (SIGOPS), whose scope of interest includes: computer operating systems and architecture for multiprogramming, multiprocessing, and time sharing; resource management; evaluation and simulation; reliability, integrity, and security of data; communications among computing processors; and computer system modeling and analysis.