Layrub: layer-centric GPU memory reuse and data migration in extreme-scale deep learning systems

Proceedings of the 23rd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming Pub Date : 2018-02-10 DOI:10.1145/3178487.3178528

Bo Liu, Wenbin Jiang, Hai Jin, Xuanhua Shi, Yang Ma

引用次数: 2

Abstract

Growing accuracy and robustness of Deep Neural Networks (DNN) models are accompanied by growing model capacity (going deeper or wider). However, high memory requirements of those models make it difficult to execute the training process in one GPU. To address it, we first identify the memory usage characteristics for deep and wide convolutional networks, and demonstrate the opportunities of memory reuse on both intra-layer and inter-layer levels. We then present Layrub, a runtime data placement strategy that orchestrates the execution of training process. It achieves layer-centric reuse to reduce memory consumption for extreme-scale deep learning that cannot be run on one single GPU.

查看原文本刊更多论文

Layrub:极端规模深度学习系统中以层为中心的GPU内存重用和数据迁移

深度神经网络(DNN)模型精度和鲁棒性的提高伴随着模型容量的增长(深度或广度)。然而，这些模型的高内存要求使得在一个GPU上执行训练过程变得困难。为了解决这个问题，我们首先确定了深度和宽卷积网络的内存使用特征，并展示了在层内和层间级别上内存重用的机会。然后，我们介绍Layrub，这是一个运行时数据放置策略，可以编排训练过程的执行。它实现了以层为中心的重用，以减少无法在单个GPU上运行的极端规模深度学习的内存消耗。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 23rd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming

自引率

0.00%

发文量