Transparent GPU memory management for DNNs

Proceedings of the 23rd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming Pub Date : 2018-02-10 DOI:10.1145/3178487.3178531

Jungho Park, Hyungmin Cho, Wookeun Jung, Jaejin Lee

引用次数: 2

Abstract

Modern DNN frameworks exploit GPU acceleration by default to achieve high performance. The limitation of GPU memory capacity becomes a serious problem because DNNs are becoming deeper and larger. This paper proposes a purely software-based transparent solution, called tvDNN, to the GPU memory capacity problem. It is based on GPU memory swapping and memory object sectioning techniques. It also provides an efficient memory-object swapping schedule based on ILP (optimal) and heuristics (suboptimal). The experimental results show that tvDNN enables Caffe to build VGG-16 with a large batch size, such as 256 or 512, using a few GB of GPU memory without significant performance degradation.

查看原文本刊更多论文

透明GPU内存管理的dnn

现代深度神经网络框架默认使用GPU加速来实现高性能。随着深度神经网络变得越来越大，GPU内存容量的限制成为一个严重的问题。本文提出了一种纯粹基于软件的透明解决方案，称为tvDNN，以解决GPU内存容量问题。它是基于GPU内存交换和内存对象分割技术。它还提供了基于ILP(最优)和启发式(次优)的高效内存对象交换计划。实验结果表明，tvDNN使Caffe能够使用几GB的GPU内存构建批量大小较大的VGG-16，例如256或512，而不会出现明显的性能下降。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 23rd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming

自引率

0.00%

发文量