MobiRNN: Efficient Recurrent Neural Network Execution on Mobile GPU

EMDL '17 Pub Date : 2017-06-03 DOI:10.1145/3089801.3089804
Qingqing Cao, Niranjan Balasubramanian, A. Balasubramanian
Citations: 60

Abstract

In this paper, we explore optimizations to run Recurrent Neural Network (RNN) models locally on mobile devices. RNN models are widely used for natural language processing, machine translation, and other tasks. However, existing mobile applications that use RNN models do so on the cloud. To address privacy and efficiency concerns, we show how RNN models can be run locally on mobile devices. Existing work on porting deep learning models to mobile devices focuses on Convolutional Neural Networks (CNNs) and cannot be applied directly to RNN models. In response, we present MobiRNN, a mobile-specific optimization framework that implements GPU offloading tailored to mobile GPUs. Evaluations using an RNN model for activity recognition show that MobiRNN significantly decreases the latency of running RNN models on phones.
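To make concrete what kind of computation a framework like MobiRNN offloads to the mobile GPU, the sketch below shows a single LSTM cell step, the dominant per-timestep workload of most RNN models. This is a generic NumPy illustration, not the paper's actual implementation (MobiRNN targets mobile GPU APIs rather than NumPy); the fused-gate layout `[input, forget, cell, output]` is an assumption for the example.

```python
import numpy as np

def sigmoid(v):
    return 1.0 / (1.0 + np.exp(-v))

def lstm_step(x, h, c, Wx, Wh, b):
    """One LSTM cell step: the per-timestep matrix multiplies and
    elementwise gate math that GPU-offloading frameworks accelerate.
    Assumed gate layout in the fused weights: [input, forget, cell, output].
    """
    z = x @ Wx + h @ Wh + b          # fused pre-activations for all 4 gates
    H = h.shape[-1]
    i = sigmoid(z[..., 0 * H:1 * H])  # input gate
    f = sigmoid(z[..., 1 * H:2 * H])  # forget gate
    g = np.tanh(z[..., 2 * H:3 * H])  # candidate cell state
    o = sigmoid(z[..., 3 * H:4 * H])  # output gate
    c_new = f * c + i * g             # update cell state
    h_new = o * np.tanh(c_new)        # emit new hidden state
    return h_new, c_new
```

The two dense products `x @ Wx` and `h @ Wh` are where a mobile GPU helps; the sequential dependence of `h` and `c` across timesteps is what makes RNNs harder to parallelize than CNNs.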