Energy and Performance Efficient Computation Offloading for Deep Neural Networks in a Mobile Cloud Computing Environment

Amir Erfan Eshratifar, Massoud Pedram
{"title":"Energy and Performance Efficient Computation Offloading for Deep Neural Networks in a Mobile Cloud Computing Environment","authors":"Amir Erfan Eshratifar, Massoud Pedram","doi":"10.1145/3194554.3194565","DOIUrl":null,"url":null,"abstract":"In today's computing technology scene, mobile devices are considered to be computationally weak, while large cloud servers are capable of handling expensive workloads, therefore, intensive computing tasks are typically offloaded to the cloud. Recent advances in learning techniques have enabled Deep Neural Networks (DNNs) to be deployed in a wide range of applications. Commercial speech based intelligent personal assistants (IPA) like Apple's Siri, which employs DNN as its recognition model, operate solely over the cloud. The cloud-only approach may require a large amount of data transfer between the cloud and the mobile device. The mobile-only approach may lack performance efficiency. In addition, the cloud server may be slow at times due to the congestion and limited subscription and mobile devices may have battery usage constraints. In this paper, we investigate the efficiency of offloading only some parts of the computations in DNNs to the cloud. We have formulated an optimal computation offloading framework for forward propagation in DNNs, which adapts to battery usage constraints on the mobile side and limited available resources on the cloud. Our simulation results show that our framework can achieve 1.42x on average and up to 3.07x speedup in the execution time on the mobile device. In addition, it results in 2.11x on average and up to 4.26x reduction in mobile energy consumption.","PeriodicalId":215940,"journal":{"name":"Proceedings of the 2018 on Great Lakes Symposium on VLSI","volume":"15 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-05-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"77","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2018 on Great Lakes Symposium on VLSI","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3194554.3194565","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 77

Abstract

In today's computing landscape, mobile devices are considered computationally weak, while large cloud servers can handle expensive workloads; intensive computing tasks are therefore typically offloaded to the cloud. Recent advances in learning techniques have enabled Deep Neural Networks (DNNs) to be deployed in a wide range of applications. Commercial speech-based intelligent personal assistants (IPAs) such as Apple's Siri, which employs a DNN as its recognition model, operate solely over the cloud. The cloud-only approach may require a large amount of data transfer between the cloud and the mobile device, while the mobile-only approach may lack performance efficiency. In addition, the cloud server may be slow at times due to congestion and limited subscription capacity, and mobile devices may face battery usage constraints. In this paper, we investigate the efficiency of offloading only some parts of the computation in DNNs to the cloud. We formulate an optimal computation offloading framework for forward propagation in DNNs that adapts to battery usage constraints on the mobile side and limited available resources on the cloud. Our simulation results show that our framework achieves a 1.42x average and up to 3.07x speedup in execution time on the mobile device, along with a 2.11x average and up to 4.26x reduction in mobile energy consumption.
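To make the partial-offloading idea concrete, the sketch below enumerates split points in a linear (chain-structured) DNN and picks the one that minimizes end-to-end latency under a mobile energy budget. This is a minimal illustration only, not the paper's actual framework (which handles a more general optimization): it assumes per-layer latency/energy profiles and activation sizes have already been measured, and all names (`Layer`, `best_split`, the parameters) are hypothetical.

```python
# Minimal sketch of layer-level DNN offloading for a chain of layers.
# All per-layer costs are assumed to come from offline profiling; the
# names and units here are illustrative, not from the paper.

from dataclasses import dataclass

@dataclass
class Layer:
    mobile_latency_ms: float   # forward-pass time for this layer on the mobile device
    cloud_latency_ms: float    # forward-pass time for this layer on the cloud server
    mobile_energy_mj: float    # energy to execute this layer on the mobile device
    output_bytes: int          # size of this layer's output activation

def best_split(layers, input_bytes, uplink_bytes_per_ms,
               tx_energy_mj_per_byte, energy_budget_mj=float("inf")):
    """Choose the split point k: run layers [0, k) on the mobile device,
    send the activation over the uplink, and run layers [k, n) on the cloud.
    k = 0 is cloud-only (ship the raw input); k = n is mobile-only (no transfer).
    Returns (latency_ms, k) for the fastest split that respects the mobile
    energy budget, or None if no split satisfies it."""
    best = None
    for k in range(len(layers) + 1):
        mobile_time = sum(l.mobile_latency_ms for l in layers[:k])
        cloud_time = sum(l.cloud_latency_ms for l in layers[k:])
        if k == len(layers):
            tx_bytes = 0                        # mobile-only: nothing to send
        elif k == 0:
            tx_bytes = input_bytes              # cloud-only: send the raw input
        else:
            tx_bytes = layers[k - 1].output_bytes
        tx_time = tx_bytes / uplink_bytes_per_ms
        mobile_energy = (sum(l.mobile_energy_mj for l in layers[:k])
                         + tx_bytes * tx_energy_mj_per_byte)
        if mobile_energy > energy_budget_mj:
            continue                            # violates the battery constraint
        latency = mobile_time + tx_time + cloud_time
        if best is None or latency < best[0]:
            best = (latency, k)
    return best
```

Because early convolutional layers often shrink the activation volume well below the raw input size, such a search frequently lands on an intermediate split rather than the cloud-only or mobile-only extremes, which is the intuition behind the speedup and energy numbers reported above.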