Adaptive deep learning model selection on embedded systems

Proceedings of the 19th ACM SIGPLAN/SIGBED International Conference on Languages, Compilers, and Tools for Embedded Systems Pub Date : 2018-06-19 DOI:10.1145/3211332.3211336

Ben Taylor, Vicent Sanz Marco, W. Wolff, Yehia El-khatib, Zheng Wang

{"title":"Adaptive deep learning model selection on embedded systems","authors":"Ben Taylor, Vicent Sanz Marco, W. Wolff, Yehia El-khatib, Zheng Wang","doi":"10.1145/3211332.3211336","DOIUrl":null,"url":null,"abstract":"The recent ground-breaking advances in deep learning networks (DNNs) make them attractive for embedded systems. However, it can take a long time for DNNs to make an inference on resource-limited embedded devices. Offloading the computation into the cloud is often infeasible due to privacy concerns, high latency, or the lack of connectivity. As such, there is a critical need to find a way to effectively execute the DNN models locally on the devices. This paper presents an adaptive scheme to determine which DNN model to use for a given input, by considering the desired accuracy and inference time. Our approach employs machine learning to develop a predictive model to quickly select a pre-trained DNN to use for a given input and the optimization constraint. We achieve this by first training off-line a predictive model, and then use the learnt model to select a DNN model to use for new, unseen inputs. We apply our approach to the image classification task and evaluate it on a Jetson TX2 embedded deep learning platform using the ImageNet ILSVRC 2012 validation dataset. We consider a range of influential DNN models. Experimental results show that our approach achieves a 7.52% improvement in inference accuracy, and a 1.8x reduction in inference time over the most-capable single DNN model.","PeriodicalId":258348,"journal":{"name":"Proceedings of the 19th ACM SIGPLAN/SIGBED International Conference on Languages, Compilers, and Tools for Embedded Systems","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-06-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"108","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 19th ACM SIGPLAN/SIGBED International Conference on Languages, Compilers, and Tools for Embedded Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3211332.3211336","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 108

Abstract

The recent ground-breaking advances in deep learning networks (DNNs) make them attractive for embedded systems. However, it can take a long time for DNNs to make an inference on resource-limited embedded devices. Offloading the computation into the cloud is often infeasible due to privacy concerns, high latency, or the lack of connectivity. As such, there is a critical need to find a way to effectively execute the DNN models locally on the devices. This paper presents an adaptive scheme to determine which DNN model to use for a given input, by considering the desired accuracy and inference time. Our approach employs machine learning to develop a predictive model to quickly select a pre-trained DNN to use for a given input and the optimization constraint. We achieve this by first training off-line a predictive model, and then use the learnt model to select a DNN model to use for new, unseen inputs. We apply our approach to the image classification task and evaluate it on a Jetson TX2 embedded deep learning platform using the ImageNet ILSVRC 2012 validation dataset. We consider a range of influential DNN models. Experimental results show that our approach achieves a 7.52% improvement in inference accuracy, and a 1.8x reduction in inference time over the most-capable single DNN model.

查看原文本刊更多论文

嵌入式系统的自适应深度学习模型选择

最近深度学习网络(dnn)的突破性进展使其对嵌入式系统具有吸引力。然而，dnn在资源有限的嵌入式设备上进行推理可能需要很长时间。由于隐私问题、高延迟或缺乏连接性，将计算卸载到云中通常是不可行的。因此，迫切需要找到一种在设备上有效执行DNN模型的方法。本文提出了一种自适应方案，通过考虑所需的精度和推理时间来确定对给定输入使用哪种深度神经网络模型。我们的方法采用机器学习来开发一个预测模型，以快速选择一个预训练的DNN来用于给定的输入和优化约束。我们首先通过离线训练一个预测模型来实现这一点，然后使用学习到的模型来选择一个DNN模型，用于新的、看不见的输入。我们将该方法应用于图像分类任务，并在Jetson TX2嵌入式深度学习平台上使用ImageNet ILSVRC 2012验证数据集对其进行评估。我们考虑了一系列有影响力的深度神经网络模型。实验结果表明，与最强大的单一DNN模型相比，我们的方法在推理精度上提高了7.52%，在推理时间上减少了1.8倍。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 19th ACM SIGPLAN/SIGBED International Conference on Languages, Compilers, and Tools for Embedded Systems

自引率

0.00%

发文量