Modular Neural Networks for Low-Power Image Classification on Embedded Devices

ACM Transactions on Design Automation of Electronic Systems (TODAES) Pub Date : 2020-10-15 DOI:10.1145/3408062

Abhinav Goel, Sara Aghajanzadeh, Caleb Tung, Shuo-Han Chen, G. Thiruvathukal, Yung-Hsiang Lu

{"title":"Modular Neural Networks for Low-Power Image Classification on Embedded Devices","authors":"Abhinav Goel, Sara Aghajanzadeh, Caleb Tung, Shuo-Han Chen, G. Thiruvathukal, Yung-Hsiang Lu","doi":"10.1145/3408062","DOIUrl":null,"url":null,"abstract":"Embedded devices are generally small, battery-powered computers with limited hardware resources. It is difficult to run deep neural networks (DNNs) on these devices, because DNNs perform millions of operations and consume significant amounts of energy. Prior research has shown that a considerable number of a DNN’s memory accesses and computation are redundant when performing tasks like image classification. To reduce this redundancy and thereby reduce the energy consumption of DNNs, we introduce the Modular Neural Network Tree architecture. Instead of using one large DNN for the classifier, this architecture uses multiple smaller DNNs (called modules) to progressively classify images into groups of categories based on a novel visual similarity metric. Once a group of categories is selected by a module, another module then continues to distinguish among the similar categories within the selected group. This process is repeated over multiple modules until we are left with a single category. The computation needed to distinguish dissimilar groups is avoided, thus reducing redundant operations, memory accesses, and energy. Experimental results using several image datasets reveal the effectiveness of our proposed solution to reduce memory requirements by 50% to 99%, inference time by 55% to 95%, energy consumption by 52% to 94%, and the number of operations by 15% to 99% when compared with existing DNN architectures, running on two different embedded systems: Raspberry Pi 3 and Raspberry Pi Zero.","PeriodicalId":6933,"journal":{"name":"ACM Transactions on Design Automation of Electronic Systems (TODAES)","volume":"4 1","pages":"1 - 35"},"PeriodicalIF":0.0000,"publicationDate":"2020-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"18","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM Transactions on Design Automation of Electronic Systems (TODAES)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3408062","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 18

Abstract

Embedded devices are generally small, battery-powered computers with limited hardware resources. It is difficult to run deep neural networks (DNNs) on these devices, because DNNs perform millions of operations and consume significant amounts of energy. Prior research has shown that a considerable number of a DNN’s memory accesses and computation are redundant when performing tasks like image classification. To reduce this redundancy and thereby reduce the energy consumption of DNNs, we introduce the Modular Neural Network Tree architecture. Instead of using one large DNN for the classifier, this architecture uses multiple smaller DNNs (called modules) to progressively classify images into groups of categories based on a novel visual similarity metric. Once a group of categories is selected by a module, another module then continues to distinguish among the similar categories within the selected group. This process is repeated over multiple modules until we are left with a single category. The computation needed to distinguish dissimilar groups is avoided, thus reducing redundant operations, memory accesses, and energy. Experimental results using several image datasets reveal the effectiveness of our proposed solution to reduce memory requirements by 50% to 99%, inference time by 55% to 95%, energy consumption by 52% to 94%, and the number of operations by 15% to 99% when compared with existing DNN architectures, running on two different embedded systems: Raspberry Pi 3 and Raspberry Pi Zero.

查看原文本刊更多论文

基于模块化神经网络的嵌入式设备低功耗图像分类

嵌入式设备通常是小型的、由电池供电的计算机，硬件资源有限。在这些设备上运行深度神经网络(dnn)是很困难的，因为dnn要执行数百万次操作并消耗大量的能量。先前的研究表明，在执行图像分类等任务时，DNN的相当数量的内存访问和计算是冗余的。为了减少这种冗余，从而降低dnn的能量消耗，我们引入了模块化神经网络树架构。该架构使用多个较小的DNN(称为模块)，而不是使用一个大型DNN作为分类器，根据新的视觉相似性度量将图像逐步分类为类别组。一旦一个模块选择了一组类别，那么另一个模块将继续在所选组中的类似类别之间进行区分。这个过程在多个模块上重复，直到我们只剩下一个类别。避免了区分不同组所需的计算，从而减少了冗余操作、内存访问和能量。使用多个图像数据集的实验结果表明，与现有的DNN架构相比，我们提出的解决方案在两种不同的嵌入式系统(树莓派3和树莓派零)上运行时，内存需求降低了50%到99%，推理时间降低了55%到95%，能耗降低了52%到94%，操作次数减少了15%到99%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

ACM Transactions on Design Automation of Electronic Systems (TODAES)

自引率

0.00%

发文量