Approximate Learning and Fault-Tolerant Mapping for Energy-Efficient Neuromorphic Systems

A. Gebregiorgis, M. Tahoori
Journal: ACM Transactions on Design Automation of Electronic Systems (TODAES), pp. 1-23
DOI: 10.1145/3436491
Published: 2020-12-30 (Journal Article)
Citations: 2

Abstract

Brain-inspired deep neural networks such as the Convolutional Neural Network (CNN) have shown great potential in solving difficult cognitive problems such as object recognition and classification. However, such architectures have high computational energy demand and sensitivity to variation effects, making them inapplicable to energy-constrained embedded learning platforms. To address this issue, we propose a learning and mapping approach that utilizes approximate computing during early design phases to build a layer-wise pruning and fault-tolerant weight mapping scheme for reliable and energy-efficient CNNs. In the proposed approach, an approximate CNN is first prepared by layer-wise pruning of approximable neurons, i.e., those with high error-tolerance margins, using a two-level approximate learning methodology. Then, the pruned network is retrained to improve its accuracy by fine-tuning the weight values. Finally, a fault-tolerant layer-wise neural weight mapping scheme is adopted that aggressively reduces the memory operating voltage when loading the weights of error-resilient layers, improving energy efficiency. Thus, the combination of approximate learning and fault-tolerance-aware memory voltage downscaling enables us to implement a robust and energy-efficient approximate inference engine for CNN applications. Simulation results show that the proposed fault-tolerant and approximate learning approach can improve the energy efficiency of CNN inference engines by more than 50% with less than 5% reduction in classification accuracy. Additionally, more than 26% energy saving is achieved by the proposed layer-wise mapping-based cache memory operating voltage down-scaling.
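The two key ideas in the abstract, pruning neurons whose error-tolerance margins are high, and loading the weights of error-resilient layers at a reduced cache voltage, can be illustrated with a minimal sketch. All layer names, margin values, thresholds, and voltages below are hypothetical placeholders, not values from the paper; the energy model simply uses the standard approximation that dynamic energy scales with the square of the supply voltage.

```python
import random

random.seed(0)

# Hypothetical per-neuron error-tolerance margins for a small CNN.
# Layer names and sizes are illustrative, not from the paper.
layers = {
    "conv1": [random.random() for _ in range(32)],
    "conv2": [random.random() for _ in range(64)],
    "fc":    [random.random() for _ in range(10)],
}

PRUNE_THRESHOLD = 0.8  # neurons tolerating > 0.8 error are "approximable"

def keep_mask(margins, threshold):
    """True for neurons that are kept; approximable (high-margin) neurons are pruned."""
    return [m <= threshold for m in margins]

masks = {name: keep_mask(m, PRUNE_THRESHOLD) for name, m in layers.items()}
for name, mask in masks.items():
    print(f"{name}: kept {sum(mask)}/{len(mask)} neurons")

# Cache dynamic energy scales roughly with V^2, so loading the weights of
# error-resilient layers at a reduced voltage saves energy at some fault risk.
V_NOM = 1.0
layer_voltage = {"conv1": 1.0, "conv2": 0.85, "fc": 0.8}  # illustrative

def relative_energy(voltages, nominal=V_NOM):
    """Per-layer weight-load energy relative to nominal-voltage operation."""
    return {name: (v / nominal) ** 2 for name, v in voltages.items()}

print(relative_energy(layer_voltage))
```

In the paper's flow, the pruned network would then be retrained to recover accuracy, and the per-layer voltages would be chosen from each layer's measured fault tolerance rather than fixed by hand as here.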