Efficient Compression Technique for NoC-based Deep Neural Network Accelerators
J. Lorandel, Habiba Lahdhiri, E. Bourdel, Salvatore Monteleone, M. Palesi
2020 23rd Euromicro Conference on Digital System Design (DSD), August 2020. DOI: 10.1109/DSD51259.2020.00037
Citations: 4
Abstract
Deep Neural Networks (DNNs) are powerful models that are widely used in many applications. However, such networks are computation- and memory-intensive, which makes their implementation difficult on hardware-constrained systems that may use a network-on-chip (NoC) as interconnect infrastructure. One way to reduce the traffic generated between memory and the processing elements is to compress the information before it is exchanged inside the network. In particular, our work focuses on reducing the huge number of DNN parameters, i.e., the weights. In this paper, we propose a flexible, low-complexity compression technique that preserves DNN performance, reducing both the memory footprint and the volume of data to be exchanged while requiring few hardware resources. The technique is evaluated on several DNN models, achieving a compression rate close to 80% without significant loss in accuracy on AlexNet, ResNet, or LeNet-5.
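The abstract only gives the high-level idea (compress the DNN weights before they traverse the NoC) and does not describe the actual algorithm. As a minimal sketch of how such a weight-compression step and its compression rate might be measured, the following assumes a simple magnitude-pruning plus sparse-encoding scheme; this is an illustrative stand-in, not the technique proposed in the paper, and all names (compress_weights, compression_rate, target_rate) are hypothetical.

```python
# Illustrative sketch only: the paper's compression algorithm is not specified in
# the abstract, so this uses magnitude-based pruning plus a (value, index) sparse
# encoding as a stand-in to show how a compression rate on DNN weights could be
# computed before sending them over an interconnect.
import numpy as np

def compress_weights(weights: np.ndarray, target_rate: float = 0.8):
    """Drop the smallest-magnitude entries so that roughly `target_rate` of the
    weights are removed, keeping only the surviving values and their indices."""
    flat = weights.ravel()
    k = int(len(flat) * (1.0 - target_rate))        # number of weights to keep
    keep_idx = np.argsort(np.abs(flat))[-k:]        # indices of the largest-magnitude weights
    values = flat[keep_idx].astype(np.float16)      # kept values, stored at reduced precision
    return values, keep_idx.astype(np.uint32), weights.shape

def compression_rate(original: np.ndarray, values: np.ndarray, indices: np.ndarray) -> float:
    """Fraction of bytes saved relative to the dense float32 representation."""
    original_bytes = original.size * 4               # dense float32 weights
    compressed_bytes = values.nbytes + indices.nbytes
    return 1.0 - compressed_bytes / original_bytes

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # A hypothetical fully connected layer's weight matrix.
    w = rng.normal(size=(512, 512)).astype(np.float32)
    vals, idx, shape = compress_weights(w, target_rate=0.9)
    print(f"compression rate: {compression_rate(w, vals, idx):.2%}")
```

In this sketch the rate reported is purely a storage/traffic metric (bytes saved versus the dense float32 weights); evaluating the accuracy impact on models such as AlexNet, ResNet, or LeNet-5, as the paper does, would additionally require reloading the pruned weights into the network and re-running inference.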