Multi-channel precision-sparsity-adapted inter-frame differential data codec for video neural network processor

Proceedings of the ACM/IEEE International Symposium on Low Power Electronics and Design Pub Date : 2020-08-10 DOI:10.1145/3370748.3407002

Yixiong Yang, Zhe Yuan, Fang Su, Fanyang Cheng, Zhuqing Yuan, Huazhong Yang, Yongpan Liu

引用次数: 0

Abstract

Activation I/O traffic is a critical bottleneck of video neural network processor. Recent works adopted an inter-frame difference method to reduce activation size. However, current methods can't fully adapt to the various precision and sparsity in differential data. In this paper, we propose the multi-channel precision-sparsity-adapted codec, which will separate the differential activation and encode activation in multiple channels. We analyze the most adapted encoding of each channel, and select the optimal channel number with the best performance. A two-channel codec hardware has been implemented in the ASIC accelerator, which can encode/decode activations in parallel. Experiment results show that our coding achieves 2.2x-18.2x compression rate in three scenarios with no accuracy loss, and the hardware has 42x/174x improvement on speed and energy-efficiency compared with the software codec.

查看原文本刊更多论文

面向视频神经网络处理器的多通道精度稀疏化帧间差分数据编解码器

激活I/O流量是视频神经网络处理器的关键瓶颈。最近的研究采用帧间差分法来减小激活尺寸。然而，现有的方法不能完全适应差分数据的精度和稀疏性。本文提出了一种多通道精确稀疏化编解码器，该编解码器将多通道中的差分激活和编码激活分离开来。我们分析了每个信道最适合的编码方式，并选择了性能最好的最优信道数。在ASIC加速器中实现了双通道编解码器硬件，可以并行编码/解码激活。实验结果表明，我们的编码在三种场景下的压缩率均达到2.2x-18.2x，且没有精度损失，与软件编解码器相比，硬件编解码器的速度和能效提高了42倍/174倍。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the ACM/IEEE International Symposium on Low Power Electronics and Design

自引率

0.00%

发文量