Accelerating Sparse Deep Neural Networks on FPGAs

2019 IEEE High Performance Extreme Computing Conference (HPEC) Pub Date : 2019-09-01 DOI:10.1109/HPEC.2019.8916419

Sitao Huang, Carl Pearson, R. Nagi, Jinjun Xiong, Deming Chen, Wen-mei W. Hwu

引用次数: 17

Abstract

Deep neural networks (DNNs) have been widely adopted in many domains, including computer vision, natural language processing, and medical care. Recent research reveals that sparsity in DNN parameters can be exploited to reduce inference computational complexity and improve network quality. However, sparsity also introduces irregularity and extra complexity in data processing, which make the accelerator design challenging. This work presents the design and implementation of a highly flexible sparse DNN inference accelerator on FPGA. Our proposed inference engine can be easily configured to be used in both mobile computing and high-performance computing scenarios. Evaluation shows our proposed inference engine effectively accelerates sparse DNNs and outperforms CPU solution by up to 4.7 $\times$ in terms of energy efficiency.

查看原文本刊更多论文

fpga上的稀疏深度神经网络加速

深度神经网络已广泛应用于计算机视觉、自然语言处理和医疗保健等领域。最近的研究表明，深度神经网络参数的稀疏性可以用来降低推理计算复杂度和提高网络质量。然而，稀疏性也在数据处理中引入了不规则性和额外的复杂性，这给加速器的设计带来了挑战。本文在FPGA上设计并实现了一种高度灵活的稀疏DNN推理加速器。我们提出的推理引擎可以很容易地配置为用于移动计算和高性能计算场景。评估表明，我们提出的推理引擎有效地加速了稀疏dnn，并且在能源效率方面优于CPU解决方案高达4.7倍。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2019 IEEE High Performance Extreme Computing Conference (HPEC)

自引率

0.00%

发文量