FPGA-based Acceleration of Deep Neural Networks Using High Level Method

2015 10th International Conference on P2P, Parallel, Grid, Cloud and Internet Computing (3PGCIC) Pub Date : 2015-11-04 DOI:10.1109/3PGCIC.2015.103

Lei Liu, Jianlu Luo, Xiaoyan Deng, Sikun Li

引用次数: 4

Abstract

Deep neural network (DNN) is becoming more and more applied in data center applications such as speech recognition, image search, etc. However, the training in DNN is very time-consuming because of its deep structure. This paper presents FPGA-based acceleration of deep neural networks using a high level method and proposes a parallel optimizing strategy using the Kintex-7 FPGA board's features. Experimental results show that it can increase the utilization of FPGA computation units with low mini-batch size and reduce the transfer cost effectively. The optimized algorithm achieves up to 17.65x higher performance than CPU.

查看原文本刊更多论文

基于fpga的深度神经网络高级加速方法

深度神经网络(DNN)在语音识别、图像搜索等数据中心应用中得到越来越多的应用。然而，由于深度神经网络的深层结构，其训练非常耗时。本文提出了一种基于FPGA的深度神经网络的高级加速方法，并利用Kintex-7 FPGA板的特点提出了一种并行优化策略。实验结果表明，该方法可以有效地提高FPGA小批量计算单元的利用率，降低传输成本。优化后的算法性能比CPU高17.65倍。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2015 10th International Conference on P2P, Parallel, Grid, Cloud and Internet Computing (3PGCIC)

自引率

0.00%

发文量