[DL] A Survey of FPGA-based Neural Network Inference Accelerators

ACM Transactions on Reconfigurable Technology and Systems (TRETS) Pub Date : 2019-03-28 DOI:10.1145/3289185

Kaiyuan Guo, Shulin Zeng, Jincheng Yu, Yu Wang, Huazhong Yang

{"title":"[DL] A Survey of FPGA-based Neural Network Inference Accelerators","authors":"Kaiyuan Guo, Shulin Zeng, Jincheng Yu, Yu Wang, Huazhong Yang","doi":"10.1145/3289185","DOIUrl":null,"url":null,"abstract":"Recent research on neural networks has shown a significant advantage in machine learning over traditional algorithms based on handcrafted features and models. Neural networks are now widely adopted in regions like image, speech, and video recognition. But the high computation and storage complexity of neural network inference poses great difficulty on its application. It is difficult for CPU platforms to offer enough computation capacity. GPU platforms are the first choice for neural network processes because of its high computation capacity and easy-to-use development frameworks. However, FPGA-based neural network inference accelerator is becoming a research topic. With specifically designed hardware, FPGA is the next possible solution to surpass GPU in speed and energy efficiency. Various FPGA-based accelerator designs have been proposed with software and hardware optimization techniques to achieve high speed and energy efficiency. In this article, we give an overview of previous work on neural network inference accelerators based on FPGA and summarize the main techniques used. An investigation from software to hardware, from circuit level to system level is carried out to complete analysis of FPGA-based neural network inference accelerator design and serves as a guide to future work.","PeriodicalId":162787,"journal":{"name":"ACM Transactions on Reconfigurable Technology and Systems (TRETS)","volume":"929 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-03-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"149","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM Transactions on Reconfigurable Technology and Systems (TRETS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3289185","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 149

Abstract

Recent research on neural networks has shown a significant advantage in machine learning over traditional algorithms based on handcrafted features and models. Neural networks are now widely adopted in regions like image, speech, and video recognition. But the high computation and storage complexity of neural network inference poses great difficulty on its application. It is difficult for CPU platforms to offer enough computation capacity. GPU platforms are the first choice for neural network processes because of its high computation capacity and easy-to-use development frameworks. However, FPGA-based neural network inference accelerator is becoming a research topic. With specifically designed hardware, FPGA is the next possible solution to surpass GPU in speed and energy efficiency. Various FPGA-based accelerator designs have been proposed with software and hardware optimization techniques to achieve high speed and energy efficiency. In this article, we give an overview of previous work on neural network inference accelerators based on FPGA and summarize the main techniques used. An investigation from software to hardware, from circuit level to system level is carried out to complete analysis of FPGA-based neural network inference accelerator design and serves as a guide to future work.

查看原文本刊更多论文

[DL]基于fpga的神经网络推理加速器综述

最近对神经网络的研究表明，与基于手工制作的特征和模型的传统算法相比，机器学习具有显著的优势。神经网络现在被广泛应用于图像、语音和视频识别等领域。但是神经网络推理的高计算复杂度和存储复杂度给其应用带来了很大的困难。CPU平台很难提供足够的计算能力。GPU平台以其高计算能力和易于使用的开发框架成为神经网络处理的首选平台。然而，基于fpga的神经网络推理加速器正在成为一个研究课题。有了专门设计的硬件，FPGA是下一个可能在速度和能效方面超越GPU的解决方案。各种基于fpga的加速器设计已经提出了软件和硬件优化技术，以实现高速度和高能效。在本文中，我们概述了基于FPGA的神经网络推理加速器的前期工作，并总结了所使用的主要技术。从软件到硬件，从电路级到系统级，对基于fpga的神经网络推理加速器设计进行了全面的分析，对今后的工作具有指导意义。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

ACM Transactions on Reconfigurable Technology and Systems (TRETS)

自引率

0.00%

发文量