基于FPGA的可重构收缩锥体神经元块CNN加速

Hossam O. Ahmed, M. Ghoneima, M. Dessouky
{"title":"基于FPGA的可重构收缩锥体神经元块CNN加速","authors":"Hossam O. Ahmed, M. Ghoneima, M. Dessouky","doi":"10.1109/ICSET51301.2020.9265388","DOIUrl":null,"url":null,"abstract":"This paper presents a Configurable Systolic-based Pyramidal Neuron (CSPN) unit that can be used to accelerate the different Deep Neural Network (DNN) algorithms. The proposed CSPN unit is suggested to be one of the new embedded blocks that could replace the conventionally embedded blocks, such as the DSP blocks, in the silicon fabric architectures of the Field Programmable Gate Array (FPGA) chips in order to enhance their computational performance for accelerating the different DNN-based systems. The design of the proposed CSPN unit is fully optimized for Deep Learning (DL) algorithms, especially for the Convolutional Neural Networks (CNN) using VHSIC Hardware Description Language (VHDL). The proposed CSPN unit consists of four main stages: the Systolic-based Multiplier Array Grid, Parallel Signed Adder unit, RELU Activation Function Unit, and the Fixed-Point Configuration Unit. Each single CSPN unit can achieve a computational throughput up to 6.22 Giga Operation per Second (GOPS) using the high-density Stratix V FPGAs for a 3×3 kernel filter input case.","PeriodicalId":299530,"journal":{"name":"2020 IEEE 10th International Conference on System Engineering and Technology (ICSET)","volume":"372 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Reconfigurable Systolic-based Pyramidal Neuron Block for CNN Acceleration on FPGA\",\"authors\":\"Hossam O. Ahmed, M. Ghoneima, M. Dessouky\",\"doi\":\"10.1109/ICSET51301.2020.9265388\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper presents a Configurable Systolic-based Pyramidal Neuron (CSPN) unit that can be used to accelerate the different Deep Neural Network (DNN) algorithms. The proposed CSPN unit is suggested to be one of the new embedded blocks that could replace the conventionally embedded blocks, such as the DSP blocks, in the silicon fabric architectures of the Field Programmable Gate Array (FPGA) chips in order to enhance their computational performance for accelerating the different DNN-based systems. The design of the proposed CSPN unit is fully optimized for Deep Learning (DL) algorithms, especially for the Convolutional Neural Networks (CNN) using VHSIC Hardware Description Language (VHDL). The proposed CSPN unit consists of four main stages: the Systolic-based Multiplier Array Grid, Parallel Signed Adder unit, RELU Activation Function Unit, and the Fixed-Point Configuration Unit. Each single CSPN unit can achieve a computational throughput up to 6.22 Giga Operation per Second (GOPS) using the high-density Stratix V FPGAs for a 3×3 kernel filter input case.\",\"PeriodicalId\":299530,\"journal\":{\"name\":\"2020 IEEE 10th International Conference on System Engineering and Technology (ICSET)\",\"volume\":\"372 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-11-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 IEEE 10th International Conference on System Engineering and Technology (ICSET)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICSET51301.2020.9265388\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE 10th International Conference on System Engineering and Technology (ICSET)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSET51301.2020.9265388","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

本文提出了一种可配置的基于收缩的锥体神经元(CSPN)单元,可用于加速不同的深度神经网络(DNN)算法。本文提出的CSPN单元是一种新的嵌入式模块,可以取代现场可编程门阵列(FPGA)芯片硅结构架构中的传统嵌入式模块,如DSP模块,以提高其计算性能,加速不同的基于dnn的系统。所提出的CSPN单元的设计针对深度学习(DL)算法进行了充分优化,特别是针对使用VHSIC硬件描述语言(VHDL)的卷积神经网络(CNN)。提出的CSPN单元由四个主要阶段组成:基于收缩的乘法器阵列网格、并行符号加法器单元、RELU激活函数单元和定点配置单元。每个单个CSPN单元可以实现高达6.22千兆运算每秒(GOPS)的计算吞吐量,使用高密度Stratix V fpga用于3×3内核滤波器输入情况。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Reconfigurable Systolic-based Pyramidal Neuron Block for CNN Acceleration on FPGA
This paper presents a Configurable Systolic-based Pyramidal Neuron (CSPN) unit that can be used to accelerate the different Deep Neural Network (DNN) algorithms. The proposed CSPN unit is suggested to be one of the new embedded blocks that could replace the conventionally embedded blocks, such as the DSP blocks, in the silicon fabric architectures of the Field Programmable Gate Array (FPGA) chips in order to enhance their computational performance for accelerating the different DNN-based systems. The design of the proposed CSPN unit is fully optimized for Deep Learning (DL) algorithms, especially for the Convolutional Neural Networks (CNN) using VHSIC Hardware Description Language (VHDL). The proposed CSPN unit consists of four main stages: the Systolic-based Multiplier Array Grid, Parallel Signed Adder unit, RELU Activation Function Unit, and the Fixed-Point Configuration Unit. Each single CSPN unit can achieve a computational throughput up to 6.22 Giga Operation per Second (GOPS) using the high-density Stratix V FPGAs for a 3×3 kernel filter input case.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信