基于FPGA的卷积神经网络SoC系统设计

Weizhen Lin, Lei Zhang
{"title":"基于FPGA的卷积神经网络SoC系统设计","authors":"Weizhen Lin, Lei Zhang","doi":"10.1109/CISCE50729.2020.00098","DOIUrl":null,"url":null,"abstract":"With the continuous development of neural network technology, it has been paid more and more attention in digital image processing. In this paper, the convolution neural network is designed on the programmable logic device (FPGA). Using the characteristics of the hardware circuit, the convolution kernel is implemented with the parallel data processing in the core and the parallel processing between the convolution cores. The double buffer is used to reduce the access to memory devices. At the same time, the characteristics of cyclic block and sparse matrix are used to optimize the network structure, improve the network speed and reduce the power consumption. ARM processor is used to preprocess the images and configure the corresponding registers to control the number of network layers of CNN network. The test results show that the recognition rate of handwritten numeral can reach 97% by using CNN accelerator based on FPGA. Meanwhile, the power consumption and speed are significantly improved, which meets the requirements of portable mobile devices.","PeriodicalId":101777,"journal":{"name":"2020 International Conference on Communications, Information System and Computer Engineering (CISCE)","volume":"48 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Design of convolutional neural network SoC system based on FPGA\",\"authors\":\"Weizhen Lin, Lei Zhang\",\"doi\":\"10.1109/CISCE50729.2020.00098\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the continuous development of neural network technology, it has been paid more and more attention in digital image processing. In this paper, the convolution neural network is designed on the programmable logic device (FPGA). Using the characteristics of the hardware circuit, the convolution kernel is implemented with the parallel data processing in the core and the parallel processing between the convolution cores. The double buffer is used to reduce the access to memory devices. At the same time, the characteristics of cyclic block and sparse matrix are used to optimize the network structure, improve the network speed and reduce the power consumption. ARM processor is used to preprocess the images and configure the corresponding registers to control the number of network layers of CNN network. The test results show that the recognition rate of handwritten numeral can reach 97% by using CNN accelerator based on FPGA. Meanwhile, the power consumption and speed are significantly improved, which meets the requirements of portable mobile devices.\",\"PeriodicalId\":101777,\"journal\":{\"name\":\"2020 International Conference on Communications, Information System and Computer Engineering (CISCE)\",\"volume\":\"48 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 International Conference on Communications, Information System and Computer Engineering (CISCE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CISCE50729.2020.00098\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Conference on Communications, Information System and Computer Engineering (CISCE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CISCE50729.2020.00098","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

摘要

随着神经网络技术的不断发展,它在数字图像处理中越来越受到重视。本文在可编程逻辑器件(FPGA)上设计了卷积神经网络。利用硬件电路的特点,实现了卷积核内并行数据处理和卷积核间并行处理。双缓冲区用于减少对存储器设备的访问。同时,利用循环块和稀疏矩阵的特性,优化网络结构,提高网络速度,降低功耗。采用ARM处理器对图像进行预处理,并配置相应的寄存器来控制CNN网络的网络层数。测试结果表明,采用基于FPGA的CNN加速器对手写体数字的识别率可达97%。同时,大大提高了功耗和速度,满足便携式移动设备的要求。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Design of convolutional neural network SoC system based on FPGA
With the continuous development of neural network technology, it has been paid more and more attention in digital image processing. In this paper, the convolution neural network is designed on the programmable logic device (FPGA). Using the characteristics of the hardware circuit, the convolution kernel is implemented with the parallel data processing in the core and the parallel processing between the convolution cores. The double buffer is used to reduce the access to memory devices. At the same time, the characteristics of cyclic block and sparse matrix are used to optimize the network structure, improve the network speed and reduce the power consumption. ARM processor is used to preprocess the images and configure the corresponding registers to control the number of network layers of CNN network. The test results show that the recognition rate of handwritten numeral can reach 97% by using CNN accelerator based on FPGA. Meanwhile, the power consumption and speed are significantly improved, which meets the requirements of portable mobile devices.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信