基于FPGA的卷积神经网络SoC系统设计

2020 International Conference on Communications, Information System and Computer Engineering (CISCE) Pub Date : 2020-07-01 DOI:10.1109/CISCE50729.2020.00098

Weizhen Lin, Lei Zhang

{"title":"基于FPGA的卷积神经网络SoC系统设计","authors":"Weizhen Lin, Lei Zhang","doi":"10.1109/CISCE50729.2020.00098","DOIUrl":null,"url":null,"abstract":"With the continuous development of neural network technology, it has been paid more and more attention in digital image processing. In this paper, the convolution neural network is designed on the programmable logic device (FPGA). Using the characteristics of the hardware circuit, the convolution kernel is implemented with the parallel data processing in the core and the parallel processing between the convolution cores. The double buffer is used to reduce the access to memory devices. At the same time, the characteristics of cyclic block and sparse matrix are used to optimize the network structure, improve the network speed and reduce the power consumption. ARM processor is used to preprocess the images and configure the corresponding registers to control the number of network layers of CNN network. The test results show that the recognition rate of handwritten numeral can reach 97% by using CNN accelerator based on FPGA. Meanwhile, the power consumption and speed are significantly improved, which meets the requirements of portable mobile devices.","PeriodicalId":101777,"journal":{"name":"2020 International Conference on Communications, Information System and Computer Engineering (CISCE)","volume":"48 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Design of convolutional neural network SoC system based on FPGA\",\"authors\":\"Weizhen Lin, Lei Zhang\",\"doi\":\"10.1109/CISCE50729.2020.00098\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the continuous development of neural network technology, it has been paid more and more attention in digital image processing. In this paper, the convolution neural network is designed on the programmable logic device (FPGA). Using the characteristics of the hardware circuit, the convolution kernel is implemented with the parallel data processing in the core and the parallel processing between the convolution cores. The double buffer is used to reduce the access to memory devices. At the same time, the characteristics of cyclic block and sparse matrix are used to optimize the network structure, improve the network speed and reduce the power consumption. ARM processor is used to preprocess the images and configure the corresponding registers to control the number of network layers of CNN network. The test results show that the recognition rate of handwritten numeral can reach 97% by using CNN accelerator based on FPGA. Meanwhile, the power consumption and speed are significantly improved, which meets the requirements of portable mobile devices.\",\"PeriodicalId\":101777,\"journal\":{\"name\":\"2020 International Conference on Communications, Information System and Computer Engineering (CISCE)\",\"volume\":\"48 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 International Conference on Communications, Information System and Computer Engineering (CISCE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CISCE50729.2020.00098\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Conference on Communications, Information System and Computer Engineering (CISCE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CISCE50729.2020.00098","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

摘要

随着神经网络技术的不断发展，它在数字图像处理中越来越受到重视。本文在可编程逻辑器件(FPGA)上设计了卷积神经网络。利用硬件电路的特点，实现了卷积核内并行数据处理和卷积核间并行处理。双缓冲区用于减少对存储器设备的访问。同时，利用循环块和稀疏矩阵的特性，优化网络结构，提高网络速度，降低功耗。采用ARM处理器对图像进行预处理，并配置相应的寄存器来控制CNN网络的网络层数。测试结果表明，采用基于FPGA的CNN加速器对手写体数字的识别率可达97%。同时，大大提高了功耗和速度，满足便携式移动设备的要求。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Design of convolutional neural network SoC system based on FPGA

With the continuous development of neural network technology, it has been paid more and more attention in digital image processing. In this paper, the convolution neural network is designed on the programmable logic device (FPGA). Using the characteristics of the hardware circuit, the convolution kernel is implemented with the parallel data processing in the core and the parallel processing between the convolution cores. The double buffer is used to reduce the access to memory devices. At the same time, the characteristics of cyclic block and sparse matrix are used to optimize the network structure, improve the network speed and reduce the power consumption. ARM processor is used to preprocess the images and configure the corresponding registers to control the number of network layers of CNN network. The test results show that the recognition rate of handwritten numeral can reach 97% by using CNN accelerator based on FPGA. Meanwhile, the power consumption and speed are significantly improved, which meets the requirements of portable mobile devices.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2020 International Conference on Communications, Information System and Computer Engineering (CISCE)

自引率

0.00%

发文量