{"title":"基于FPGA的卷积神经网络SoC系统设计","authors":"Weizhen Lin, Lei Zhang","doi":"10.1109/CISCE50729.2020.00098","DOIUrl":null,"url":null,"abstract":"With the continuous development of neural network technology, it has been paid more and more attention in digital image processing. In this paper, the convolution neural network is designed on the programmable logic device (FPGA). Using the characteristics of the hardware circuit, the convolution kernel is implemented with the parallel data processing in the core and the parallel processing between the convolution cores. The double buffer is used to reduce the access to memory devices. At the same time, the characteristics of cyclic block and sparse matrix are used to optimize the network structure, improve the network speed and reduce the power consumption. ARM processor is used to preprocess the images and configure the corresponding registers to control the number of network layers of CNN network. The test results show that the recognition rate of handwritten numeral can reach 97% by using CNN accelerator based on FPGA. Meanwhile, the power consumption and speed are significantly improved, which meets the requirements of portable mobile devices.","PeriodicalId":101777,"journal":{"name":"2020 International Conference on Communications, Information System and Computer Engineering (CISCE)","volume":"48 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Design of convolutional neural network SoC system based on FPGA\",\"authors\":\"Weizhen Lin, Lei Zhang\",\"doi\":\"10.1109/CISCE50729.2020.00098\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the continuous development of neural network technology, it has been paid more and more attention in digital image processing. In this paper, the convolution neural network is designed on the programmable logic device (FPGA). Using the characteristics of the hardware circuit, the convolution kernel is implemented with the parallel data processing in the core and the parallel processing between the convolution cores. The double buffer is used to reduce the access to memory devices. At the same time, the characteristics of cyclic block and sparse matrix are used to optimize the network structure, improve the network speed and reduce the power consumption. ARM processor is used to preprocess the images and configure the corresponding registers to control the number of network layers of CNN network. The test results show that the recognition rate of handwritten numeral can reach 97% by using CNN accelerator based on FPGA. Meanwhile, the power consumption and speed are significantly improved, which meets the requirements of portable mobile devices.\",\"PeriodicalId\":101777,\"journal\":{\"name\":\"2020 International Conference on Communications, Information System and Computer Engineering (CISCE)\",\"volume\":\"48 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 International Conference on Communications, Information System and Computer Engineering (CISCE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CISCE50729.2020.00098\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Conference on Communications, Information System and Computer Engineering (CISCE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CISCE50729.2020.00098","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Design of convolutional neural network SoC system based on FPGA
With the continuous development of neural network technology, it has been paid more and more attention in digital image processing. In this paper, the convolution neural network is designed on the programmable logic device (FPGA). Using the characteristics of the hardware circuit, the convolution kernel is implemented with the parallel data processing in the core and the parallel processing between the convolution cores. The double buffer is used to reduce the access to memory devices. At the same time, the characteristics of cyclic block and sparse matrix are used to optimize the network structure, improve the network speed and reduce the power consumption. ARM processor is used to preprocess the images and configure the corresponding registers to control the number of network layers of CNN network. The test results show that the recognition rate of handwritten numeral can reach 97% by using CNN accelerator based on FPGA. Meanwhile, the power consumption and speed are significantly improved, which meets the requirements of portable mobile devices.