N. Chervyakov, P. Lyakhov, M. Valueva, G. Valuev, D. Kaplun, G. Efimenko, D. V. Gnezdilov
{"title":"基于剩余数系统的极简卷积神经网络的面积高效FPGA实现","authors":"N. Chervyakov, P. Lyakhov, M. Valueva, G. Valuev, D. Kaplun, G. Efimenko, D. V. Gnezdilov","doi":"10.23919/FRUCT.2018.8588106","DOIUrl":null,"url":null,"abstract":"Convolutional Neural Networks (CNN) is the promising tool for solving task of image recognition in computer vision systems. However, the most known implementation of CNNs require a significant amount of memory for storing weights in training and work. To reduce the resource costs of CNN implementation we propose the architecture that separated on hardware and software parts for performance optimization. Also we propose to use Residue Number System (RNS) arithmetic in the hardware part which implements the convolutional layer of CNN. Software simulation using Matlab 2017b shows that CNN with a minimum number of layers can be quickly and successfully trained. Hardware simulation using FPGA Kintex7 xc7k70tfbg484-2 demonstrates that using RNS in convolutional layer of CNN allows to reduce hardware costs by 32% compared with the traditional approach based on the binary number system.","PeriodicalId":183812,"journal":{"name":"2018 23rd Conference of Open Innovations Association (FRUCT)","volume":"108 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Area-Efficient FPGA Implementation of Minimalistic Convolutional Neural Network Using Residue Number System\",\"authors\":\"N. Chervyakov, P. Lyakhov, M. Valueva, G. Valuev, D. Kaplun, G. Efimenko, D. V. Gnezdilov\",\"doi\":\"10.23919/FRUCT.2018.8588106\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Convolutional Neural Networks (CNN) is the promising tool for solving task of image recognition in computer vision systems. However, the most known implementation of CNNs require a significant amount of memory for storing weights in training and work. To reduce the resource costs of CNN implementation we propose the architecture that separated on hardware and software parts for performance optimization. Also we propose to use Residue Number System (RNS) arithmetic in the hardware part which implements the convolutional layer of CNN. Software simulation using Matlab 2017b shows that CNN with a minimum number of layers can be quickly and successfully trained. 
Hardware simulation using FPGA Kintex7 xc7k70tfbg484-2 demonstrates that using RNS in convolutional layer of CNN allows to reduce hardware costs by 32% compared with the traditional approach based on the binary number system.\",\"PeriodicalId\":183812,\"journal\":{\"name\":\"2018 23rd Conference of Open Innovations Association (FRUCT)\",\"volume\":\"108 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 23rd Conference of Open Innovations Association (FRUCT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.23919/FRUCT.2018.8588106\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 23rd Conference of Open Innovations Association (FRUCT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.23919/FRUCT.2018.8588106","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Area-Efficient FPGA Implementation of Minimalistic Convolutional Neural Network Using Residue Number System
Convolutional Neural Networks (CNNs) are a promising tool for solving image recognition tasks in computer vision systems. However, most known implementations of CNNs require a significant amount of memory for storing weights during training and inference. To reduce the resource costs of CNN implementation, we propose an architecture split into hardware and software parts for performance optimization. We also propose to use Residue Number System (RNS) arithmetic in the hardware part, which implements the convolutional layer of the CNN. Software simulation using Matlab 2017b shows that a CNN with a minimum number of layers can be trained quickly and successfully. Hardware simulation on an FPGA Kintex7 xc7k70tfbg484-2 demonstrates that using RNS in the convolutional layer of the CNN reduces hardware costs by 32% compared with the traditional approach based on the binary number system.
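The sketch below is only an illustration of the general RNS idea the abstract refers to, not the authors' FPGA design: the multiply-accumulate operations of a convolutional layer are carried out independently in each residue channel, and the final value is recovered with the Chinese Remainder Theorem. The moduli set, data ranges, and the conv2d_rns helper are assumptions chosen for the example.

```python
import numpy as np

# Assumed pairwise-coprime moduli for illustration; the paper's moduli set may differ.
MODULI = (251, 253, 255)
M = int(np.prod(MODULI))  # dynamic range of the RNS


def to_rns(x):
    """Encode an integer array as its residues modulo each modulus."""
    return [np.mod(x, m) for m in MODULI]


def from_rns(residues):
    """Recover the integer from its residues via the Chinese Remainder Theorem."""
    result = 0
    for r, m in zip(residues, MODULI):
        Mi = M // m
        inv = pow(Mi, -1, m)  # modular inverse of Mi modulo m
        result = (result + int(r) * Mi * inv) % M
    return result


def conv2d_rns(image, kernel):
    """2-D 'valid' convolution with all arithmetic done per residue channel."""
    img_rns = to_rns(image)
    ker_rns = to_rns(kernel)
    kh, kw = kernel.shape
    oh, ow = image.shape[0] - kh + 1, image.shape[1] - kw + 1
    out = np.zeros((oh, ow), dtype=np.int64)
    for i in range(oh):
        for j in range(ow):
            acc = []
            for c, m in enumerate(MODULI):
                patch = img_rns[c][i:i + kh, j:j + kw]
                acc.append(int(np.sum(patch * ker_rns[c]) % m))
            out[i, j] = from_rns(acc)
    return out


# Usage with small non-negative integer data (the sketch ignores negative
# values and quantization scaling, which a real hardware design must handle).
img = np.random.randint(0, 16, size=(5, 5))
ker = np.random.randint(0, 4, size=(3, 3))
ref = np.array([[np.sum(img[i:i + 3, j:j + 3] * ker) for j in range(3)]
                for i in range(3)])
assert np.array_equal(conv2d_rns(img, ker), ref)
```

The point of the per-modulus decomposition is that each residue channel works with short word lengths, which is what allows the hardware multipliers and adders in the convolutional layer to be smaller than their binary-number-system counterparts.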