{"title":"CNN for object recognition implementation on FPGA using PYNQ framework","authors":"Meriam Dhouibi, A. K. B. Salem, S. B. Saoud","doi":"10.1109/ComNet47917.2020.9306094","DOIUrl":null,"url":null,"abstract":"Object recognition is one of the most researched and commercialized applications of Deep Learning (DL) where Convolutional Neural Networks (CNNs) are especially accurate. The deployment of these models on embedded systems require low latency and high performance even with limited resources and energy budgets. Embedded systems with Zynq Systems on Chips (SoCs) are attractive platforms for CNNs. In this paper, we use PYNQ framework, that supports a Python-based hardware/software codesign environment to perform CNN inference for object recognition on Xilinx FPGA. We design the CNN model and train it on a CPU platform and we implement it On ZedBoard FPGA. By using only, a single ARM processor core on FPGA, we achieve 100ms latency and up to 10 image recognitions per second on the CIFAR-10 dataset with 79.90% accuracy. This model performance can be highly improved by exploring the hardware resources of the FPGA chip.","PeriodicalId":351664,"journal":{"name":"2020 IEEE Eighth International Conference on Communications and Networking (ComNet)","volume":"27 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-10-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE Eighth International Conference on Communications and Networking (ComNet)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ComNet47917.2020.9306094","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Object recognition is one of the most researched and commercialized applications of Deep Learning (DL) where Convolutional Neural Networks (CNNs) are especially accurate. The deployment of these models on embedded systems require low latency and high performance even with limited resources and energy budgets. Embedded systems with Zynq Systems on Chips (SoCs) are attractive platforms for CNNs. In this paper, we use PYNQ framework, that supports a Python-based hardware/software codesign environment to perform CNN inference for object recognition on Xilinx FPGA. We design the CNN model and train it on a CPU platform and we implement it On ZedBoard FPGA. By using only, a single ARM processor core on FPGA, we achieve 100ms latency and up to 10 image recognitions per second on the CIFAR-10 dataset with 79.90% accuracy. This model performance can be highly improved by exploring the hardware resources of the FPGA chip.