Guangju Wei, Yanzhao Hou, Qimei Cui, Gang Deng, Xiaofeng Tao, Y. Yao
{"title":"YOLO Acceleration using FPGA Architecture","authors":"Guangju Wei, Yanzhao Hou, Qimei Cui, Gang Deng, Xiaofeng Tao, Y. Yao","doi":"10.1109/ICCCHINA.2018.8641256","DOIUrl":null,"url":null,"abstract":"In recent years, convolutional neural networks (CNN) have achieved breakthrough developments. However, with the spatial and time complexity of CNN gradually increasing, it becomes a great challenge to implement larger CNN for mobile applications. Accelerated platforms based on FPGAs are gradually being studied due to its advantages such as high performance, low power consumption, reconfigurability, etc. In this paper, we implement the YOLO platform based on the Zynq board (which is released by Xillinx) and optimize its architecture combined the FPGA feature. The use case of pedestrian and cars recognition is demonstrated in real time, which outperforms the traditional CPU-based YOLO networks.","PeriodicalId":170216,"journal":{"name":"2018 IEEE/CIC International Conference on Communications in China (ICCC)","volume":"28 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE/CIC International Conference on Communications in China (ICCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCCHINA.2018.8641256","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 8
Abstract
In recent years, convolutional neural networks (CNN) have achieved breakthrough developments. However, with the spatial and time complexity of CNN gradually increasing, it becomes a great challenge to implement larger CNN for mobile applications. Accelerated platforms based on FPGAs are gradually being studied due to its advantages such as high performance, low power consumption, reconfigurability, etc. In this paper, we implement the YOLO platform based on the Zynq board (which is released by Xillinx) and optimize its architecture combined the FPGA feature. The use case of pedestrian and cars recognition is demonstrated in real time, which outperforms the traditional CPU-based YOLO networks.