Guangju Wei, Yanzhao Hou, Qimei Cui, Gang Deng, Xiaofeng Tao, Y. Yao
{"title":"基于FPGA架构的YOLO加速","authors":"Guangju Wei, Yanzhao Hou, Qimei Cui, Gang Deng, Xiaofeng Tao, Y. Yao","doi":"10.1109/ICCCHINA.2018.8641256","DOIUrl":null,"url":null,"abstract":"In recent years, convolutional neural networks (CNN) have achieved breakthrough developments. However, with the spatial and time complexity of CNN gradually increasing, it becomes a great challenge to implement larger CNN for mobile applications. Accelerated platforms based on FPGAs are gradually being studied due to its advantages such as high performance, low power consumption, reconfigurability, etc. In this paper, we implement the YOLO platform based on the Zynq board (which is released by Xillinx) and optimize its architecture combined the FPGA feature. The use case of pedestrian and cars recognition is demonstrated in real time, which outperforms the traditional CPU-based YOLO networks.","PeriodicalId":170216,"journal":{"name":"2018 IEEE/CIC International Conference on Communications in China (ICCC)","volume":"28 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":"{\"title\":\"YOLO Acceleration using FPGA Architecture\",\"authors\":\"Guangju Wei, Yanzhao Hou, Qimei Cui, Gang Deng, Xiaofeng Tao, Y. Yao\",\"doi\":\"10.1109/ICCCHINA.2018.8641256\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In recent years, convolutional neural networks (CNN) have achieved breakthrough developments. However, with the spatial and time complexity of CNN gradually increasing, it becomes a great challenge to implement larger CNN for mobile applications. Accelerated platforms based on FPGAs are gradually being studied due to its advantages such as high performance, low power consumption, reconfigurability, etc. In this paper, we implement the YOLO platform based on the Zynq board (which is released by Xillinx) and optimize its architecture combined the FPGA feature. The use case of pedestrian and cars recognition is demonstrated in real time, which outperforms the traditional CPU-based YOLO networks.\",\"PeriodicalId\":170216,\"journal\":{\"name\":\"2018 IEEE/CIC International Conference on Communications in China (ICCC)\",\"volume\":\"28 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"8\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 IEEE/CIC International Conference on Communications in China (ICCC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCCHINA.2018.8641256\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE/CIC International Conference on Communications in China (ICCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCCHINA.2018.8641256","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
In recent years, convolutional neural networks (CNN) have achieved breakthrough developments. However, with the spatial and time complexity of CNN gradually increasing, it becomes a great challenge to implement larger CNN for mobile applications. Accelerated platforms based on FPGAs are gradually being studied due to its advantages such as high performance, low power consumption, reconfigurability, etc. In this paper, we implement the YOLO platform based on the Zynq board (which is released by Xillinx) and optimize its architecture combined the FPGA feature. The use case of pedestrian and cars recognition is demonstrated in real time, which outperforms the traditional CPU-based YOLO networks.