Design of a YOLO Model Accelerator Based on PYNQ Architecture

2022 International Conference on Machine Learning and Intelligent Systems Engineering (MLISE) Pub Date : 2022-08-01 DOI:10.1109/MLISE57402.2022.00011

Lin Wang, Tianyong Ao, Le Fu, Jian Liu, Yang Liu, Yingjie Zhou

{"title":"Design of a YOLO Model Accelerator Based on PYNQ Architecture","authors":"Lin Wang, Tianyong Ao, Le Fu, Jian Liu, Yang Liu, Yingjie Zhou","doi":"10.1109/MLISE57402.2022.00011","DOIUrl":null,"url":null,"abstract":"The application requirements of object detection models based on deep learning are very extensive. However, high computing power requirements often seriously restrict the application of these models on resource-constrained devices with high energy efficiency requirements. To address this problem, a YOLO model accelerator architecture is proposed based on PYNQ. Based on the FPGA hardware platform, the hardware accelerator is designed by making full use of pipeline, loop unrolling, data reordering and other methods to accelerate the computationally intensive units in the YOLOv2 model such as the convolution and pooling layers. In order to reduce the delay in the data transmission process, the multi-channel transmission architecture combined with the ping-pong buffer is designed, and block-by-block reading strategy is adopted to read the off-chip data. The proposed YOLO model accelerator has been implemented and verified on Xilinx PYNQ-z2 platform. The experimental results show that the system has high detection accuracy and far lower power consumption than CPU and GPU. It can also be deployed on mobile devices to detect the surrounding environment.","PeriodicalId":350291,"journal":{"name":"2022 International Conference on Machine Learning and Intelligent Systems Engineering (MLISE)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Machine Learning and Intelligent Systems Engineering (MLISE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MLISE57402.2022.00011","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

The application requirements of object detection models based on deep learning are very extensive. However, high computing power requirements often seriously restrict the application of these models on resource-constrained devices with high energy efficiency requirements. To address this problem, a YOLO model accelerator architecture is proposed based on PYNQ. Based on the FPGA hardware platform, the hardware accelerator is designed by making full use of pipeline, loop unrolling, data reordering and other methods to accelerate the computationally intensive units in the YOLOv2 model such as the convolution and pooling layers. In order to reduce the delay in the data transmission process, the multi-channel transmission architecture combined with the ping-pong buffer is designed, and block-by-block reading strategy is adopted to read the off-chip data. The proposed YOLO model accelerator has been implemented and verified on Xilinx PYNQ-z2 platform. The experimental results show that the system has high detection accuracy and far lower power consumption than CPU and GPU. It can also be deployed on mobile devices to detect the surrounding environment.

查看原文本刊更多论文

基于PYNQ体系结构的YOLO模型加速器设计

基于深度学习的目标检测模型的应用需求非常广泛。然而，高计算能力要求往往严重限制了这些模型在资源受限、能效要求高的设备上的应用。为了解决这个问题，提出了一种基于PYNQ的YOLO模型加速器体系结构。基于FPGA硬件平台，设计硬件加速器，充分利用流水线、循环展开、数据重排序等方法，对YOLOv2模型中卷积层、池化层等计算密集型单元进行加速。为了减少数据传输过程中的延迟，设计了结合乒乓缓冲器的多通道传输架构，并采用逐块读取策略读取片外数据。所提出的YOLO模型加速器已在Xilinx PYNQ-z2平台上实现并验证。实验结果表明，该系统具有较高的检测精度和远低于CPU和GPU的功耗。它也可以部署在移动设备上，用于检测周围环境。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2022 International Conference on Machine Learning and Intelligent Systems Engineering (MLISE)

自引率

0.00%

发文量