基于fpga的多尺度注意力目标检测的高效实现

2021 IEEE International Midwest Symposium on Circuits and Systems (MWSCAS) Pub Date : 2021-08-09 DOI:10.1109/MWSCAS47672.2021.9531732

M. Furuta, K. Ban, Daisuke Kobayashi, Tomoyuki Shibata

{"title":"基于fpga的多尺度注意力目标检测的高效实现","authors":"M. Furuta, K. Ban, Daisuke Kobayashi, Tomoyuki Shibata","doi":"10.1109/MWSCAS47672.2021.9531732","DOIUrl":null,"url":null,"abstract":"Convolutional neural network (CNN)-based object detection has become an important task in image pre-processing for security video surveillance cameras. Since CNNs require a large amount of computational power, the approach of FPGA-based implementation has emerged as a promising solution owing to its energy efficiency and processing speed. The single-shot multibox detector (SSD) is suitable for this kind of application, while reducing the CNN computational load need to achieve high detection accuracy is still an important challenge for FPGA design. This paper presents an FPGA accelerator for processing SSD object detection. The number of computations can be reduced by the proposed multi-scale spatial attention mechanism. To enhance the efficiency for hardware implementation of the CNN, we propose dynamic quantization on hardware and autonomous memory access control. The developed prototype based on SSD with multi-scale spatial attention mechanism implemented on XCZU7EV exhibits an object detection accuracy of 79.3% mean average precision on the PASCAL VOC dataset. The proposed design achieves high digital signal processor efficiency of 94% and good operation speed of 77.7 msec.","PeriodicalId":6792,"journal":{"name":"2021 IEEE International Midwest Symposium on Circuits and Systems (MWSCAS)","volume":"8 1","pages":"321-325"},"PeriodicalIF":0.0000,"publicationDate":"2021-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An Efficient Implementation of FPGA-based Object Detection Using Multi-scale Attention\",\"authors\":\"M. Furuta, K. Ban, Daisuke Kobayashi, Tomoyuki Shibata\",\"doi\":\"10.1109/MWSCAS47672.2021.9531732\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Convolutional neural network (CNN)-based object detection has become an important task in image pre-processing for security video surveillance cameras. Since CNNs require a large amount of computational power, the approach of FPGA-based implementation has emerged as a promising solution owing to its energy efficiency and processing speed. The single-shot multibox detector (SSD) is suitable for this kind of application, while reducing the CNN computational load need to achieve high detection accuracy is still an important challenge for FPGA design. This paper presents an FPGA accelerator for processing SSD object detection. The number of computations can be reduced by the proposed multi-scale spatial attention mechanism. To enhance the efficiency for hardware implementation of the CNN, we propose dynamic quantization on hardware and autonomous memory access control. The developed prototype based on SSD with multi-scale spatial attention mechanism implemented on XCZU7EV exhibits an object detection accuracy of 79.3% mean average precision on the PASCAL VOC dataset. The proposed design achieves high digital signal processor efficiency of 94% and good operation speed of 77.7 msec.\",\"PeriodicalId\":6792,\"journal\":{\"name\":\"2021 IEEE International Midwest Symposium on Circuits and Systems (MWSCAS)\",\"volume\":\"8 1\",\"pages\":\"321-325\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-08-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE International Midwest Symposium on Circuits and Systems (MWSCAS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/MWSCAS47672.2021.9531732\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE International Midwest Symposium on Circuits and Systems (MWSCAS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MWSCAS47672.2021.9531732","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

基于卷积神经网络(CNN)的目标检测已成为安防视频监控摄像机图像预处理中的一项重要任务。由于cnn需要大量的计算能力，基于fpga的实现方法由于其能源效率和处理速度而成为一种有前途的解决方案。单镜头多盒检测器(single-shot multibox detector, SSD)适合这种应用，而降低CNN计算负荷以实现高检测精度仍然是FPGA设计面临的重要挑战。提出了一种用于SSD目标检测的FPGA加速器。提出的多尺度空间注意机制可以减少计算量。为了提高CNN的硬件实现效率，我们提出了硬件动态量化和自主存储器访问控制。在XCZU7EV上实现的基于SSD的多尺度空间注意机制原型在PASCAL VOC数据集上的目标检测精度达到79.3%。本设计实现了高达94%的数字信号处理器效率和77.7 msec的良好运算速度。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

An Efficient Implementation of FPGA-based Object Detection Using Multi-scale Attention

Convolutional neural network (CNN)-based object detection has become an important task in image pre-processing for security video surveillance cameras. Since CNNs require a large amount of computational power, the approach of FPGA-based implementation has emerged as a promising solution owing to its energy efficiency and processing speed. The single-shot multibox detector (SSD) is suitable for this kind of application, while reducing the CNN computational load need to achieve high detection accuracy is still an important challenge for FPGA design. This paper presents an FPGA accelerator for processing SSD object detection. The number of computations can be reduced by the proposed multi-scale spatial attention mechanism. To enhance the efficiency for hardware implementation of the CNN, we propose dynamic quantization on hardware and autonomous memory access control. The developed prototype based on SSD with multi-scale spatial attention mechanism implemented on XCZU7EV exhibits an object detection accuracy of 79.3% mean average precision on the PASCAL VOC dataset. The proposed design achieves high digital signal processor efficiency of 94% and good operation speed of 77.7 msec.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2021 IEEE International Midwest Symposium on Circuits and Systems (MWSCAS)

自引率

0.00%

发文量