基于fpga的多尺度注意力目标检测的高效实现

M. Furuta, K. Ban, Daisuke Kobayashi, Tomoyuki Shibata
{"title":"基于fpga的多尺度注意力目标检测的高效实现","authors":"M. Furuta, K. Ban, Daisuke Kobayashi, Tomoyuki Shibata","doi":"10.1109/MWSCAS47672.2021.9531732","DOIUrl":null,"url":null,"abstract":"Convolutional neural network (CNN)-based object detection has become an important task in image pre-processing for security video surveillance cameras. Since CNNs require a large amount of computational power, the approach of FPGA-based implementation has emerged as a promising solution owing to its energy efficiency and processing speed. The single-shot multibox detector (SSD) is suitable for this kind of application, while reducing the CNN computational load need to achieve high detection accuracy is still an important challenge for FPGA design. This paper presents an FPGA accelerator for processing SSD object detection. The number of computations can be reduced by the proposed multi-scale spatial attention mechanism. To enhance the efficiency for hardware implementation of the CNN, we propose dynamic quantization on hardware and autonomous memory access control. The developed prototype based on SSD with multi-scale spatial attention mechanism implemented on XCZU7EV exhibits an object detection accuracy of 79.3% mean average precision on the PASCAL VOC dataset. The proposed design achieves high digital signal processor efficiency of 94% and good operation speed of 77.7 msec.","PeriodicalId":6792,"journal":{"name":"2021 IEEE International Midwest Symposium on Circuits and Systems (MWSCAS)","volume":"8 1","pages":"321-325"},"PeriodicalIF":0.0000,"publicationDate":"2021-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An Efficient Implementation of FPGA-based Object Detection Using Multi-scale Attention\",\"authors\":\"M. Furuta, K. Ban, Daisuke Kobayashi, Tomoyuki Shibata\",\"doi\":\"10.1109/MWSCAS47672.2021.9531732\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Convolutional neural network (CNN)-based object detection has become an important task in image pre-processing for security video surveillance cameras. Since CNNs require a large amount of computational power, the approach of FPGA-based implementation has emerged as a promising solution owing to its energy efficiency and processing speed. The single-shot multibox detector (SSD) is suitable for this kind of application, while reducing the CNN computational load need to achieve high detection accuracy is still an important challenge for FPGA design. This paper presents an FPGA accelerator for processing SSD object detection. The number of computations can be reduced by the proposed multi-scale spatial attention mechanism. To enhance the efficiency for hardware implementation of the CNN, we propose dynamic quantization on hardware and autonomous memory access control. The developed prototype based on SSD with multi-scale spatial attention mechanism implemented on XCZU7EV exhibits an object detection accuracy of 79.3% mean average precision on the PASCAL VOC dataset. The proposed design achieves high digital signal processor efficiency of 94% and good operation speed of 77.7 msec.\",\"PeriodicalId\":6792,\"journal\":{\"name\":\"2021 IEEE International Midwest Symposium on Circuits and Systems (MWSCAS)\",\"volume\":\"8 1\",\"pages\":\"321-325\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-08-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE International Midwest Symposium on Circuits and Systems (MWSCAS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/MWSCAS47672.2021.9531732\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE International Midwest Symposium on Circuits and Systems (MWSCAS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MWSCAS47672.2021.9531732","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

基于卷积神经网络(CNN)的目标检测已成为安防视频监控摄像机图像预处理中的一项重要任务。由于cnn需要大量的计算能力,基于fpga的实现方法由于其能源效率和处理速度而成为一种有前途的解决方案。单镜头多盒检测器(single-shot multibox detector, SSD)适合这种应用,而降低CNN计算负荷以实现高检测精度仍然是FPGA设计面临的重要挑战。提出了一种用于SSD目标检测的FPGA加速器。提出的多尺度空间注意机制可以减少计算量。为了提高CNN的硬件实现效率,我们提出了硬件动态量化和自主存储器访问控制。在XCZU7EV上实现的基于SSD的多尺度空间注意机制原型在PASCAL VOC数据集上的目标检测精度达到79.3%。本设计实现了高达94%的数字信号处理器效率和77.7 msec的良好运算速度。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
An Efficient Implementation of FPGA-based Object Detection Using Multi-scale Attention
Convolutional neural network (CNN)-based object detection has become an important task in image pre-processing for security video surveillance cameras. Since CNNs require a large amount of computational power, the approach of FPGA-based implementation has emerged as a promising solution owing to its energy efficiency and processing speed. The single-shot multibox detector (SSD) is suitable for this kind of application, while reducing the CNN computational load need to achieve high detection accuracy is still an important challenge for FPGA design. This paper presents an FPGA accelerator for processing SSD object detection. The number of computations can be reduced by the proposed multi-scale spatial attention mechanism. To enhance the efficiency for hardware implementation of the CNN, we propose dynamic quantization on hardware and autonomous memory access control. The developed prototype based on SSD with multi-scale spatial attention mechanism implemented on XCZU7EV exhibits an object detection accuracy of 79.3% mean average precision on the PASCAL VOC dataset. The proposed design achieves high digital signal processor efficiency of 94% and good operation speed of 77.7 msec.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信