A hardware architecture for real-time object detection using depth and edge information

C. Kyrkou, Christos Ttofis, T. Theocharides
*ACM Transactions on Embedded Computing Systems*, December 2013. DOI: 10.1145/2539036.2539050. Citations: 17.

Abstract

Emerging embedded 3D vision systems for robotics and security applications utilize object detection to perform video analysis, enabling them to intelligently interact with their host environment and take appropriate actions. Such systems demand high performance and high detection accuracy while requiring low energy consumption, especially in embedded mobile systems. However, object detection involves a large image search space, primarily because an object may appear at many different sizes, which makes it difficult to meet these demands. These constraints can therefore be addressed by reducing the search space involved in object detection. To this end, this article proposes a depth- and edge-accelerated search method and a dedicated hardware architecture that implements it, providing an efficient platform for generic real-time object detection. The hardware integration of depth and edge processing mechanisms with a support vector machine classification core on an FPGA platform results in significant speed-ups and improved detection accuracy. The proposed architecture was evaluated using images of various sizes, with results indicating that it achieves real-time frame rates across image sizes (271 fps for 320 × 240, 42 fps for 640 × 480, and 23 fps for 800 × 600) compared to existing works, while reducing the false-positive rate by 52%.
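The core idea of the accelerated search, as described above, is to prune the sliding-window search space before the expensive SVM classification step, using cheap depth and edge cues. The following is a minimal software sketch of that pruning step only; the window size, stride, and the `min_edge_frac` / `max_depth_var` thresholds are illustrative assumptions, not values from the paper, and the actual system implements this in FPGA hardware rather than Python.

```python
import numpy as np

def candidate_windows(depth, edges, win=32, stride=16,
                      min_edge_frac=0.05, max_depth_var=0.5):
    """Keep only windows whose edge density and depth uniformity
    suggest a plausible object; only these would be forwarded to
    the SVM classifier. Thresholds are illustrative assumptions."""
    h, w = depth.shape
    kept, total = [], 0
    for y in range(0, h - win + 1, stride):
        for x in range(0, w - win + 1, stride):
            total += 1
            e = edges[y:y + win, x:x + win]
            d = depth[y:y + win, x:x + win]
            # Enough edge content and a roughly uniform depth patch.
            if e.mean() >= min_edge_frac and d.var() <= max_depth_var:
                kept.append((x, y))
    return kept, total

# Toy frame: flat depth, edges only inside one "object" region.
depth = np.ones((96, 96))
edges = np.zeros((96, 96))
edges[32:64, 32:64] = 1.0
kept, total = candidate_windows(depth, edges)
print(f"classify {len(kept)} of {total} windows")  # prints "classify 9 of 25 windows"
```

Here only the windows overlapping the edge-rich region survive (9 of 25), which mirrors how the paper's depth/edge preprocessing shrinks the classifier's workload and, as a side effect, suppresses false positives in texture-free background regions.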