{"title":"基于体素的实时嵌入式系统三维目标检测方法效率分析","authors":"Illia Oleksiienko, A. Iosifidis","doi":"10.1109/ICETCI51973.2021.9574075","DOIUrl":null,"url":null,"abstract":"Real-time detection of objects in the 3D scene is one of the tasks an autonomous agent needs to perform for understanding its surroundings. While recent Deep Learning-based solutions achieve satisfactory performance, their high computational cost renders their application in real-life settings in which computations need to be performed on embedded platforms intractable. In this paper, we analyze the efficiency of two popular voxel-based 3D object detection methods providing a good compromise between high performance and speed based on two aspects, their ability to detect objects located at large distances from the agent and their ability to operate in real time on embedded platforms equipped with high-performance GPUs. Our experiments show that these methods mostly fail to detect distant small objects due to the sparsity of the input point clouds at large distances. Moreover, models trained on near objects achieve similar or better performance compared to those trained on all objects in the scene. This means that the models learn object appearance representations mostly from near objects. Our findings suggest that a considerable part of the computations of existing methods is focused on locations of the scene that do not contribute with successful detection. This means that the methods can achieve a speed-up of 40–60% by restricting operation to near objects while not sacrificing much in performance.","PeriodicalId":281877,"journal":{"name":"2021 International Conference on Emerging Techniques in Computational Intelligence (ICETCI)","volume":"100 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-05-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Analysis of voxel-based 3D object detection methods efficiency for real-time embedded systems\",\"authors\":\"Illia Oleksiienko, A. Iosifidis\",\"doi\":\"10.1109/ICETCI51973.2021.9574075\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Real-time detection of objects in the 3D scene is one of the tasks an autonomous agent needs to perform for understanding its surroundings. While recent Deep Learning-based solutions achieve satisfactory performance, their high computational cost renders their application in real-life settings in which computations need to be performed on embedded platforms intractable. In this paper, we analyze the efficiency of two popular voxel-based 3D object detection methods providing a good compromise between high performance and speed based on two aspects, their ability to detect objects located at large distances from the agent and their ability to operate in real time on embedded platforms equipped with high-performance GPUs. Our experiments show that these methods mostly fail to detect distant small objects due to the sparsity of the input point clouds at large distances. Moreover, models trained on near objects achieve similar or better performance compared to those trained on all objects in the scene. This means that the models learn object appearance representations mostly from near objects. Our findings suggest that a considerable part of the computations of existing methods is focused on locations of the scene that do not contribute with successful detection. 
This means that the methods can achieve a speed-up of 40–60% by restricting operation to near objects while not sacrificing much in performance.\",\"PeriodicalId\":281877,\"journal\":{\"name\":\"2021 International Conference on Emerging Techniques in Computational Intelligence (ICETCI)\",\"volume\":\"100 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-05-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 International Conference on Emerging Techniques in Computational Intelligence (ICETCI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICETCI51973.2021.9574075\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Emerging Techniques in Computational Intelligence (ICETCI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICETCI51973.2021.9574075","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Analysis of voxel-based 3D object detection methods efficiency for real-time embedded systems
Real-time detection of objects in a 3D scene is one of the tasks an autonomous agent needs to perform to understand its surroundings. While recent Deep Learning-based solutions achieve satisfactory performance, their high computational cost makes them intractable in real-life settings where computations need to be performed on embedded platforms. In this paper, we analyze the efficiency of two popular voxel-based 3D object detection methods that provide a good compromise between performance and speed, based on two aspects: their ability to detect objects located at large distances from the agent, and their ability to operate in real time on embedded platforms equipped with high-performance GPUs. Our experiments show that these methods mostly fail to detect distant small objects due to the sparsity of the input point clouds at large distances. Moreover, models trained only on near objects achieve similar or better performance compared to those trained on all objects in the scene, which indicates that the models learn object appearance representations mostly from near objects. Our findings suggest that a considerable part of the computation of existing methods is spent on locations of the scene that do not contribute to successful detection. This means that the methods can achieve a speed-up of 40–60% by restricting operation to near objects, without sacrificing much performance.
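To make the range restriction described above concrete, the sketch below shows one way it could be applied as a preprocessing step: the input LiDAR point cloud is cropped to a maximum distance from the sensor before voxelization, so the detector only processes the near region. The 40 m threshold, the `crop_point_cloud` helper, and the KITTI-style `[x, y, z, intensity]` layout are illustrative assumptions for this sketch, not details taken from the paper.

```python
import numpy as np

def crop_point_cloud(points: np.ndarray, max_range: float = 40.0) -> np.ndarray:
    """Keep only points within max_range metres of the sensor in the x-y plane.

    points: (N, 4) array of [x, y, z, intensity], as in KITTI-style LiDAR scans.
    """
    # Horizontal distance of each point from the sensor origin.
    distances = np.linalg.norm(points[:, :2], axis=1)
    return points[distances <= max_range]

# Hypothetical usage with a KITTI-style binary scan:
# raw = np.fromfile("000000.bin", dtype=np.float32).reshape(-1, 4)
# near = crop_point_cloud(raw, max_range=40.0)
# detections = detector(near)  # `detector` stands in for a voxel-based model;
#                              # fewer points means fewer non-empty voxels to process.
```

Because voxel-based pipelines only process non-empty voxels and the regions of the feature map they cover, shrinking the operating range in this way reduces the work done in both the voxelization and detection stages, which is consistent with the reported 40–60% speed-up when operation is restricted to near objects.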