Development of an efficient method for object detection and localization in 3D space using RGBD cameras for autonomous systems

Nataliya Boyko
{"title":"Development of an efficient method for object detection and localization in 3D space using RGBD cameras for autonomous systems","authors":"Nataliya Boyko","doi":"10.1016/j.ijcce.2025.04.005","DOIUrl":null,"url":null,"abstract":"<div><div>The work presents an efficient algorithm for object detection, orientation estimation, and isometric positioning in 3D space using RGBD camera data. The goal of the study is to improve the accuracy and processing speed of autonomous navigation and manipulation systems under conditions of limited computational resources. The proposed approach combines heuristic isometry estimation with segmentation methods (DBSCAN), plane estimation (RANSAC), and orientation analysis, enabling effective processing of scenes with planar backgrounds. The main advantage of the algorithm lies in its ability to operate in real time: the processing time for a single frame is only 20 ms, achieving object positioning accuracy up to 5.48 cm. The results of experimental research confirm a high level of accuracy and stability even under challenging conditions. The algorithm outperforms existing models in terms of processing speed while demonstrating comparable or superior positioning accuracy. The practical significance of the proposed method lies in its potential application in mobile robotics, automated warehouse systems, and machine vision systems where high autonomy and precision are required. The algorithm can also be adapted to a broader range of tasks due to its flexible hyperparameter tuning. A key limitation remains the requirement for object placement on a planar surface and the use of a depth camera, which necessitates a specific environmental setup. The proposed method makes a significant contribution to the advancement of computer vision and autonomous robotics technologies, opening prospects for its implementation in next-generation systems.</div></div>","PeriodicalId":100694,"journal":{"name":"International Journal of Cognitive Computing in Engineering","volume":"6 ","pages":"Pages 537-551"},"PeriodicalIF":0.0000,"publicationDate":"2025-04-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Cognitive Computing in Engineering","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666307425000245","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

The work presents an efficient algorithm for object detection, orientation estimation, and isometric positioning in 3D space using RGBD camera data. The goal of the study is to improve the accuracy and processing speed of autonomous navigation and manipulation systems under conditions of limited computational resources. The proposed approach combines heuristic isometry estimation with segmentation methods (DBSCAN), plane estimation (RANSAC), and orientation analysis, enabling effective processing of scenes with planar backgrounds. The main advantage of the algorithm lies in its ability to operate in real time: the processing time for a single frame is only 20 ms, achieving object positioning accuracy up to 5.48 cm. The results of experimental research confirm a high level of accuracy and stability even under challenging conditions. The algorithm outperforms existing models in terms of processing speed while demonstrating comparable or superior positioning accuracy. The practical significance of the proposed method lies in its potential application in mobile robotics, automated warehouse systems, and machine vision systems where high autonomy and precision are required. The algorithm can also be adapted to a broader range of tasks due to its flexible hyperparameter tuning. A key limitation remains the requirement for object placement on a planar surface and the use of a depth camera, which necessitates a specific environmental setup. The proposed method makes a significant contribution to the advancement of computer vision and autonomous robotics technologies, opening prospects for its implementation in next-generation systems.
自主系统中使用RGBD相机在三维空间中进行目标检测和定位的有效方法
该工作提出了一种利用RGBD相机数据在三维空间中进行目标检测、方向估计和等距定位的有效算法。研究的目标是在有限的计算资源条件下提高自主导航和操纵系统的精度和处理速度。该方法将启发式等距估计与分割方法(DBSCAN)、平面估计(RANSAC)和方向分析相结合,能够有效地处理具有平面背景的场景。该算法的主要优势在于其实时性:单帧处理时间仅为20ms,目标定位精度高达5.48 cm。实验研究结果证实,即使在具有挑战性的条件下,也具有很高的准确性和稳定性。该算法在处理速度方面优于现有模型,同时显示出相当或更高的定位精度。该方法的实际意义在于其在移动机器人、自动化仓库系统和机器视觉系统等对自主性和精度要求较高的领域具有潜在的应用前景。由于其灵活的超参数调整,该算法还可以适应更广泛的任务范围。一个关键的限制仍然是物体放置在平面上的要求和深度相机的使用,这需要特定的环境设置。所提出的方法对计算机视觉和自主机器人技术的进步做出了重大贡献,为其在下一代系统中的实施开辟了前景。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
13.80
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信