6D Object Pose Estimation by Visual Descriptor

Proceedings of the 2020 2nd International Conference on Robotics, Intelligent Control and Artificial Intelligence. Pub Date: 2020-10-17. DOI: 10.1145/3438872.3439095

Qi-Wei Sun, Samuel Cheng

Abstract

An essential component of object pose estimation is extracting object features in a suitable representation. For symmetric objects and for smooth objects that lack texture, pose estimation results are unsatisfactory because their feature information is difficult to extract and represent. This work introduces a new method that represents object features by constructing pixel-level visual descriptors and performs 6D pose estimation from an RGB-D image. Compared with plain RGB images, RGB-D images provide richer information, and image descriptors built from RGB-D images can extract and represent object features more effectively. To speed up refinement, we refine the estimated pose with a network rather than with ICP (Iterative Closest Point). The proposed architecture improves results on the YCB-Video dataset, especially for symmetric objects and other categories whose poses have previously been difficult to regress.
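As a rough illustration of two terms used in the abstract, the sketch below shows how the depth channel of an RGB-D image gives every pixel a 3D position, and what applying a 6D pose (a 3D rotation plus a 3D translation) to object model points means. It is not the authors' method: the function names `backproject` and `apply_pose` are hypothetical, and it assumes a standard pinhole camera with intrinsics `fx`, `fy`, `cx`, `cy`, which the abstract does not specify.

```python
import numpy as np

def backproject(depth, fx, fy, cx, cy):
    """Convert an HxW depth map into an HxWx3 array of camera-frame 3D points.

    Assumes a pinhole camera model; fx, fy, cx, cy are hypothetical intrinsics.
    """
    h, w = depth.shape
    # u is the column index and v the row index of each pixel.
    u, v = np.meshgrid(np.arange(w), np.arange(h))
    x = (u - cx) * depth / fx
    y = (v - cy) * depth / fy
    return np.stack([x, y, depth], axis=-1)

def apply_pose(points, R, t):
    """Apply a 6D pose (3x3 rotation R, length-3 translation t) to Nx3 points."""
    return points @ R.T + t

# Usage: a toy 2x2 depth map and a 90-degree rotation about the z axis.
depth = np.ones((2, 2))
cloud = backproject(depth, fx=1.0, fy=1.0, cx=0.0, cy=0.0)
R = np.array([[0.0, -1.0, 0.0],
              [1.0,  0.0, 0.0],
              [0.0,  0.0, 1.0]])
t = np.array([0.0, 0.0, 0.5])
moved = apply_pose(np.array([[1.0, 0.0, 0.0]]), R, t)
```

A per-pixel descriptor pipeline would attach a feature vector to each of these back-projected points; how the paper builds and uses those descriptors is not described in the abstract, so it is left out here.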
