Weakly supervised 6D pose estimation for robotic grasping
Yaoxin Li, Jin Sun, Xiaoqian Li, Zhanpeng Zhang, Hui Cheng, Xiaogang Wang
Proceedings of the 16th ACM SIGGRAPH International Conference on Virtual-Reality Continuum and its Applications in Industry, 2018-12-02
DOI: 10.1145/3284398.3284408
Citations: 1
Abstract
Learning-based robotic grasping methods have achieved substantial progress with the development of deep neural networks. However, the requirement for large-scale real-world training data limits the applicability of these methods. Given the 3D models of the target objects, we propose a new learning-based grasping approach built on 6D object pose estimation from a monocular RGB image. We aim to leverage both a large-scale synthesized 6D object pose dataset and a small-scale, weakly labeled real-world dataset (e.g., labeled only with the number of objects in each image) to reduce the difficulty of system deployment. In particular, the deep network combines the 6D pose estimation task with an auxiliary task on the weak labels to perform knowledge transfer between the synthesized and real-world data. We demonstrate the effectiveness of the method in a real robotic environment and show a substantial improvement in the successful grasping rate (about 11.9% on average) attributable to the proposed knowledge transfer scheme.
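The abstract describes a joint objective: full 6D-pose supervision on synthesized images, plus an auxiliary weak-label (object-count) loss on real images that transfers knowledge across the domain gap. A minimal sketch of such a combined loss is shown below; the function names, the squared-error counting loss, and the weighting term are illustrative assumptions, not details from the paper.

```python
# Hypothetical sketch of the joint training objective: a supervised
# 6D-pose loss computed only on synthesized data, plus a weighted
# auxiliary counting loss computed only on weakly labeled real data.
# All names and the weighting scheme are assumptions for illustration.

def pose_loss(pred_pose, gt_pose):
    """Mean absolute error over the 6 pose parameters
    (3 translation + 3 rotation)."""
    return sum(abs(p - g) for p, g in zip(pred_pose, gt_pose)) / len(gt_pose)

def count_loss(pred_count, gt_count):
    """Squared error on the weakly labeled object count."""
    return (pred_count - gt_count) ** 2

def joint_loss(synth_batch, real_batch, weight=0.5):
    """Combined objective: pose supervision from synthesized samples,
    weak counting supervision from real samples. `weight` balances
    the auxiliary task against the main pose task."""
    l_pose = sum(pose_loss(p, g) for p, g in synth_batch) / len(synth_batch)
    l_count = sum(count_loss(p, g) for p, g in real_batch) / len(real_batch)
    return l_pose + weight * l_count
```

In this formulation the real images never need pose annotations; sharing a backbone between the two heads is what lets the cheap counting signal regularize the pose estimator on real data.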