Continual Learning of Vacuum Grasps from Grasp Outcome for Unsupervised Domain Adaption

2022 2nd International Conference on Robotics, Automation and Artificial Intelligence (RAAI) Pub Date : 2022-12-09 DOI:10.1109/RAAI56146.2022.10092970

Maximilian Gilles, Vinzenz Rau

{"title":"Continual Learning of Vacuum Grasps from Grasp Outcome for Unsupervised Domain Adaption","authors":"Maximilian Gilles, Vinzenz Rau","doi":"10.1109/RAAI56146.2022.10092970","DOIUrl":null,"url":null,"abstract":"Training grasping robots in isolation can result in large performance gaps when deploying to real world applications. This problem gains in importance when synthetic data is used for training. To meet the desired performance for a specific use-case, fine-tuning the model's parameters to account for the persistent domain shift between training and application data is usually required. To speed up deployment time and reduce costs, a picking robot should be able to continually adapt to its new domain by incorporating knowledge generated during operation. The proposed method enables a robot to perform domain adaption from source domain to target domain data completely selfsupervised by continually adapting its model's weights to the new target domain, relying only on feedback about grasp success or failure. It is based on two core ideas: 1) extrapolation of the suctionable area around a conducted grasp based on local curvature analysis of sensor data, and 2) uncertainty-weighted knowledge distillation-based pseudo labels for ambiguous background pixels for which no information about graspability is available from the current experiment. Extensive sim-to-real experiments on the challenging MetaGraspNet dataset show that the proposed method improves grasp success rate in average by more than 13% on real world scenes compared to purely synthetic training data.","PeriodicalId":190255,"journal":{"name":"2022 2nd International Conference on Robotics, Automation and Artificial Intelligence (RAAI)","volume":"100 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 2nd International Conference on Robotics, Automation and Artificial Intelligence (RAAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/RAAI56146.2022.10092970","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Training grasping robots in isolation can result in large performance gaps when deploying to real world applications. This problem gains in importance when synthetic data is used for training. To meet the desired performance for a specific use-case, fine-tuning the model's parameters to account for the persistent domain shift between training and application data is usually required. To speed up deployment time and reduce costs, a picking robot should be able to continually adapt to its new domain by incorporating knowledge generated during operation. The proposed method enables a robot to perform domain adaption from source domain to target domain data completely selfsupervised by continually adapting its model's weights to the new target domain, relying only on feedback about grasp success or failure. It is based on two core ideas: 1) extrapolation of the suctionable area around a conducted grasp based on local curvature analysis of sensor data, and 2) uncertainty-weighted knowledge distillation-based pseudo labels for ambiguous background pixels for which no information about graspability is available from the current experiment. Extensive sim-to-real experiments on the challenging MetaGraspNet dataset show that the proposed method improves grasp success rate in average by more than 13% on real world scenes compared to purely synthetic training data.

查看原文本刊更多论文

无监督域自适应真空抓取结果的持续学习

孤立地训练抓取机器人在部署到实际应用程序时可能会导致巨大的性能差距。当使用合成数据进行训练时，这个问题变得更加重要。为了满足特定用例所需的性能，通常需要对模型的参数进行微调，以考虑训练数据和应用程序数据之间的持续领域转换。为了加快部署时间和降低成本，拾取机器人应该能够通过整合操作过程中产生的知识来不断适应新的领域。该方法使机器人能够完全自监督地完成从源域到目标域数据的域自适应，通过不断地将其模型的权值调整到新的目标域，而仅仅依赖于抓取成功或失败的反馈。它基于两个核心思想:1)基于传感器数据的局部曲率分析来推断进行抓取的可吸入区域;2)基于不确定性加权知识提取的伪标签，用于当前实验中无法获得可抓取性信息的模糊背景像素。在具有挑战性的MetaGraspNet数据集上进行的大量模拟到真实的实验表明，与纯合成训练数据相比，所提出的方法在真实场景上的抓取成功率平均提高了13%以上。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2022 2nd International Conference on Robotics, Automation and Artificial Intelligence (RAAI)

自引率

0.00%

发文量