Should We Learn Contact-Rich Manipulation Policies From Sampling-Based Planners?

IF 4.6 2区计算机科学 Q2 ROBOTICS

IEEE Robotics and Automation Letters Pub Date : 2025-04-28 DOI:10.1109/LRA.2025.3564701

Huaijiang Zhu;Tong Zhao;Xinpei Ni;Jiuguang Wang;Kuan Fang;Ludovic Righetti;Tao Pang

{"title":"Should We Learn Contact-Rich Manipulation Policies From Sampling-Based Planners?","authors":"Huaijiang Zhu;Tong Zhao;Xinpei Ni;Jiuguang Wang;Kuan Fang;Ludovic Righetti;Tao Pang","doi":"10.1109/LRA.2025.3564701","DOIUrl":null,"url":null,"abstract":"The tremendous success of behavior cloning (BC) in robotic manipulation has been largely confined to tasks where demonstrations can be effectively collected through human teleoperation. However, demonstrations for contact-rich manipulation tasks that require complex coordination of multiple contacts are difficult to collect due to the limitations of current teleoperation interfaces. We investigate how to leverage model-based planning and optimization to generate training data for contact-rich dexterous manipulation tasks. Our analysis reveals that popular sampling-based planners like rapidly exploring random tree (RRT), while efficient for motion planning, produce demonstrations with unfavorably high entropy. This motivates modifications to our data generation pipeline that prioritizes demonstration consistency while maintaining solution coverage. Combined with a diffusion-based goal-conditioned BC approach, our method enables effective policy learning and zero-shot transfer to hardware for two challenging contact-rich manipulation tasks.","PeriodicalId":13241,"journal":{"name":"IEEE Robotics and Automation Letters","volume":"10 6","pages":"6248-6255"},"PeriodicalIF":4.6000,"publicationDate":"2025-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Robotics and Automation Letters","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10977833/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ROBOTICS","Score":null,"Total":0}

引用次数: 0

Abstract

The tremendous success of behavior cloning (BC) in robotic manipulation has been largely confined to tasks where demonstrations can be effectively collected through human teleoperation. However, demonstrations for contact-rich manipulation tasks that require complex coordination of multiple contacts are difficult to collect due to the limitations of current teleoperation interfaces. We investigate how to leverage model-based planning and optimization to generate training data for contact-rich dexterous manipulation tasks. Our analysis reveals that popular sampling-based planners like rapidly exploring random tree (RRT), while efficient for motion planning, produce demonstrations with unfavorably high entropy. This motivates modifications to our data generation pipeline that prioritizes demonstration consistency while maintaining solution coverage. Combined with a diffusion-based goal-conditioned BC approach, our method enables effective policy learning and zero-shot transfer to hardware for two challenging contact-rich manipulation tasks.

查看原文本刊更多论文

我们应该从基于抽样的规划者那里学习接触丰富的操作策略吗？

行为克隆（BC）在机器人操作中的巨大成功主要局限于可以通过人类远程操作有效收集演示的任务。然而，由于当前远程操作接口的限制，需要多个触点复杂协调的富触点操作任务的演示很难收集。我们研究了如何利用基于模型的规划和优化来生成训练数据的接触丰富的灵巧操作任务。我们的分析表明，流行的基于抽样的规划方法，如快速探索随机树（RRT），虽然对运动规划有效，但会产生不利的高熵演示。这促使我们修改数据生成管道，在维护解决方案覆盖率的同时优先考虑演示一致性。结合基于扩散的目标条件BC方法，我们的方法能够有效地为两个具有挑战性的富接触操作任务提供策略学习和零射击转移到硬件。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEE Robotics and Automation Letters Computer Science-Computer Science Applications

CiteScore

9.60

自引率

15.40%

发文量

1428

期刊介绍： The scope of this journal is to publish peer-reviewed articles that provide a timely and concise account of innovative research ideas and application results, reporting significant theoretical findings and application case studies in areas of robotics and automation.