Eugenio Chisari, Nick Heppert, Max Argus, Tim Welschehold, Thomas Brox, Abhinav Valada
{"title":"利用条件流匹配从点云学习机器人操纵策略","authors":"Eugenio Chisari, Nick Heppert, Max Argus, Tim Welschehold, Thomas Brox, Abhinav Valada","doi":"arxiv-2409.07343","DOIUrl":null,"url":null,"abstract":"Learning from expert demonstrations is a promising approach for training\nrobotic manipulation policies from limited data. However, imitation learning\nalgorithms require a number of design choices ranging from the input modality,\ntraining objective, and 6-DoF end-effector pose representation. Diffusion-based\nmethods have gained popularity as they enable predicting long-horizon\ntrajectories and handle multimodal action distributions. Recently, Conditional\nFlow Matching (CFM) (or Rectified Flow) has been proposed as a more flexible\ngeneralization of diffusion models. In this paper, we investigate the\napplication of CFM in the context of robotic policy learning and specifically\nstudy the interplay with the other design choices required to build an\nimitation learning algorithm. We show that CFM gives the best performance when\ncombined with point cloud input observations. Additionally, we study the\nfeasibility of a CFM formulation on the SO(3) manifold and evaluate its\nsuitability with a simplified example. We perform extensive experiments on\nRLBench which demonstrate that our proposed PointFlowMatch approach achieves a\nstate-of-the-art average success rate of 67.8% over eight tasks, double the\nperformance of the next best method.","PeriodicalId":501031,"journal":{"name":"arXiv - CS - Robotics","volume":"109 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Learning Robotic Manipulation Policies from Point Clouds with Conditional Flow Matching\",\"authors\":\"Eugenio Chisari, Nick Heppert, Max Argus, Tim Welschehold, Thomas Brox, Abhinav Valada\",\"doi\":\"arxiv-2409.07343\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Learning from expert demonstrations is a promising approach for training\\nrobotic manipulation policies from limited data. However, imitation learning\\nalgorithms require a number of design choices ranging from the input modality,\\ntraining objective, and 6-DoF end-effector pose representation. Diffusion-based\\nmethods have gained popularity as they enable predicting long-horizon\\ntrajectories and handle multimodal action distributions. Recently, Conditional\\nFlow Matching (CFM) (or Rectified Flow) has been proposed as a more flexible\\ngeneralization of diffusion models. In this paper, we investigate the\\napplication of CFM in the context of robotic policy learning and specifically\\nstudy the interplay with the other design choices required to build an\\nimitation learning algorithm. We show that CFM gives the best performance when\\ncombined with point cloud input observations. Additionally, we study the\\nfeasibility of a CFM formulation on the SO(3) manifold and evaluate its\\nsuitability with a simplified example. We perform extensive experiments on\\nRLBench which demonstrate that our proposed PointFlowMatch approach achieves a\\nstate-of-the-art average success rate of 67.8% over eight tasks, double the\\nperformance of the next best method.\",\"PeriodicalId\":501031,\"journal\":{\"name\":\"arXiv - CS - Robotics\",\"volume\":\"109 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-09-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - CS - Robotics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2409.07343\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Robotics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.07343","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Learning Robotic Manipulation Policies from Point Clouds with Conditional Flow Matching
Learning from expert demonstrations is a promising approach for training
robotic manipulation policies from limited data. However, imitation learning
algorithms require a number of design choices ranging from the input modality,
training objective, and 6-DoF end-effector pose representation. Diffusion-based
methods have gained popularity as they enable predicting long-horizon
trajectories and handle multimodal action distributions. Recently, Conditional
Flow Matching (CFM) (or Rectified Flow) has been proposed as a more flexible
generalization of diffusion models. In this paper, we investigate the
application of CFM in the context of robotic policy learning and specifically
study the interplay with the other design choices required to build an
imitation learning algorithm. We show that CFM gives the best performance when
combined with point cloud input observations. Additionally, we study the
feasibility of a CFM formulation on the SO(3) manifold and evaluate its
suitability with a simplified example. We perform extensive experiments on
RLBench which demonstrate that our proposed PointFlowMatch approach achieves a
state-of-the-art average success rate of 67.8% over eight tasks, double the
performance of the next best method.