Agile Catching with Whole-Body MPC and Blackbox Policy Learning

Conference on Learning for Dynamics & Control Pub Date : 2023-06-14 DOI:10.48550/arXiv.2306.08205

Saminda Abeyruwan, A. Bewley, Nicholas M. Boffi, K. Choromanski, David B. D'Ambrosio, Deepali Jain, P. Sanketi, A. Shankar, Vikas Sindhwani, Sumeet Singh, J. Slotine, Stephen Tu

引用次数: 1

Abstract

We address a benchmark task in agile robotics: catching objects thrown at high-speed. This is a challenging task that involves tracking, intercepting, and cradling a thrown object with access only to visual observations of the object and the proprioceptive state of the robot, all within a fraction of a second. We present the relative merits of two fundamentally different solution strategies: (i) Model Predictive Control using accelerated constrained trajectory optimization, and (ii) Reinforcement Learning using zeroth-order optimization. We provide insights into various performance trade-offs including sample efficiency, sim-to-real transfer, robustness to distribution shifts, and whole-body multimodality via extensive on-hardware experiments. We conclude with proposals on fusing"classical"and"learning-based"techniques for agile robot control. Videos of our experiments may be found at https://sites.google.com/view/agile-catching

查看原文本刊更多论文

敏捷捕获与全身MPC和黑盒策略学习

我们解决了敏捷机器人中的一个基准任务:捕捉高速抛出的物体。这是一项具有挑战性的任务，涉及跟踪，拦截和抱起投掷的物体，只能通过对物体的视觉观察和机器人的本体感觉状态，所有这些都在几分之一秒内完成。我们提出了两种根本不同的解决策略的相对优点:(i)使用加速约束轨迹优化的模型预测控制，以及(ii)使用零阶优化的强化学习。通过广泛的硬件实验，我们提供了各种性能权衡的见解，包括样本效率，模拟到真实的转移，对分布转移的鲁棒性和全身多模态。最后，我们提出了融合“经典”和“基于学习”的敏捷机器人控制技术的建议。我们的实验视频可以在https://sites.google.com/view/agile-catching上找到

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Conference on Learning for Dynamics & Control

自引率

0.00%

发文量