Efficient online adaptation with stochastic recurrent neural networks

2017 IEEE-RAS 17th International Conference on Humanoid Robotics (Humanoids) Pub Date : 2017-11-01 DOI:10.1109/HUMANOIDS.2017.8246875

Daniel Tanneberg, Jan Peters, E. Rückert

引用次数: 3

Abstract

Autonomous robots need to interact with unknown and unstructured environments. For continuous online adaptation in lifelong learning scenarios, they need sample-efficient mechanisms to adapt to changing environments, constraints, tasks and capabilities. In this paper, we introduce a framework for online motion planning and adaptation based on a bio-inspired stochastic recurrent neural network. By using the intrinsic motivation signal cognitive dissonance with a mental replay strategy, the robot can learn from few physical interactions and can therefore adapt to novel environments in seconds. We evaluate our online planning and adaptation framework on a KUKA LWR arm. The efficient online adaptation is shown by learning unknown workspace constraints sample-efficient within few seconds while following given via points.

查看原文本刊更多论文

随机递归神经网络的有效在线自适应

自主机器人需要与未知和非结构化环境进行交互。为了在终身学习场景中持续在线适应，他们需要样本效率机制来适应不断变化的环境、约束、任务和能力。本文介绍了一种基于仿生随机递归神经网络的在线运动规划和自适应框架。通过使用内在动机信号认知失调和心理重放策略，机器人可以从很少的物理交互中学习，因此可以在几秒钟内适应新的环境。我们在KUKA LWR臂上评估我们的在线规划和适应框架。通过在几秒钟内学习未知的工作空间约束样本效率来显示有效的在线自适应。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2017 IEEE-RAS 17th International Conference on Humanoid Robotics (Humanoids)

自引率

0.00%

发文量