Learning the rules of a game: Neural conditioning in human-robot interaction with delayed rewards

2013 IEEE Third Joint International Conference on Development and Learning and Epigenetic Robotics (ICDL) Pub Date : 2013-11-04 DOI:10.1109/DEVLRN.2013.6652572

Andrea Soltoggio, R. F. Reinhart, A. Lemme, Jochen J. Steil

引用次数: 11

Abstract

Learning in human-robot interaction, as well as in human-to-human situations, is characterised by noisy stimuli, variable timing of stimuli and actions, and delayed rewards. A recent model of neural learning, based on modulated plasticity, suggested the use of rare correlations and eligibility traces to model conditioning in real-world situations with uncertain timing. The current study tests neural learning with rare correlations in a human-robot realistic teaching scenario. The humanoid robot iCub learns the rules of the game rock-paper-scissors while playing with a human tutor. The feedback of the tutor is often delayed, missing, or at times even incorrect. Nevertheless, the neural system learns with great robustness and similar performance both in simulation and in robotic experiments. The results demonstrate the efficacy of the plasticity rule based on rare correlations in implementing robotic neural conditioning.

查看原文本刊更多论文

学习游戏规则:具有延迟奖励的人机交互中的神经调节

在人机交互以及人与人之间的情况下，学习的特点是有噪声的刺激、刺激和行动的可变时间以及延迟的奖励。最近，一个基于调节可塑性的神经学习模型提出，在不确定时间的现实情况下，使用罕见的相关性和资格痕迹来模拟条件反射。目前的研究在人机现实教学场景中测试了具有罕见相关性的神经学习。人形机器人iCub在与人类导师玩耍的过程中学会了剪刀石头布的游戏规则。导师的反馈往往是延迟的、缺失的，有时甚至是不正确的。尽管如此，该神经系统在仿真和机器人实验中都具有很强的鲁棒性和相似的学习性能。结果表明，基于稀有关联的可塑性规则在机器人神经调节中的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2013 IEEE Third Joint International Conference on Development and Learning and Epigenetic Robotics (ICDL)

自引率

0.00%

发文量