Autonomous Self-Explanation of Behavior for Interactive Reinforcement Learning Agents

Proceedings of the 5th International Conference on Human Agent Interaction Pub Date : 2017-10-17 DOI:10.1145/3125739.3125746

Yosuke Fukuchi, Masahiko Osawa, H. Yamakawa, M. Imai

引用次数: 27

Abstract

In cooperation, the workers must know how co-workers behave. However, an agent's policy, which is embedded in a statistical machine learning model, is hard to understand, and requires much time and knowledge to comprehend. Therefore, it is difficult for people to predict the behavior of machine learning robots, which makes Human Robot Cooperation challenging. In this paper, we propose Instruction-based Behavior Explanation (IBE), a method to explain an autonomous agent's future behavior. In IBE, an agent can autonomously acquire the expressions to explain its own behavior by reusing the instructions given by a human expert to accelerate the learning of the agent's policy. IBE also enables a developmental agent, whose policy may change during the cooperation, to explain its own behavior with sufficient time granularity.

查看原文本刊更多论文

交互式强化学习智能体行为的自主自我解释

在合作中，员工必须知道同事的行为。然而，嵌入在统计机器学习模型中的代理策略很难理解，并且需要大量的时间和知识来理解。因此，人们很难预测机器学习机器人的行为，这给人机合作带来了挑战。在本文中，我们提出了基于指令的行为解释(IBE)，这是一种解释自主智能体未来行为的方法。在IBE中，智能体可以通过重用人类专家给出的指令来自主获取解释其自身行为的表达式，从而加速智能体策略的学习。IBE还使策略可能在合作过程中发生变化的发展性代理能够以足够的时间粒度解释自己的行为。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 5th International Conference on Human Agent Interaction

自引率

0.00%

发文量