Path-finding using reinforcement learning and affective states

The 23rd IEEE International Symposium on Robot and Human Interactive Communication Pub Date : 2014-10-20 DOI:10.1109/ROMAN.2014.6926309

Johannes Feldmaier, K. Diepold

引用次数: 10

Abstract

During decision making and acting in the environment humans appraise decisions and observations with feelings and emotions. In this paper we propose a framework to incorporate an emotional model into the decision making process of a machine learning agent. We use a hierarchical structure to combine reinforcement learning with a dimensional emotional model. The dimensional model calculates two dimensions representing the actual affective state of the autonomous agent. For the evaluation of this combination, we use a reinforcement learning experiment (called Dyna Maze) in which, the agent has to find an optimal path through a maze. Our first results show that the agent is able to appraise the situation in terms of emotions and react according to them.

查看原文本刊更多论文

使用强化学习和情感状态的寻路

在做出决策和在环境中行动的过程中，人类用感觉和情绪来评估决策和观察。在本文中，我们提出了一个框架，将情感模型纳入机器学习代理的决策过程。我们使用层次结构将强化学习与维度情感模型相结合。维度模型计算两个维度，表示自治代理的实际情感状态。为了评估这种组合，我们使用了一个强化学习实验(称为Dyna Maze)，在这个实验中，智能体必须在迷宫中找到一条最优路径。我们的第一个结果表明，代理能够根据情绪来评估情况，并根据情绪做出反应。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

The 23rd IEEE International Symposium on Robot and Human Interactive Communication

自引率

0.00%

发文量