{"title":"Automatic Discovery and Transfer of MAXQ Hierarchies in a Complex System","authors":"Hongbing Wang, Wenya Li, Xuan Zhou","doi":"10.1109/ICTAI.2012.165","DOIUrl":null,"url":null,"abstract":"Reinforcement learning has been an important category of machine learning approaches exhibiting self-learning and online learning characteristics. Using reinforcement learning, an agent can learn its behaviors through trial-and-error interactions with a dynamic environment and finally come up with an optimal strategy. Reinforcement learning suffers the curse of dimensionality, though there has been significant progress to overcome this issue in recent years. MAXQ is one of the most common approaches for reinforcement learning. To function properly, MAXQ requires a decomposition of the agent's task into a task hierarchy. Previously, the decomposition can only be done manually. In this paper, we propose a mechanism for automatic subtask discovery. The mechanism applies clustering to automatically construct task hierarchy required by MAXQ, such that MAXQ can be fully automated. We present the design of our mechanism, and demonstrate its effectiveness through theoretical analysis and an extensive experimental evaluation.","PeriodicalId":155588,"journal":{"name":"2012 IEEE 24th International Conference on Tools with Artificial Intelligence","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 IEEE 24th International Conference on Tools with Artificial Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICTAI.2012.165","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Citations: 2
Abstract
Reinforcement learning is an important category of machine learning approaches, characterized by self-learning and online learning. Using reinforcement learning, an agent can learn its behavior through trial-and-error interaction with a dynamic environment and eventually arrive at an optimal policy. Reinforcement learning suffers from the curse of dimensionality, although significant progress has been made on this issue in recent years. MAXQ is one of the most common approaches to hierarchical reinforcement learning. To function properly, MAXQ requires a decomposition of the agent's task into a task hierarchy. Previously, this decomposition could only be done manually. In this paper, we propose a mechanism for automatic subtask discovery. The mechanism applies clustering to automatically construct the task hierarchy required by MAXQ, so that MAXQ can be fully automated. We present the design of our mechanism and demonstrate its effectiveness through theoretical analysis and extensive experimental evaluation.
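As an illustration of what a MAXQ task hierarchy looks like, the sketch below hand-codes the classic Taxi-domain decomposition (Dietterich's standard MAXQ example), i.e. the kind of structure the proposed clustering mechanism is meant to discover automatically rather than require by hand. The `Subtask` class, state-dictionary keys, and function names are illustrative assumptions, not the authors' code or the paper's algorithm.

```python
# Minimal sketch of a hand-written MAXQ task hierarchy for the Taxi domain.
# This only shows the data structure a MAXQ learner decomposes over; it is
# not an implementation of the paper's automatic-discovery mechanism.

from dataclasses import dataclass, field
from typing import Callable, List

@dataclass
class Subtask:
    name: str
    children: List["Subtask"] = field(default_factory=list)
    # Termination predicate over states; primitive actions terminate immediately.
    is_terminal: Callable[[dict], bool] = lambda s: True

# Primitive actions (leaves of the hierarchy).
north, south, east, west = (Subtask(a) for a in ("North", "South", "East", "West"))
pickup, putdown = Subtask("Pickup"), Subtask("Putdown")

# Composite subtasks: Navigate moves the taxi to a target location, Get fetches
# the passenger, Put delivers the passenger, and Root solves the whole episode.
# Navigate is shared by Get and Put, so the hierarchy is a DAG rather than a tree.
navigate = Subtask("Navigate",
                   children=[north, south, east, west],
                   is_terminal=lambda s: s["taxi"] == s["target"])
get = Subtask("Get",
              children=[navigate, pickup],
              is_terminal=lambda s: s["passenger"] == "in_taxi")
put = Subtask("Put",
              children=[navigate, putdown],
              is_terminal=lambda s: s["passenger"] == s["destination"])
root = Subtask("Root",
               children=[get, put],
               is_terminal=lambda s: s["passenger"] == s["destination"])

def print_hierarchy(task: Subtask, depth: int = 0) -> None:
    """Print the subtask graph that MAXQ value decomposition is defined over."""
    print("  " * depth + task.name)
    for child in task.children:
        print_hierarchy(child, depth + 1)

if __name__ == "__main__":
    print_hierarchy(root)
```

In this sketch the hierarchy is written down manually; the point of the paper is to obtain such a structure automatically, by clustering states or experience so that recurring regions of the state space become candidate subtasks with their own termination conditions.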