Abstraction in Model Based Partially Observable Reinforcement Learning Using Extended Sequence Trees

2012 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology Pub Date : 2012-12-04 DOI:10.1109/WI-IAT.2012.161

Erkin Çilden, Faruk Polat

引用次数: 3

Abstract

Extended sequence tree is a direct method for automatic generation of useful abstractions in reinforcement learning, designed for problems that can be modelled as Markov decision process. This paper proposes a method to expand the extended sequence tree method over reinforcement learning to cover partial observability formalized via partially observable Markov decision process through belief state formalism. This expansion requires a reasonable approximation of information state. Inspired by statistical ranking, a simple but effective discretization schema over belief state space is defined. Extended sequence tree method is modified to make use of this schema under partial observability, and effectiveness of resulting algorithm is shown by experiments on some benchmark problems.

查看原文本刊更多论文

基于扩展序列树的部分可观察模型强化学习抽象

扩展序列树是一种在强化学习中自动生成有用抽象的直接方法，专为可建模为马尔可夫决策过程的问题而设计。本文提出了一种方法，将强化学习上的扩展序列树方法扩展到包含部分可观察马尔可夫决策过程的部分可观察性。这种扩展需要对信息状态有一个合理的近似。受统计排序的启发，定义了一种简单有效的信念状态空间离散化模式。在部分可观察性条件下，对扩展序列树方法进行了改进，利用了该模式，并通过一些基准问题的实验证明了算法的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2012 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology

自引率

0.00%

发文量