Improvement of Particle Filter for Reinforcement Learning

2011 10th International Conference on Machine Learning and Applications and Workshops Pub Date : 2011-12-18 DOI:10.1109/ICMLA.2011.75

A. Notsu, Katsuhiro Honda, H. Ichihashi, Yuki Komori, Yuuki Iwamoto

引用次数: 3

Abstract

In this paper, we propose a novel framework of learning that uses a particle filter. In a real-world situation, it is difficult to express a continuous state and a continuous action. The problem is solved by using our particle filter, which is one of the methods for dividing a continuous state and a continuous action. Our method needs only a small number of memories and parameters for searching the solution in the space. We conducted pendulum and double-pendulum simulations and observed the difference between the conventional method and the proposed method. Simulation results show there was no bad effect on the received reward.

查看原文本刊更多论文

粒子滤波在强化学习中的改进

在本文中，我们提出了一个使用粒子滤波的学习框架。在现实世界中，很难表达连续的状态和连续的动作。我们提出的粒子滤波是分割连续状态和连续动作的一种方法，它解决了这一问题。我们的方法只需要少量的存储器和参数就可以在空间中搜索解。我们进行了单摆和双摆模拟，并观察了传统方法与所提方法的差异。仿真结果表明，这对获得的奖励没有不良影响。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2011 10th International Conference on Machine Learning and Applications and Workshops

自引率

0.00%

发文量