Reinforcement Learning using Kalman Filters

2019 IEEE 18th International Conference on Cognitive Informatics & Cognitive Computing (ICCI*CC) Pub Date : 2019-07-01 DOI:10.1109/ICCICC46617.2019.9146066

Kei Takahata, T. Miura

引用次数: 2

Abstract

In this investigation, we discuss a game of pursuit-evasion, or a hunter-prey problems using Q-learning framework. This has always been a popular research subject in the field of robotics where a hunter moves around in pursuit a prey. We involve Kalman filters to estimate the prey's status (location and velocity) and learn Q-values based on the estimated status. We evaluate our approach by convergence of Q-values and capturing steps.

查看原文本刊更多论文

使用卡尔曼滤波器的强化学习

在本研究中，我们使用Q-learning框架讨论了一个追捕-逃避博弈，或一个狩猎-猎物问题。这一直是机器人领域的热门研究课题，即猎人四处移动以追捕猎物。我们使用卡尔曼滤波来估计猎物的状态(位置和速度)，并根据估计的状态学习q值。我们通过q值的收敛性和捕获步骤来评估我们的方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2019 IEEE 18th International Conference on Cognitive Informatics & Cognitive Computing (ICCI*CC)

自引率

0.00%

发文量