Novelty detector for reinforcement learning based on forecasting

2014 IEEE 12th International Symposium on Applied Machine Intelligence and Informatics (SAMI) Pub Date : 2014-05-29 DOI:10.1109/SAMI.2014.6822379

M. Gregor, J. Spalek

引用次数: 9

Abstract

The paper proposes a novelty detector based on an artificial neural network forecaster. It shows how such forecaster can be constructed and as a novelty detector. Two variations of the forecaster are presented - one is based on backpropagation, and the other on Rprop. It is shown how the detector can be used to approach the exploration vs. exploitation trade-off. Experimental results are presented for both versions of the detector along with a comparison with novelty detectors based on the concept of the habituated self-organising map (HSOM). It is shown that learning based on the proposed detector can outperform that using the HSOM-based detector. Finally, the paper identifies several lines along which future research may progress.

查看原文本刊更多论文

基于预测的强化学习新颖性检测器

提出了一种基于人工神经网络预测器的新颖性检测方法。它展示了如何构建这样的预测器，并作为一个新奇的探测器。提出了预报器的两种变体——一种是基于反向传播的，另一种是基于反向传播的。它展示了如何使用检测器来处理探索与开发之间的权衡。给出了两种探测器的实验结果，并与基于惯化自组织映射(HSOM)概念的新型探测器进行了比较。结果表明，基于该检测器的学习性能优于基于hsom检测器的学习性能。最后，本文确定了未来研究可能取得进展的几个方向。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2014 IEEE 12th International Symposium on Applied Machine Intelligence and Informatics (SAMI)

自引率

0.00%

发文量