Faster reinforcement learning after pretraining deep networks to predict state dynamics

2015 International Joint Conference on Neural Networks (IJCNN) Pub Date : 2015-07-12 DOI:10.1109/IJCNN.2015.7280824

C. Anderson, Minwoo Lee, D. Elliott

引用次数: 42

Abstract

Deep learning algorithms have recently appeared that pretrain hidden layers of neural networks in unsupervised ways, leading to state-of-the-art performance on large classification problems. These methods can also pretrain networks used for reinforcement learning. However, this ignores the additional information that exists in a reinforcement learning paradigm via the ongoing sequence of state, action, new state tuples. This paper demonstrates that learning a predictive model of state dynamics can result in a pretrained hidden layer structure that reduces the time needed to solve reinforcement learning problems.

查看原文本刊更多论文

预训练深度网络后更快的强化学习来预测状态动态

最近出现了深度学习算法，以无监督的方式预训练神经网络的隐藏层，从而在大型分类问题上取得了最先进的性能。这些方法也可以预训练用于强化学习的网络。然而，这忽略了通过状态、动作、新状态元组的持续序列存在于强化学习范式中的附加信息。本文证明，学习状态动力学的预测模型可以产生预训练的隐藏层结构，从而减少解决强化学习问题所需的时间。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2015 International Joint Conference on Neural Networks (IJCNN)

自引率

0.00%

发文量