Planning using online evolutionary overfitting

2010 UK Workshop on Computational Intelligence (UKCI) Pub Date : 2010-11-09 DOI:10.1109/UKCI.2010.5625569

Spyridon Samothrakis, S. Lucas

引用次数: 4

Abstract

Biological systems tend to perform a range of tasks of extreme variability with extraordinary efficiency. It has been argued that a plausible scenario for achieving such versatility is explicitly learning a forward model. We perform a set of experiments using the original and a modified version of a classic reinforcement learning task, the mountain car problem, using a number of agents that encode both a direct and an abstracted version of a forward model. The results suggest that superior performance can be achieved if the forward model can be exploited in real-time by an agent that has already internalised a model-free control function.

查看原文本刊更多论文

利用在线进化过拟合进行规划

生物系统倾向于以非凡的效率执行一系列极端可变性的任务。有人认为，实现这种多功能性的合理方案是明确地学习正向模型。我们使用经典强化学习任务(山地车问题)的原始版本和修改版本进行了一组实验，使用许多代理对前向模型的直接版本和抽象版本进行编码。结果表明，如果一个已经内化了无模型控制功能的智能体可以实时地利用前向模型，则可以获得更好的性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2010 UK Workshop on Computational Intelligence (UKCI)

自引率

0.00%

发文量