Backpropagation without human supervision for visual control in Quake II

2009 IEEE Symposium on Computational Intelligence and Games Pub Date : 2009-09-07 DOI:10.1109/CIG.2009.5286462

M. Parker, B. D. Bryant

引用次数: 7

Abstract

Backpropagation and neuroevolution are used in a Lamarckian evolution process to train a neural network visual controller for agents in the Quake II environment. In previous work, we hand-coded a non-visual controller for supervising in backpropagation, but hand-coding can only be done for problems with known solutions. In this research the problem for the agent is to attack a moving enemy in a visually complex room with a large central pillar. Because we did not know a solution to the problem, we could not hand-code a supervising controller; instead, we evolve a non-visual neural network as supervisor to the visual controller. This setup creates controllers that learn much faster and have a greater fitness than those learning by neuroevolution-only on the same problem in the same amount of time.

查看原文本刊更多论文

在《雷神之锤2》中，没有人类监督的反向传播视觉控制

在拉马克进化过程中使用反向传播和神经进化来训练Quake II环境中代理的神经网络视觉控制器。在以前的工作中，我们手工编码了一个非视觉控制器来监督反向传播，但是手工编码只能用于已知解的问题。在这项研究中，智能体的问题是在一个视觉复杂的房间里攻击一个移动的敌人，房间里有一个大的中心柱子。因为我们不知道问题的解决方案，所以我们无法手工编写监督控制器;相反，我们进化了一个非视觉神经网络作为视觉控制器的监督器。这种设置使控制器比那些通过神经进化学习的控制器学习得更快，具有更强的适应性——只能在相同的时间内解决相同的问题。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2009 IEEE Symposium on Computational Intelligence and Games

自引率

0.00%

发文量