Autonomous Agents in Snake Game via Deep Reinforcement Learning

2018 IEEE International Conference on Agents (ICA) Pub Date : 2018-07-01 DOI:10.1109/AGENTS.2018.8460004

Zhepei Wei, D. Wang, M. Zhang, A. Tan, C. Miao, You Zhou

引用次数: 11

Abstract

Since DeepMind pioneered a deep reinforcement learning (DRL) model to play the Atari games, DRL has become a commonly adopted method to enable the agents to learn complex control policies in various video games. However, similar approaches may still need to be improved when applied to more challenging scenarios, where reward signals are sparse and delayed. In this paper, we develop a refined DRL model to enable our autonomous agent to play the classical Snake Game, whose constraint gets stricter as the game progresses. Specifically, we employ a convolutional neural network (CNN) trained with a variant of Q-learning. Moreover, we propose a carefully designed reward mechanism to properly train the network, adopt a training gap strategy to temporarily bypass training after the location of the target changes, and introduce a dual experience replay method to categorize different experiences for better training efficacy. The experimental results show that our agent outperforms the baseline model and surpasses human-level performance in terms of playing the Snake Game.

查看原文本刊更多论文

基于深度强化学习的Snake博弈自主智能体

自从DeepMind开创了一种深度强化学习(DRL)模型来玩Atari游戏以来，DRL已经成为一种常用的方法，使智能体能够在各种视频游戏中学习复杂的控制策略。然而，当应用于更具挑战性的场景时，类似的方法可能仍然需要改进，其中奖励信号稀疏且延迟。在本文中，我们开发了一个改进的DRL模型，使我们的自主智能体能够玩经典的Snake游戏，随着游戏的进行，其约束变得更加严格。具体来说，我们使用了一个经过q学习变体训练的卷积神经网络(CNN)。此外，我们提出了精心设计的奖励机制，以适当地训练网络，采用训练间隙策略，在目标位置发生变化后暂时绕过训练，并引入双重经验重播方法，对不同的经验进行分类，以获得更好的训练效果。实验结果表明，我们的智能体在玩蛇游戏方面的表现优于基线模型，超过了人类水平的表现。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2018 IEEE International Conference on Agents (ICA)

自引率

0.00%

发文量