Enhancement of CNN-based 2048 Player with Monte-Carlo Tree Search

2022 International Conference on Technologies and Applications of Artificial Intelligence (TAAI) Pub Date : 2022-12-01 DOI:10.1109/TAAI57707.2022.00018

Shota Watanabe, Kiminori Matsuzaki

引用次数: 0

Abstract

In this study, we developed computer players for a single-player stochastic game 2048 using an existing neural-network evaluation function and a version of Monte-Carlo tree search. We applied the Monte-Carlo softmax search (MCSS) algorithm, with some modifications in order to adapt it to the stochastic game, and designed six methods of controlling the search algorithm. We evaluated the MCSS players in an exhaustive manner and also conducted longer experiments for two MCSS players by changing the number of simulations per move. Our MCSS player achieved an average score of 533 542 under the limit of 2000 simulations per move. This result was better than Expectimax players that used the same evaluation function.

查看原文本刊更多论文

用蒙特卡罗树搜索增强基于cnn的2048播放器

在这项研究中，我们使用现有的神经网络评估函数和蒙特卡罗树搜索版本开发了单人随机博弈2048的计算机玩家。采用蒙特卡罗软最大搜索算法，并对其进行了一些修改，使其适应随机博弈，设计了6种控制搜索算法的方法。我们以详尽的方式评估了MCSS玩家，并通过改变每次移动的模拟次数，对两个MCSS玩家进行了更长的实验。我们的MCSS玩家在每次移动2000次模拟的限制下获得了533 542的平均分数。该结果优于使用相同评价函数的Expectimax玩家。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2022 International Conference on Technologies and Applications of Artificial Intelligence (TAAI)

自引率

0.00%

发文量