Monte-Carlo Tree Search and minimax hybrids

2013 IEEE Conference on Computational Inteligence in Games (CIG) Pub Date : 2013-10-17 DOI:10.1109/CIG.2013.6633630

Hendrik Baier, M. Winands

引用次数: 29

Abstract

Monte-Carlo Tree Search is a sampling-based search algorithm that has been successfully applied to a variety of games. Monte-Carlo rollouts allow it to take distant consequences of moves into account, giving it a strategic advantage in many domains over traditional depth-limited minimax search with alpha-beta pruning. However, MCTS builds a highly selective tree and can therefore miss crucial moves and fall into traps in tactical situations. Full-width minimax search does not suffer from this weakness. This paper proposes MCTS-minimax hybrids that employ shallow minimax searches within the MCTS framework. The three proposed approaches use minimax in the selection/expansion phase, the rollout phase, and the backpropagation phase of MCTS. Without requiring domain knowledge in the form of evaluation functions, these hybrid algorithms are a first step at combining the strategic strength of MCTS and the tactical strength of minimax. We investigate their effectiveness in the test domains of Connect-4 and Breakthrough.

查看原文本刊更多论文

蒙特卡罗树搜索和极大极小混合

蒙特卡洛树搜索是一种基于采样的搜索算法，已成功应用于各种游戏。蒙特卡罗的推出允许它考虑到移动的遥远后果，使它在许多领域比传统的深度限制的极大极小搜索具有战略性优势。然而，MCTS建立了一个高度选择性的树，因此可能会错过关键的动作，并在战术情况下陷入陷阱。全宽度极大极小搜索没有这个缺点。本文提出了在MCTS框架内采用浅极大极小搜索的MCTS-minimax混合算法。提出的三种方法在MCTS的选择/扩展阶段、推出阶段和反向传播阶段使用极小最大值。不需要评估函数形式的领域知识，这些混合算法是将MCTS的战略强度和极大极小的战术强度结合起来的第一步。我们研究了它们在Connect-4和Breakthrough测试领域的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2013 IEEE Conference on Computational Inteligence in Games (CIG)

自引率

0.00%

发文量