Building a computer Mahjong player based on Monte Carlo simulation and opponent models

2015 IEEE Conference on Computational Intelligence and Games (CIG), Pub Date: 2015-11-05, DOI: 10.1109/CIG.2015.7317929

Naoki Mizukami, Yoshimasa Tsuruoka

Citations: 31

Abstract

Predicting opponents' moves and hidden states is important in imperfect information games. This paper describes a method for building a Mahjong program that models opponent players and performs Monte Carlo simulation with the models. We decompose an opponent's play into three elements, namely, waiting, winning tiles, and winning scores, and train prediction models for those elements using game records of expert human players. Opponents' moves in the Monte Carlo simulations are determined based on the probability distributions of the opponent models. We have evaluated the playing strength of the resulting program on a popular online Mahjong site “Tenhou”. The program has achieved a rating of 1718, which is significantly higher than that of the average human player.
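The way the three predicted elements could feed a Monte Carlo estimate can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not give the model form or the simulation details, so the `OpponentModel` container, the function names, and the rule that only one opponent's win is charged per trial are all assumptions made for illustration. The sketch estimates only the expected loss from dealing in with a given discard, using a waiting probability, a per-tile winning probability, and a predicted score.

```python
import random
from dataclasses import dataclass
from typing import Dict, List, Optional, Tuple


@dataclass
class OpponentModel:
    """Hypothetical container for the three predicted elements of one opponent."""
    p_waiting: float                  # predicted P(opponent is waiting)
    p_winning_tile: Dict[str, float]  # predicted P(tile wins | opponent is waiting)
    expected_score: float             # predicted winning score if we deal in


def simulate_discard_loss(tile: str,
                          opponents: List[OpponentModel],
                          n_sims: int = 10_000,
                          rng: Optional[random.Random] = None) -> float:
    """Monte Carlo estimate of the expected points lost by discarding `tile`."""
    rng = rng or random.Random(0)
    total = 0.0
    for _ in range(n_sims):
        for opp in opponents:
            # Sample whether this opponent is waiting, then whether the
            # discarded tile is one of their winning tiles.
            is_waiting = rng.random() < opp.p_waiting
            if is_waiting and rng.random() < opp.p_winning_tile.get(tile, 0.0):
                total += opp.expected_score
                break  # assumption: charge at most one win per trial
    return total / n_sims


def safest_discard(hand: List[str],
                   opponents: List[OpponentModel],
                   n_sims: int = 10_000,
                   seed: int = 0) -> Tuple[str, Dict[str, float]]:
    """Return the tile with the lowest estimated loss, plus all estimates."""
    rng = random.Random(seed)
    losses = {t: simulate_discard_loss(t, opponents, n_sims, rng) for t in hand}
    return min(losses, key=losses.get), losses


if __name__ == "__main__":
    # Made-up probabilities and score, purely for demonstration.
    opponents = [OpponentModel(0.5, {"5m": 0.2, "1z": 0.0}, 8000.0)]
    best, losses = safest_discard(["5m", "1z"], opponents, n_sims=2000)
    print(best, losses)
```

A full player would also have to weigh the value of its own hand against this loss estimate; the sketch covers only the defensive side, and the probabilities in the usage example are invented rather than taken from trained models.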
