建立了一个基于蒙特卡罗仿真和对手模型的计算机麻将播放器

Naoki Mizukami, Yoshimasa Tsuruoka
{"title":"建立了一个基于蒙特卡罗仿真和对手模型的计算机麻将播放器","authors":"Naoki Mizukami, Yoshimasa Tsuruoka","doi":"10.1109/CIG.2015.7317929","DOIUrl":null,"url":null,"abstract":"Predicting opponents' moves and hidden states is important in imperfect information games. This paper describes a method for building a Mahjong program that models opponent players and performs Monte Carlo simulation with the models. We decompose an opponent's play into three elements, namely, waiting, winning tiles, and winning scores, and train prediction models for those elements using game records of expert human players. Opponents' moves in the Monte Carlo simulations are determined based on the probability distributions of the opponent models. We have evaluated the playing strength of the resulting program on a popular online Mahjong site “Tenhou”. The program has achieved a rating of 1718, which is significantly higher than that of the average human player.","PeriodicalId":244862,"journal":{"name":"2015 IEEE Conference on Computational Intelligence and Games (CIG)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"31","resultStr":"{\"title\":\"Building a computer Mahjong player based on Monte Carlo simulation and opponent models\",\"authors\":\"Naoki Mizukami, Yoshimasa Tsuruoka\",\"doi\":\"10.1109/CIG.2015.7317929\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Predicting opponents' moves and hidden states is important in imperfect information games. This paper describes a method for building a Mahjong program that models opponent players and performs Monte Carlo simulation with the models. We decompose an opponent's play into three elements, namely, waiting, winning tiles, and winning scores, and train prediction models for those elements using game records of expert human players. Opponents' moves in the Monte Carlo simulations are determined based on the probability distributions of the opponent models. We have evaluated the playing strength of the resulting program on a popular online Mahjong site “Tenhou”. The program has achieved a rating of 1718, which is significantly higher than that of the average human player.\",\"PeriodicalId\":244862,\"journal\":{\"name\":\"2015 IEEE Conference on Computational Intelligence and Games (CIG)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-11-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"31\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2015 IEEE Conference on Computational Intelligence and Games (CIG)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CIG.2015.7317929\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 IEEE Conference on Computational Intelligence and Games (CIG)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CIG.2015.7317929","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 31

摘要

在不完全信息博弈中,预测对手的移动和隐藏状态非常重要。本文描述了一种构建麻将程序的方法,该程序对对手玩家进行建模,并利用这些模型进行蒙特卡罗模拟。我们将对手的玩法分解为三个元素,即等待,获胜砖块和获胜分数,并使用专家玩家的游戏记录来训练这些元素的预测模型。在蒙特卡罗模拟中,对手的走法是根据对手模型的概率分布来确定的。我们在一个很受欢迎的麻将网站“天后”上评估了最终程序的游戏强度。该程序的评分达到了1718分,明显高于人类玩家的平均水平。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Building a computer Mahjong player based on Monte Carlo simulation and opponent models
Predicting opponents' moves and hidden states is important in imperfect information games. This paper describes a method for building a Mahjong program that models opponent players and performs Monte Carlo simulation with the models. We decompose an opponent's play into three elements, namely, waiting, winning tiles, and winning scores, and train prediction models for those elements using game records of expert human players. Opponents' moves in the Monte Carlo simulations are determined based on the probability distributions of the opponent models. We have evaluated the playing strength of the resulting program on a popular online Mahjong site “Tenhou”. The program has achieved a rating of 1718, which is significantly higher than that of the average human player.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信