{"title":"Enhancement of CNN-based 2048 Player with Monte-Carlo Tree Search","authors":"Shota Watanabe, Kiminori Matsuzaki","doi":"10.1109/TAAI57707.2022.00018","DOIUrl":null,"url":null,"abstract":"In this study, we developed computer players for a single-player stochastic game 2048 using an existing neural-network evaluation function and a version of Monte-Carlo tree search. We applied the Monte-Carlo softmax search (MCSS) algorithm, with some modifications in order to adapt it to the stochastic game, and designed six methods of controlling the search algorithm. We evaluated the MCSS players in an exhaustive manner and also conducted longer experiments for two MCSS players by changing the number of simulations per move. Our MCSS player achieved an average score of 533 542 under the limit of 2000 simulations per move. This result was better than Expectimax players that used the same evaluation function.","PeriodicalId":111620,"journal":{"name":"2022 International Conference on Technologies and Applications of Artificial Intelligence (TAAI)","volume":"196 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Technologies and Applications of Artificial Intelligence (TAAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/TAAI57707.2022.00018","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In this study, we developed computer players for a single-player stochastic game 2048 using an existing neural-network evaluation function and a version of Monte-Carlo tree search. We applied the Monte-Carlo softmax search (MCSS) algorithm, with some modifications in order to adapt it to the stochastic game, and designed six methods of controlling the search algorithm. We evaluated the MCSS players in an exhaustive manner and also conducted longer experiments for two MCSS players by changing the number of simulations per move. Our MCSS player achieved an average score of 533 542 under the limit of 2000 simulations per move. This result was better than Expectimax players that used the same evaluation function.