Shirin Kordnoori, H. Mostafaei, M. Ostadrahimi, S. Banihashemi
{"title":"广义回归神经网络在进化马尔可夫博弈中策略选择的有效性","authors":"Shirin Kordnoori, H. Mostafaei, M. Ostadrahimi, S. Banihashemi","doi":"10.12962/j20882033.v32i1.7074","DOIUrl":null,"url":null,"abstract":"Nowadays, Evolutionary Game Theory which studies the learning model of players, has attracted more attention than before. These Games can simulate the real situation and dynamic during processing time. This paper creates the Evolutionary Markov Games, which maps players’ strategy-choosing to a Markov Decision Processes (MDPs) with payoffs. Boltzmann distribution is used for transition probability and the General Regression Neural Network (GRNN) simulating the strategy-choosing in Evolutionary Markov Games. Prisoner’s dilemma is a problem that uses the method and output results showing the overlapping the human strategy-choosing line and GRNN strategy-choosing line after 48 iterations, and they choose the same strategies. Also, the error rate of the GRNN training by Tit for Tat (TFT) strategy is lower than similar work and shows a better result.","PeriodicalId":14549,"journal":{"name":"IPTEK: The Journal for Technology and Science","volume":"48 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2021-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"The Efficacy of Choosing Strategy with General Regression Neural Network on Evolutionary Markov Games\",\"authors\":\"Shirin Kordnoori, H. Mostafaei, M. Ostadrahimi, S. Banihashemi\",\"doi\":\"10.12962/j20882033.v32i1.7074\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Nowadays, Evolutionary Game Theory which studies the learning model of players, has attracted more attention than before. These Games can simulate the real situation and dynamic during processing time. This paper creates the Evolutionary Markov Games, which maps players’ strategy-choosing to a Markov Decision Processes (MDPs) with payoffs. Boltzmann distribution is used for transition probability and the General Regression Neural Network (GRNN) simulating the strategy-choosing in Evolutionary Markov Games. Prisoner’s dilemma is a problem that uses the method and output results showing the overlapping the human strategy-choosing line and GRNN strategy-choosing line after 48 iterations, and they choose the same strategies. Also, the error rate of the GRNN training by Tit for Tat (TFT) strategy is lower than similar work and shows a better result.\",\"PeriodicalId\":14549,\"journal\":{\"name\":\"IPTEK: The Journal for Technology and Science\",\"volume\":\"48 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-08-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IPTEK: The Journal for Technology and Science\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.12962/j20882033.v32i1.7074\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IPTEK: The Journal for Technology and Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.12962/j20882033.v32i1.7074","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
目前,研究玩家学习模式的进化博弈论受到了越来越多的关注。这些游戏可以模拟真实情况和动态过程中的处理时间。本文创建了进化马尔可夫博弈,将玩家的策略选择映射到具有收益的马尔可夫决策过程(mdp)。转移概率采用玻尔兹曼分布,通用回归神经网络(GRNN)模拟进化马尔可夫博弈中的策略选择。囚徒困境是利用人类策略选择线与GRNN策略选择线在48次迭代后重叠的方法和输出结果,并选择相同策略的问题。此外,采用以牙还牙(Tit for Tat, TFT)策略训练的GRNN的错误率低于同类方法,并取得了较好的训练效果。
The Efficacy of Choosing Strategy with General Regression Neural Network on Evolutionary Markov Games
Nowadays, Evolutionary Game Theory which studies the learning model of players, has attracted more attention than before. These Games can simulate the real situation and dynamic during processing time. This paper creates the Evolutionary Markov Games, which maps players’ strategy-choosing to a Markov Decision Processes (MDPs) with payoffs. Boltzmann distribution is used for transition probability and the General Regression Neural Network (GRNN) simulating the strategy-choosing in Evolutionary Markov Games. Prisoner’s dilemma is a problem that uses the method and output results showing the overlapping the human strategy-choosing line and GRNN strategy-choosing line after 48 iterations, and they choose the same strategies. Also, the error rate of the GRNN training by Tit for Tat (TFT) strategy is lower than similar work and shows a better result.