广义回归神经网络在进化马尔可夫博弈中策略选择的有效性

Shirin Kordnoori, H. Mostafaei, M. Ostadrahimi, S. Banihashemi
{"title":"广义回归神经网络在进化马尔可夫博弈中策略选择的有效性","authors":"Shirin Kordnoori, H. Mostafaei, M. Ostadrahimi, S. Banihashemi","doi":"10.12962/j20882033.v32i1.7074","DOIUrl":null,"url":null,"abstract":"Nowadays, Evolutionary Game Theory which studies the learning model of players, has attracted more attention than before. These Games can simulate the real situation and dynamic during processing time. This paper creates the Evolutionary Markov Games, which maps players’ strategy-choosing to a Markov Decision Processes (MDPs) with payoffs. Boltzmann distribution is used for transition probability and the General Regression Neural Network (GRNN) simulating the strategy-choosing in Evolutionary Markov Games. Prisoner’s dilemma is a problem that uses the method and output results showing the overlapping the human strategy-choosing line and GRNN strategy-choosing line after 48 iterations, and they choose the same strategies. Also, the error rate of the GRNN training by Tit for Tat (TFT) strategy is lower than similar work and shows a better result.","PeriodicalId":14549,"journal":{"name":"IPTEK: The Journal for Technology and Science","volume":"48 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2021-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"The Efficacy of Choosing Strategy with General Regression Neural Network on Evolutionary Markov Games\",\"authors\":\"Shirin Kordnoori, H. Mostafaei, M. Ostadrahimi, S. Banihashemi\",\"doi\":\"10.12962/j20882033.v32i1.7074\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Nowadays, Evolutionary Game Theory which studies the learning model of players, has attracted more attention than before. These Games can simulate the real situation and dynamic during processing time. This paper creates the Evolutionary Markov Games, which maps players’ strategy-choosing to a Markov Decision Processes (MDPs) with payoffs. Boltzmann distribution is used for transition probability and the General Regression Neural Network (GRNN) simulating the strategy-choosing in Evolutionary Markov Games. Prisoner’s dilemma is a problem that uses the method and output results showing the overlapping the human strategy-choosing line and GRNN strategy-choosing line after 48 iterations, and they choose the same strategies. Also, the error rate of the GRNN training by Tit for Tat (TFT) strategy is lower than similar work and shows a better result.\",\"PeriodicalId\":14549,\"journal\":{\"name\":\"IPTEK: The Journal for Technology and Science\",\"volume\":\"48 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-08-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IPTEK: The Journal for Technology and Science\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.12962/j20882033.v32i1.7074\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IPTEK: The Journal for Technology and Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.12962/j20882033.v32i1.7074","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

目前,研究玩家学习模式的进化博弈论受到了越来越多的关注。这些游戏可以模拟真实情况和动态过程中的处理时间。本文创建了进化马尔可夫博弈,将玩家的策略选择映射到具有收益的马尔可夫决策过程(mdp)。转移概率采用玻尔兹曼分布,通用回归神经网络(GRNN)模拟进化马尔可夫博弈中的策略选择。囚徒困境是利用人类策略选择线与GRNN策略选择线在48次迭代后重叠的方法和输出结果,并选择相同策略的问题。此外,采用以牙还牙(Tit for Tat, TFT)策略训练的GRNN的错误率低于同类方法,并取得了较好的训练效果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
The Efficacy of Choosing Strategy with General Regression Neural Network on Evolutionary Markov Games
Nowadays, Evolutionary Game Theory which studies the learning model of players, has attracted more attention than before. These Games can simulate the real situation and dynamic during processing time. This paper creates the Evolutionary Markov Games, which maps players’ strategy-choosing to a Markov Decision Processes (MDPs) with payoffs. Boltzmann distribution is used for transition probability and the General Regression Neural Network (GRNN) simulating the strategy-choosing in Evolutionary Markov Games. Prisoner’s dilemma is a problem that uses the method and output results showing the overlapping the human strategy-choosing line and GRNN strategy-choosing line after 48 iterations, and they choose the same strategies. Also, the error rate of the GRNN training by Tit for Tat (TFT) strategy is lower than similar work and shows a better result.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
17
审稿时长
9 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信