The Efficacy of Choosing Strategy with General Regression Neural Network on Evolutionary Markov Games

IPTEK: The Journal for Technology and Science. Pub Date: 2021-08-30. DOI: 10.12962/j20882033.v32i1.7074

Shirin Kordnoori, H. Mostafaei, M. Ostadrahimi, S. Banihashemi


Abstract

Evolutionary game theory, which studies how players learn, has attracted growing attention. Evolutionary games can simulate real situations and their dynamics over time. This paper constructs Evolutionary Markov Games, which map players' strategy choices to a Markov Decision Process (MDP) with payoffs. The Boltzmann distribution is used for the transition probabilities, and a General Regression Neural Network (GRNN) simulates strategy choice in the Evolutionary Markov Games. The method is applied to the Prisoner's Dilemma. The results show that the human strategy-choosing line and the GRNN strategy-choosing line overlap after 48 iterations, at which point both choose the same strategies. In addition, the error rate of the GRNN trained with the Tit for Tat (TFT) strategy is lower than that reported in similar work.
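The two building blocks named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The payoff values (5, 3, 1, 0), the temperature, the smoothing width `sigma`, the function names, and the tiny Tit for Tat training set are all assumptions chosen for illustration. The first function turns payoffs into Boltzmann choice probabilities. The second computes a GRNN output in its standard form, a Gaussian-kernel weighted average of training targets.

```python
import math

def boltzmann_probs(payoffs, temperature=1.0):
    """Boltzmann distribution: P(a) is proportional to exp(payoff[a] / temperature)."""
    top = max(payoffs)  # subtract the maximum for numerical stability
    weights = [math.exp((p - top) / temperature) for p in payoffs]
    total = sum(weights)
    return [w / total for w in weights]

def grnn_predict(x, train_x, train_y, sigma=0.5):
    """GRNN output: Gaussian-kernel weighted average of the training targets."""
    weights = [
        math.exp(-sum((a - b) ** 2 for a, b in zip(x, xi)) / (2 * sigma ** 2))
        for xi in train_x
    ]
    return sum(w * y for w, y in zip(weights, train_y)) / sum(weights)

# Assumed Prisoner's Dilemma payoffs for the row player against a cooperating
# opponent: cooperate -> 3, defect -> 5 (illustrative values only).
probs = boltzmann_probs([3.0, 5.0], temperature=1.0)

# Assumed Tit for Tat training data: the input is the opponent's previous
# move (0 = defect, 1 = cooperate), the target is the move TFT plays next.
train_x = [[0.0], [1.0]]
train_y = [0.0, 1.0]
after_cooperation = grnn_predict([1.0], train_x, train_y)
after_defection = grnn_predict([0.0], train_x, train_y)
```

A lower temperature concentrates the Boltzmann probabilities on the higher payoff, and a smaller `sigma` makes the GRNN follow its nearest training sample more closely.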



