大种群随机对策的渐近分析

Ramesh Johari, S. Adlakha, G. Weintraub
{"title":"大种群随机对策的渐近分析","authors":"Ramesh Johari, S. Adlakha, G. Weintraub","doi":"10.1145/1807406.1807474","DOIUrl":null,"url":null,"abstract":"We study stochastic games with a large number of players, where players are coupled via their payoff functions. A standard solution concept for such games is Markov perfect equilibrium (MPE). It is well known that the computation of MPE suffers from the \"curse of dimensionality.\" To deal with this complexity, several researchers have introduced a notion of mean field equilibrium that we call oblivious equilibrium (OE). In OE, each player reacts to only the average behavior of other players. In this paper, we develop a unified framework to study OE in large population stochastic games. In particular, we prove that under a set of simple assumptions on the model, an OE always exists. Furthermore, as a simple consequence of this existence theorem, we prove that OE approximates MPE well: we show that from the viewpoint of a single agent, a near optimal decision making policy is one that reacts only to the average behavior of its environment. We also study two different classes for games, competition and coordination games. For these classes of games, we isolate key assumptions on the model primitives under which OE exists and approximates MPE asymptotically.","PeriodicalId":142982,"journal":{"name":"Behavioral and Quantitative Game Theory","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2010-05-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Asymptotic analysis of large population stochastic games\",\"authors\":\"Ramesh Johari, S. Adlakha, G. Weintraub\",\"doi\":\"10.1145/1807406.1807474\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We study stochastic games with a large number of players, where players are coupled via their payoff functions. A standard solution concept for such games is Markov perfect equilibrium (MPE). It is well known that the computation of MPE suffers from the \\\"curse of dimensionality.\\\" To deal with this complexity, several researchers have introduced a notion of mean field equilibrium that we call oblivious equilibrium (OE). In OE, each player reacts to only the average behavior of other players. In this paper, we develop a unified framework to study OE in large population stochastic games. In particular, we prove that under a set of simple assumptions on the model, an OE always exists. Furthermore, as a simple consequence of this existence theorem, we prove that OE approximates MPE well: we show that from the viewpoint of a single agent, a near optimal decision making policy is one that reacts only to the average behavior of its environment. We also study two different classes for games, competition and coordination games. For these classes of games, we isolate key assumptions on the model primitives under which OE exists and approximates MPE asymptotically.\",\"PeriodicalId\":142982,\"journal\":{\"name\":\"Behavioral and Quantitative Game Theory\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2010-05-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Behavioral and Quantitative Game Theory\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/1807406.1807474\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Behavioral and Quantitative Game Theory","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/1807406.1807474","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

我们研究具有大量参与者的随机博弈,其中参与者通过其收益函数相关联。这类博弈的标准解概念是马尔可夫完美均衡(MPE)。众所周知,MPE的计算受到“维数诅咒”的困扰。为了处理这种复杂性,一些研究人员引入了平均场平衡的概念,我们称之为遗忘平衡(OE)。在《OE》中,每个玩家只会对其他玩家的平均行为做出反应。在本文中,我们建立了一个统一的框架来研究大群体随机对策中的OE。特别地,我们证明了在模型的一组简单假设下,OE总是存在的。此外,作为这个存在性定理的一个简单结果,我们证明了OE很好地近似于MPE:我们表明,从单个智能体的角度来看,一个接近最优的决策策略是一个只对其环境的平均行为作出反应的策略。我们还学习了两种不同的游戏,竞赛和协调游戏。对于这类博弈,我们分离了OE存在的模型原语的关键假设,并渐近地逼近了MPE。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Asymptotic analysis of large population stochastic games
We study stochastic games with a large number of players, where players are coupled via their payoff functions. A standard solution concept for such games is Markov perfect equilibrium (MPE). It is well known that the computation of MPE suffers from the "curse of dimensionality." To deal with this complexity, several researchers have introduced a notion of mean field equilibrium that we call oblivious equilibrium (OE). In OE, each player reacts to only the average behavior of other players. In this paper, we develop a unified framework to study OE in large population stochastic games. In particular, we prove that under a set of simple assumptions on the model, an OE always exists. Furthermore, as a simple consequence of this existence theorem, we prove that OE approximates MPE well: we show that from the viewpoint of a single agent, a near optimal decision making policy is one that reacts only to the average behavior of its environment. We also study two different classes for games, competition and coordination games. For these classes of games, we isolate key assumptions on the model primitives under which OE exists and approximates MPE asymptotically.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信