基于最佳响应更新规则的有限异质种群策略交互分析与控制

Pouria Ramazi, M. Cao
{"title":"基于最佳响应更新规则的有限异质种群策略交互分析与控制","authors":"Pouria Ramazi, M. Cao","doi":"10.1109/CDC.2015.7402928","DOIUrl":null,"url":null,"abstract":"For a finite, well-mixed population of heterogeneous agents playing evolutionary games choosing to cooperate or defect in each round of the game, we investigate, when agents update their strategies in each round using the myopic best-response rule, how the number of cooperating agents changes over time and demonstrate how to control that number by changing the agents' payoff matrices. The agents are heterogeneous in that their payoff matrices may differ from one another; we focus on the specific case when the payoff matrices, fixed throughout the evolution, correspond to prisoner's dilemma or snowdrift games. To carry out stability analysis, we identify the system's absorbing states when taking the number of cooperating agents as a random variable of interest. It is proven that when all the agents update frequently enough, the reachable final states are completely determined by the available types of payoff matrices. As a further step, we show how to control the final state by changing at the beginning of the evolution, the types of the payoff matrices of a group of agents.","PeriodicalId":308101,"journal":{"name":"2015 54th IEEE Conference on Decision and Control (CDC)","volume":"42 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"13","resultStr":"{\"title\":\"Analysis and control of strategic interactions in finite heterogeneous populations under best-response update rule\",\"authors\":\"Pouria Ramazi, M. Cao\",\"doi\":\"10.1109/CDC.2015.7402928\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"For a finite, well-mixed population of heterogeneous agents playing evolutionary games choosing to cooperate or defect in each round of the game, we investigate, when agents update their strategies in each round using the myopic best-response rule, how the number of cooperating agents changes over time and demonstrate how to control that number by changing the agents' payoff matrices. The agents are heterogeneous in that their payoff matrices may differ from one another; we focus on the specific case when the payoff matrices, fixed throughout the evolution, correspond to prisoner's dilemma or snowdrift games. To carry out stability analysis, we identify the system's absorbing states when taking the number of cooperating agents as a random variable of interest. It is proven that when all the agents update frequently enough, the reachable final states are completely determined by the available types of payoff matrices. As a further step, we show how to control the final state by changing at the beginning of the evolution, the types of the payoff matrices of a group of agents.\",\"PeriodicalId\":308101,\"journal\":{\"name\":\"2015 54th IEEE Conference on Decision and Control (CDC)\",\"volume\":\"42 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-12-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"13\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2015 54th IEEE Conference on Decision and Control (CDC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CDC.2015.7402928\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 54th IEEE Conference on Decision and Control (CDC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CDC.2015.7402928","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 13

摘要

对于在进化博弈中选择合作或背叛的有限、混合良好的异质智能体群体,我们研究了当智能体在每轮博弈中使用近视最佳响应规则更新策略时,合作智能体的数量如何随时间变化,并演示了如何通过改变智能体的收益矩阵来控制这一数量。代理人是异质的,因为他们的报酬矩阵可能彼此不同;我们关注的是在进化过程中固定的收益矩阵对应于囚徒困境或雪堆博弈的特定情况。为了进行稳定性分析,我们将合作智能体的数量作为感兴趣的随机变量来识别系统的吸收状态。证明了当所有智能体更新足够频繁时,可达的最终状态完全由可用的收益矩阵类型决定。作为进一步的步骤,我们展示了如何通过在进化开始时改变一组代理的收益矩阵的类型来控制最终状态。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Analysis and control of strategic interactions in finite heterogeneous populations under best-response update rule
For a finite, well-mixed population of heterogeneous agents playing evolutionary games choosing to cooperate or defect in each round of the game, we investigate, when agents update their strategies in each round using the myopic best-response rule, how the number of cooperating agents changes over time and demonstrate how to control that number by changing the agents' payoff matrices. The agents are heterogeneous in that their payoff matrices may differ from one another; we focus on the specific case when the payoff matrices, fixed throughout the evolution, correspond to prisoner's dilemma or snowdrift games. To carry out stability analysis, we identify the system's absorbing states when taking the number of cooperating agents as a random variable of interest. It is proven that when all the agents update frequently enough, the reachable final states are completely determined by the available types of payoff matrices. As a further step, we show how to control the final state by changing at the beginning of the evolution, the types of the payoff matrices of a group of agents.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信