基于最佳响应更新规则的有限异质种群策略交互分析与控制

2015 54th IEEE Conference on Decision and Control (CDC) Pub Date : 2015-12-15 DOI:10.1109/CDC.2015.7402928

Pouria Ramazi, M. Cao

{"title":"基于最佳响应更新规则的有限异质种群策略交互分析与控制","authors":"Pouria Ramazi, M. Cao","doi":"10.1109/CDC.2015.7402928","DOIUrl":null,"url":null,"abstract":"For a finite, well-mixed population of heterogeneous agents playing evolutionary games choosing to cooperate or defect in each round of the game, we investigate, when agents update their strategies in each round using the myopic best-response rule, how the number of cooperating agents changes over time and demonstrate how to control that number by changing the agents' payoff matrices. The agents are heterogeneous in that their payoff matrices may differ from one another; we focus on the specific case when the payoff matrices, fixed throughout the evolution, correspond to prisoner's dilemma or snowdrift games. To carry out stability analysis, we identify the system's absorbing states when taking the number of cooperating agents as a random variable of interest. It is proven that when all the agents update frequently enough, the reachable final states are completely determined by the available types of payoff matrices. As a further step, we show how to control the final state by changing at the beginning of the evolution, the types of the payoff matrices of a group of agents.","PeriodicalId":308101,"journal":{"name":"2015 54th IEEE Conference on Decision and Control (CDC)","volume":"42 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"13","resultStr":"{\"title\":\"Analysis and control of strategic interactions in finite heterogeneous populations under best-response update rule\",\"authors\":\"Pouria Ramazi, M. Cao\",\"doi\":\"10.1109/CDC.2015.7402928\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"For a finite, well-mixed population of heterogeneous agents playing evolutionary games choosing to cooperate or defect in each round of the game, we investigate, when agents update their strategies in each round using the myopic best-response rule, how the number of cooperating agents changes over time and demonstrate how to control that number by changing the agents' payoff matrices. The agents are heterogeneous in that their payoff matrices may differ from one another; we focus on the specific case when the payoff matrices, fixed throughout the evolution, correspond to prisoner's dilemma or snowdrift games. To carry out stability analysis, we identify the system's absorbing states when taking the number of cooperating agents as a random variable of interest. It is proven that when all the agents update frequently enough, the reachable final states are completely determined by the available types of payoff matrices. As a further step, we show how to control the final state by changing at the beginning of the evolution, the types of the payoff matrices of a group of agents.\",\"PeriodicalId\":308101,\"journal\":{\"name\":\"2015 54th IEEE Conference on Decision and Control (CDC)\",\"volume\":\"42 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-12-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"13\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2015 54th IEEE Conference on Decision and Control (CDC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CDC.2015.7402928\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 54th IEEE Conference on Decision and Control (CDC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CDC.2015.7402928","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 13

摘要

对于在进化博弈中选择合作或背叛的有限、混合良好的异质智能体群体，我们研究了当智能体在每轮博弈中使用近视最佳响应规则更新策略时，合作智能体的数量如何随时间变化，并演示了如何通过改变智能体的收益矩阵来控制这一数量。代理人是异质的，因为他们的报酬矩阵可能彼此不同;我们关注的是在进化过程中固定的收益矩阵对应于囚徒困境或雪堆博弈的特定情况。为了进行稳定性分析，我们将合作智能体的数量作为感兴趣的随机变量来识别系统的吸收状态。证明了当所有智能体更新足够频繁时，可达的最终状态完全由可用的收益矩阵类型决定。作为进一步的步骤，我们展示了如何通过在进化开始时改变一组代理的收益矩阵的类型来控制最终状态。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Analysis and control of strategic interactions in finite heterogeneous populations under best-response update rule

For a finite, well-mixed population of heterogeneous agents playing evolutionary games choosing to cooperate or defect in each round of the game, we investigate, when agents update their strategies in each round using the myopic best-response rule, how the number of cooperating agents changes over time and demonstrate how to control that number by changing the agents' payoff matrices. The agents are heterogeneous in that their payoff matrices may differ from one another; we focus on the specific case when the payoff matrices, fixed throughout the evolution, correspond to prisoner's dilemma or snowdrift games. To carry out stability analysis, we identify the system's absorbing states when taking the number of cooperating agents as a random variable of interest. It is proven that when all the agents update frequently enough, the reachable final states are completely determined by the available types of payoff matrices. As a further step, we show how to control the final state by changing at the beginning of the evolution, the types of the payoff matrices of a group of agents.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2015 54th IEEE Conference on Decision and Control (CDC)

自引率

0.00%

发文量