On Learning Stable Cooperation in the Iterated Prisoner's Dilemma with Paid Incentives

Xiyue Sun, Fabian R. Pieroth, Kyrill Schmid, M. Wirsing, Lenz Belzner
{"title":"On Learning Stable Cooperation in the Iterated Prisoner's Dilemma with Paid Incentives","authors":"Xiyue Sun, Fabian R. Pieroth, Kyrill Schmid, M. Wirsing, Lenz Belzner","doi":"10.1109/ICDCSW56584.2022.00031","DOIUrl":null,"url":null,"abstract":"An essential step towards collective intelligence in systems comprised of multiple independent and autonomous agents is that individual decision-makers are capable of acting cooperatively. Cooperation is especially challenging in environ-ments where collective and individual rationality diverge, like in the Prisoner's Dilemma (PD), which is often used to test whether algorithms are capable of circumventing the single non-optimal Nash equilibrium. In this paper, we extend the approach “Learning to Incentivize other Learning Agents” in two ways: 1. We analyze the impact of the payoff matrices on incentive updates, as different payoff matrices could accelerate or decelerate the growth of incentives. 2. We adapt the concept of the market from “Action Markets in Deep Multi-Agent Reinforcement Learning” to iterated PD games as to trade incentives, i.e., the final revenue of the agent is the game revenue minus the incentive it provided, and propose (sufficient) conditions for reaching stable two-way cooperation under specific assumptions.","PeriodicalId":357138,"journal":{"name":"2022 IEEE 42nd International Conference on Distributed Computing Systems Workshops (ICDCSW)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 42nd International Conference on Distributed Computing Systems Workshops (ICDCSW)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDCSW56584.2022.00031","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

An essential step towards collective intelligence in systems comprised of multiple independent and autonomous agents is that individual decision-makers are capable of acting cooperatively. Cooperation is especially challenging in environ-ments where collective and individual rationality diverge, like in the Prisoner's Dilemma (PD), which is often used to test whether algorithms are capable of circumventing the single non-optimal Nash equilibrium. In this paper, we extend the approach “Learning to Incentivize other Learning Agents” in two ways: 1. We analyze the impact of the payoff matrices on incentive updates, as different payoff matrices could accelerate or decelerate the growth of incentives. 2. We adapt the concept of the market from “Action Markets in Deep Multi-Agent Reinforcement Learning” to iterated PD games as to trade incentives, i.e., the final revenue of the agent is the game revenue minus the incentive it provided, and propose (sufficient) conditions for reaching stable two-way cooperation under specific assumptions.
有偿激励下迭代囚徒困境下的学习稳定合作
在由多个独立和自主的主体组成的系统中,迈向集体智慧的重要一步是,单个决策者能够合作行动。在集体和个人理性分歧的环境中,合作尤其具有挑战性,比如囚徒困境(PD),它通常用于测试算法是否能够绕过单一非最优纳什均衡。在本文中,我们从两个方面扩展了“学习激励其他学习主体”的方法:1。我们分析了报酬矩阵对激励更新的影响,因为不同的报酬矩阵可以加速或减缓激励的增长。2. 我们将“Deep Multi-Agent Reinforcement Learning中的Action Markets”中的市场概念应用到迭代PD博弈的交易激励中,即agent的最终收益为博弈收益减去其提供的激励,并提出了在特定假设下实现稳定双向合作的(充分)条件。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信