Reputation in public goods cooperation under double Q-learning protocol

IF 5.3 1区 数学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS
Kai Xie , Attila Szolnoki
{"title":"Reputation in public goods cooperation under double Q-learning protocol","authors":"Kai Xie ,&nbsp;Attila Szolnoki","doi":"10.1016/j.chaos.2025.116398","DOIUrl":null,"url":null,"abstract":"<div><div>Understanding and resolving cooperation dilemmas are key challenges in evolutionary game theory, which have revealed several mechanisms to address them. This paper investigates the comprehensive influence of multiple reputation-related components on public cooperation. In particular, cooperative investments in public goods game are not fixed but simultaneously depend on the reputation of group organizers and the population’s cooperation willingness, hence indirectly impacting on the players’ income. Additionally, individual payoff can also be directly affected by their reputation via a weighted approach which effectively evaluates the actual income of players. Unlike conventional models, the reputation change of players is non-monotonic, but may transform abruptly due to specific actions. Importantly, a theoretically supported double <span><math><mi>Q</mi></math></span>-learning algorithm is introduced to avoid overestimation bias inherent from the classical <span><math><mi>Q</mi></math></span>-learning algorithm. Our simulations reveal a significantly improved cooperation level, that is explained by a detailed <span><math><mi>Q</mi></math></span>-value analysis. We also observe the lack of massive cooperative clusters in the absence of network reciprocity. At the same time, as an intriguing phenomenon, some actors maintain moderate reputation and are continuously flipping between cooperation and defection. The robustness of our results are validated by mean-field approximation.</div></div>","PeriodicalId":9764,"journal":{"name":"Chaos Solitons & Fractals","volume":"196 ","pages":"Article 116398"},"PeriodicalIF":5.3000,"publicationDate":"2025-04-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Chaos Solitons & Fractals","FirstCategoryId":"100","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0960077925004114","RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MATHEMATICS, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0

Abstract

Understanding and resolving cooperation dilemmas are key challenges in evolutionary game theory, which have revealed several mechanisms to address them. This paper investigates the comprehensive influence of multiple reputation-related components on public cooperation. In particular, cooperative investments in public goods game are not fixed but simultaneously depend on the reputation of group organizers and the population’s cooperation willingness, hence indirectly impacting on the players’ income. Additionally, individual payoff can also be directly affected by their reputation via a weighted approach which effectively evaluates the actual income of players. Unlike conventional models, the reputation change of players is non-monotonic, but may transform abruptly due to specific actions. Importantly, a theoretically supported double Q-learning algorithm is introduced to avoid overestimation bias inherent from the classical Q-learning algorithm. Our simulations reveal a significantly improved cooperation level, that is explained by a detailed Q-value analysis. We also observe the lack of massive cooperative clusters in the absence of network reciprocity. At the same time, as an intriguing phenomenon, some actors maintain moderate reputation and are continuously flipping between cooperation and defection. The robustness of our results are validated by mean-field approximation.
双q学习协议下的公共产品合作声誉
理解和解决合作困境是进化博弈论的关键挑战,进化博弈论揭示了解决合作困境的几种机制。本文研究了多个声誉相关因素对公众合作的综合影响。特别是在公共物品博弈中,合作投资不是固定的,同时取决于群体组织者的声誉和群体的合作意愿,从而间接影响参与者的收入。此外,个人收益也会直接受到声誉的影响,这是一种评估玩家实际收入的加权方法。与传统模型不同,参与者的声誉变化是非单调的,但可能会因特定的行为而突然转变。重要的是,引入了一种理论上支持的双q -学习算法,以避免经典q -学习算法固有的高估偏差。我们的模拟显示合作水平显著提高,这是通过详细的q值分析来解释的。我们还观察到,在缺乏网络互惠的情况下,缺乏大规模的合作集群。与此同时,作为一个有趣的现象,一些演员保持适度的声誉,不断地在合作和背叛之间摇摆。通过平均场近似验证了结果的鲁棒性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Chaos Solitons & Fractals
Chaos Solitons & Fractals 物理-数学跨学科应用
CiteScore
13.20
自引率
10.30%
发文量
1087
审稿时长
9 months
期刊介绍: Chaos, Solitons & Fractals strives to establish itself as a premier journal in the interdisciplinary realm of Nonlinear Science, Non-equilibrium, and Complex Phenomena. It welcomes submissions covering a broad spectrum of topics within this field, including dynamics, non-equilibrium processes in physics, chemistry, and geophysics, complex matter and networks, mathematical models, computational biology, applications to quantum and mesoscopic phenomena, fluctuations and random processes, self-organization, and social phenomena.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信