{"title":"Learning Nudges for Conditional Cooperation: A Multi-Agent Reinforcement Learning Model","authors":"Shatayu Kulkarni, Sabine Brunswicker","doi":"arxiv-2409.09509","DOIUrl":null,"url":null,"abstract":"The public goods game describes a social dilemma in which a large proportion\nof agents act as conditional cooperators (CC): they only act cooperatively if\nthey see others acting cooperatively because they satisfice with the social\nnorm to be in line with what others are doing instead of optimizing\ncooperation. CCs are guided by aspiration-based reinforcement learning guided\nby past experiences of interactions with others and satisficing aspirations. In\nmany real-world settings, reinforcing social norms do not emerge. In this\npaper, we propose that an optimizing reinforcement agent can facilitate\ncooperation through nudges, i.e. indirect mechanisms for cooperation to happen.\nThe agent's goal is to motivate CCs into cooperation through its own actions to\ncreate social norms that signal that others are cooperating. We introduce a\nmulti-agent reinforcement learning model for public goods games, with 3 CC\nlearning agents using aspirational reinforcement learning and 1 nudging agent\nusing deep reinforcement learning to learn nudges that optimize cooperation.\nFor our nudging agent, we model two distinct reward functions, one maximizing\nthe total game return (sum DRL) and one maximizing the number of cooperative\ncontributions contributions higher than a proportional threshold (prop DRL).\nOur results show that our aspiration-based RL model for CC agents is consistent\nwith empirically observed CC behavior. Games combining 3 CC RL agents and one\nnudging RL agent outperform the baseline consisting of 4 CC RL agents only. The\nsum DRL nudging agent increases the total sum of contributions by 8.22% and the\ntotal proportion of cooperative contributions by 12.42%, while the prop nudging\nDRL increases the total sum of contributions by 8.85% and the total proportion\nof cooperative contributions by 14.87%. Our findings advance the literature on\npublic goods games and reinforcement learning.","PeriodicalId":501315,"journal":{"name":"arXiv - CS - Multiagent Systems","volume":"208 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Multiagent Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.09509","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The public goods game describes a social dilemma in which a large proportion
of agents act as conditional cooperators (CC): they only act cooperatively if
they see others acting cooperatively because they satisfice with the social
norm to be in line with what others are doing instead of optimizing
cooperation. CCs are guided by aspiration-based reinforcement learning guided
by past experiences of interactions with others and satisficing aspirations. In
many real-world settings, reinforcing social norms do not emerge. In this
paper, we propose that an optimizing reinforcement agent can facilitate
cooperation through nudges, i.e. indirect mechanisms for cooperation to happen.
The agent's goal is to motivate CCs into cooperation through its own actions to
create social norms that signal that others are cooperating. We introduce a
multi-agent reinforcement learning model for public goods games, with 3 CC
learning agents using aspirational reinforcement learning and 1 nudging agent
using deep reinforcement learning to learn nudges that optimize cooperation.
For our nudging agent, we model two distinct reward functions, one maximizing
the total game return (sum DRL) and one maximizing the number of cooperative
contributions contributions higher than a proportional threshold (prop DRL).
Our results show that our aspiration-based RL model for CC agents is consistent
with empirically observed CC behavior. Games combining 3 CC RL agents and one
nudging RL agent outperform the baseline consisting of 4 CC RL agents only. The
sum DRL nudging agent increases the total sum of contributions by 8.22% and the
total proportion of cooperative contributions by 12.42%, while the prop nudging
DRL increases the total sum of contributions by 8.85% and the total proportion
of cooperative contributions by 14.87%. Our findings advance the literature on
public goods games and reinforcement learning.