在分散的多智能体环境中,从公共制裁中获得社会规范的学习型智能体

Eugene Vinitsky, R. Koster, J. Agapiou, Edgar A. Duéñez-Guzmán, A. Vezhnevets, Joel Z. Leibo
{"title":"在分散的多智能体环境中,从公共制裁中获得社会规范的学习型智能体","authors":"Eugene Vinitsky, R. Koster, J. Agapiou, Edgar A. Duéñez-Guzmán, A. Vezhnevets, Joel Z. Leibo","doi":"10.1177/26339137231162025","DOIUrl":null,"url":null,"abstract":"Society is characterized by the presence of a variety of social norms: collective patterns of sanctioning that can prevent miscoordination and free-riding. Inspired by this, we aim to construct learning dynamics where potentially beneficial social norms can emerge. Since social norms are underpinned by sanctioning, we introduce a training regime where agents can access all sanctioning events but learning is otherwise decentralized. This setting is technologically interesting because sanctioning events may be the only available public signal in decentralized multi-agent systems where reward or policy-sharing is infeasible or undesirable. To achieve collective action in this setting, we construct an agent architecture containing a classifier module that categorizes observed behaviors as approved or disapproved, and a motivation to punish in accord with the group. We show that social norms emerge in multi-agent systems containing this agent and investigate the conditions under which this helps them achieve socially beneficial outcomes.","PeriodicalId":93948,"journal":{"name":"Collective intelligence","volume":"76 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2021-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"14","resultStr":"{\"title\":\"A learning agent that acquires social norms from public sanctions in decentralized multi-agent settings\",\"authors\":\"Eugene Vinitsky, R. Koster, J. Agapiou, Edgar A. Duéñez-Guzmán, A. Vezhnevets, Joel Z. Leibo\",\"doi\":\"10.1177/26339137231162025\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Society is characterized by the presence of a variety of social norms: collective patterns of sanctioning that can prevent miscoordination and free-riding. Inspired by this, we aim to construct learning dynamics where potentially beneficial social norms can emerge. Since social norms are underpinned by sanctioning, we introduce a training regime where agents can access all sanctioning events but learning is otherwise decentralized. This setting is technologically interesting because sanctioning events may be the only available public signal in decentralized multi-agent systems where reward or policy-sharing is infeasible or undesirable. To achieve collective action in this setting, we construct an agent architecture containing a classifier module that categorizes observed behaviors as approved or disapproved, and a motivation to punish in accord with the group. We show that social norms emerge in multi-agent systems containing this agent and investigate the conditions under which this helps them achieve socially beneficial outcomes.\",\"PeriodicalId\":93948,\"journal\":{\"name\":\"Collective intelligence\",\"volume\":\"76 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-06-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"14\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Collective intelligence\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1177/26339137231162025\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Collective intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1177/26339137231162025","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 14

摘要

社会的特点是存在各种各样的社会规范:集体制裁模式,可以防止不协调和搭便车。受此启发,我们的目标是构建学习动态,其中可能出现有益的社会规范。由于社会规范以制裁为基础,我们引入了一种培训制度,在这种制度下,代理人可以访问所有制裁事件,但学习是分散的。这种设置在技术上是有趣的,因为制裁事件可能是分散的多代理系统中唯一可用的公共信号,奖励或策略共享是不可行的或不受欢迎的。为了在这种情况下实现集体行动,我们构建了一个代理体系结构,该体系结构包含一个分类器模块,该模块将观察到的行为分类为批准或不批准,以及一个与群体一致的惩罚动机。我们表明,社会规范出现在包含这个主体的多主体系统中,并研究了在什么条件下这有助于他们实现对社会有益的结果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A learning agent that acquires social norms from public sanctions in decentralized multi-agent settings
Society is characterized by the presence of a variety of social norms: collective patterns of sanctioning that can prevent miscoordination and free-riding. Inspired by this, we aim to construct learning dynamics where potentially beneficial social norms can emerge. Since social norms are underpinned by sanctioning, we introduce a training regime where agents can access all sanctioning events but learning is otherwise decentralized. This setting is technologically interesting because sanctioning events may be the only available public signal in decentralized multi-agent systems where reward or policy-sharing is infeasible or undesirable. To achieve collective action in this setting, we construct an agent architecture containing a classifier module that categorizes observed behaviors as approved or disapproved, and a motivation to punish in accord with the group. We show that social norms emerge in multi-agent systems containing this agent and investigate the conditions under which this helps them achieve socially beneficial outcomes.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信