为取得更好的社会成果而进行补贴设计

Maria-Florina Balcan, Matteo Pozzi, Dravyansh Sharma
{"title":"为取得更好的社会成果而进行补贴设计","authors":"Maria-Florina Balcan, Matteo Pozzi, Dravyansh Sharma","doi":"arxiv-2409.03129","DOIUrl":null,"url":null,"abstract":"Overcoming the impact of selfish behavior of rational players in multiagent\nsystems is a fundamental problem in game theory. Without any intervention from\na central agent, strategic users take actions in order to maximize their\npersonal utility, which can lead to extremely inefficient overall system\nperformance, often indicated by a high Price of Anarchy. Recent work (Lin et\nal. 2021) investigated and formalized yet another undesirable behavior of\nrational agents, that of avoiding freely available information about the game\nfor selfish reasons, leading to worse social outcomes. A central planner can\nsignificantly mitigate these issues by injecting a subsidy to reduce certain\ncosts associated with the system and obtain net gains in the system\nperformance. Crucially, the planner needs to determine how to allocate this\nsubsidy effectively. We formally show that designing subsidies that perfectly optimize the social\ngood, in terms of minimizing the Price of Anarchy or preventing the information\navoidance behavior, is computationally hard under standard complexity theoretic\nassumptions. On the positive side, we show that we can learn provably good\nvalues of subsidy in repeated games coming from the same domain. This\ndata-driven subsidy design approach avoids solving computationally hard\nproblems for unseen games by learning over polynomially many games. We also\nshow that optimal subsidy can be learned with no-regret given an online\nsequence of games, under mild assumptions on the cost matrix. Our study focuses\non two distinct games: a Bayesian extension of the well-studied fair\ncost-sharing game, and a component maintenance game with engineering\napplications.","PeriodicalId":501316,"journal":{"name":"arXiv - CS - Computer Science and Game Theory","volume":"98 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Subsidy design for better social outcomes\",\"authors\":\"Maria-Florina Balcan, Matteo Pozzi, Dravyansh Sharma\",\"doi\":\"arxiv-2409.03129\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Overcoming the impact of selfish behavior of rational players in multiagent\\nsystems is a fundamental problem in game theory. Without any intervention from\\na central agent, strategic users take actions in order to maximize their\\npersonal utility, which can lead to extremely inefficient overall system\\nperformance, often indicated by a high Price of Anarchy. Recent work (Lin et\\nal. 2021) investigated and formalized yet another undesirable behavior of\\nrational agents, that of avoiding freely available information about the game\\nfor selfish reasons, leading to worse social outcomes. A central planner can\\nsignificantly mitigate these issues by injecting a subsidy to reduce certain\\ncosts associated with the system and obtain net gains in the system\\nperformance. Crucially, the planner needs to determine how to allocate this\\nsubsidy effectively. We formally show that designing subsidies that perfectly optimize the social\\ngood, in terms of minimizing the Price of Anarchy or preventing the information\\navoidance behavior, is computationally hard under standard complexity theoretic\\nassumptions. On the positive side, we show that we can learn provably good\\nvalues of subsidy in repeated games coming from the same domain. This\\ndata-driven subsidy design approach avoids solving computationally hard\\nproblems for unseen games by learning over polynomially many games. We also\\nshow that optimal subsidy can be learned with no-regret given an online\\nsequence of games, under mild assumptions on the cost matrix. Our study focuses\\non two distinct games: a Bayesian extension of the well-studied fair\\ncost-sharing game, and a component maintenance game with engineering\\napplications.\",\"PeriodicalId\":501316,\"journal\":{\"name\":\"arXiv - CS - Computer Science and Game Theory\",\"volume\":\"98 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-09-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - CS - Computer Science and Game Theory\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2409.03129\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Computer Science and Game Theory","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.03129","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

克服多代理系统中理性玩家自私行为的影响是博弈论中的一个基本问题。在没有中心代理干预的情况下,策略用户会采取行动以最大化其个人效用,这可能导致系统整体性能效率极低,通常表现为高无政府价格。最近的研究(Lin 等人,2021 年)对理性代理的另一种不良行为进行了研究,并将其形式化,即出于自私的原因而回避有关游戏的免费信息,从而导致更糟糕的社会结果。中央规划者可以通过注入补贴来降低与系统相关的某些成本,并获得系统性能的净收益,从而显著缓解这些问题。最关键的是,规划者需要确定如何有效地分配补贴。我们正式证明,在标准复杂性理论假设下,要设计出能完美优化社会福利的补贴,使无政府价格最小化或防止信息规避行为,在计算上是很困难的。从积极的一面来看,我们证明了在来自同一领域的重复博弈中,我们可以学习到可证明的良好补贴值。这种数据驱动的补贴设计方法通过对多项式多场博弈的学习,避免了解决未见博弈的计算困难问题。我们还证明,在成本矩阵的温和假设下,最优补贴可以在一连串博弈中无遗憾地学习到。我们的研究主要集中在两个不同的博弈上:一个是贝叶斯扩展的公平成本分摊博弈,另一个是具有工程应用价值的组件维护博弈。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Subsidy design for better social outcomes
Overcoming the impact of selfish behavior of rational players in multiagent systems is a fundamental problem in game theory. Without any intervention from a central agent, strategic users take actions in order to maximize their personal utility, which can lead to extremely inefficient overall system performance, often indicated by a high Price of Anarchy. Recent work (Lin et al. 2021) investigated and formalized yet another undesirable behavior of rational agents, that of avoiding freely available information about the game for selfish reasons, leading to worse social outcomes. A central planner can significantly mitigate these issues by injecting a subsidy to reduce certain costs associated with the system and obtain net gains in the system performance. Crucially, the planner needs to determine how to allocate this subsidy effectively. We formally show that designing subsidies that perfectly optimize the social good, in terms of minimizing the Price of Anarchy or preventing the information avoidance behavior, is computationally hard under standard complexity theoretic assumptions. On the positive side, we show that we can learn provably good values of subsidy in repeated games coming from the same domain. This data-driven subsidy design approach avoids solving computationally hard problems for unseen games by learning over polynomially many games. We also show that optimal subsidy can be learned with no-regret given an online sequence of games, under mild assumptions on the cost matrix. Our study focuses on two distinct games: a Bayesian extension of the well-studied fair cost-sharing game, and a component maintenance game with engineering applications.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信