Efficiency and fairness trade-offs in two player bargaining games

IF 1.8 1区哲学 Q1 HISTORY & PHILOSOPHY OF SCIENCE

European Journal for Philosophy of Science Pub Date : 2023-10-24 DOI:10.1007/s13194-023-00553-6

David Freeborn

{"title":"Efficiency and fairness trade-offs in two player bargaining games","authors":"David Freeborn","doi":"10.1007/s13194-023-00553-6","DOIUrl":null,"url":null,"abstract":"<p>Recent work on the evolution of social contracts and conventions has often used models of bargaining games, with reinforcement learning. A recent innovation is the requirement that every strategy must be invented either through through learning or reinforcement. However, agents frequently get stuck in highly-reinforced “traps” that prevent them from arriving at outcomes that are efficient or fair to the both players. Agents face a trade-off between exploration and exploitation, i.e. between continuing to invent new strategies and reinforcing strategies that have already become highly reinforced by yielding high rewards. In this paper I systematically study the relationship between rates of invention and the efficiency and fairness of outcomes in two-player, repeated bargaining games. I use a basic reinforcement learning model with invention, and five variations of this model, designed introduce various forms of forgetting, to prioritize more recent reinforcement, or to maintain a higher rate of invention. I use computer simulations to investigate the outcomes of each model. Each models shows qualitative similarities in the relationship between the efficiency and fairness of outcomes, and the relative amount of exploration or exploitation that takes place. Surprisingly, there are often trade-offs between the efficiency and the fairness of the outcomes.</p>","PeriodicalId":48832,"journal":{"name":"European Journal for Philosophy of Science","volume":"5 12","pages":""},"PeriodicalIF":1.8000,"publicationDate":"2023-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Journal for Philosophy of Science","FirstCategoryId":"98","ListUrlMain":"https://doi.org/10.1007/s13194-023-00553-6","RegionNum":1,"RegionCategory":"哲学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"HISTORY & PHILOSOPHY OF SCIENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Recent work on the evolution of social contracts and conventions has often used models of bargaining games, with reinforcement learning. A recent innovation is the requirement that every strategy must be invented either through through learning or reinforcement. However, agents frequently get stuck in highly-reinforced “traps” that prevent them from arriving at outcomes that are efficient or fair to the both players. Agents face a trade-off between exploration and exploitation, i.e. between continuing to invent new strategies and reinforcing strategies that have already become highly reinforced by yielding high rewards. In this paper I systematically study the relationship between rates of invention and the efficiency and fairness of outcomes in two-player, repeated bargaining games. I use a basic reinforcement learning model with invention, and five variations of this model, designed introduce various forms of forgetting, to prioritize more recent reinforcement, or to maintain a higher rate of invention. I use computer simulations to investigate the outcomes of each model. Each models shows qualitative similarities in the relationship between the efficiency and fairness of outcomes, and the relative amount of exploration or exploitation that takes place. Surprisingly, there are often trade-offs between the efficiency and the fairness of the outcomes.

查看原文本刊更多论文

两人讨价还价博弈中的效率与公平权衡

最近关于社会契约和惯例演变的工作经常使用谈判游戏模型，并进行强化学习。最近的一项创新是要求每一种策略都必须通过学习或强化来发明。然而，特工们经常陷入高度强化的“陷阱”，这使他们无法达成对双方都有效或公平的结果。代理人面临着探索和利用之间的权衡，即在继续发明新策略和强化策略之间，这些策略已经通过产生高回报而得到高度强化。在本文中，我系统地研究了两人重复讨价还价游戏中发明率与结果的效率和公平性之间的关系。我使用了一个带有发明的基本强化学习模型，以及该模型的五个变体，旨在引入各种形式的遗忘，以优先考虑最近的强化，或保持更高的发明率。我使用计算机模拟来研究每个模型的结果。每一个模型都显示了结果的效率和公平性以及相对勘探或开采量之间的关系在质量上的相似性。令人惊讶的是，结果的效率和公平性之间往往存在权衡。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

European Journal for Philosophy of Science HISTORY & PHILOSOPHY OF SCIENCE-

CiteScore

2.60

自引率

13.30%

发文量

期刊介绍： The European Journal for Philosophy of Science publishes groundbreaking works that can deepen understanding of the concepts and methods of the sciences, as they explore increasingly many facets of the world we live in. It is of direct interest to philosophers of science coming from different perspectives, as well as scientists, citizens and policymakers. The journal is interested in articles from all traditions and all backgrounds, as long as they engage with the sciences in a constructive, and critical, way. The journal represents the various longstanding European philosophical traditions engaging with the sciences, but welcomes articles from every part of the world.