重复游戏的逆向归纳

CoRR Pub Date : 2018-04-19 DOI:10.4204/EPTCS.275.5

Jules Hedges

{"title":"重复游戏的逆向归纳","authors":"Jules Hedges","doi":"10.4204/EPTCS.275.5","DOIUrl":null,"url":null,"abstract":"We present a method of backward induction for computing approximate subgame perfect Nash equilibria of infinitely repeated games with discounted payoffs. This uses the selection monad transformer, combined with the searchable set monad viewed as a notion of 'topologically compact' nondeterminism, and a simple model of computable real numbers. This is the first application of Escard\\'o and Oliva's theory of higher-order sequential games to games of imperfect information, in which (as well as its mathematical elegance) lazy evaluation does nontrivial work for us compared with a traditional game-theoretic analysis. Since a full theoretical understanding of this method is lacking (and appears to be very hard), we consider this an 'experimental' paper heavily inspired by theoretical ideas. We use the famous Iterated Prisoner's Dilemma as a worked example.","PeriodicalId":10720,"journal":{"name":"CoRR","volume":"27 1","pages":"35-52"},"PeriodicalIF":0.0000,"publicationDate":"2018-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Backward induction for repeated games\",\"authors\":\"Jules Hedges\",\"doi\":\"10.4204/EPTCS.275.5\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We present a method of backward induction for computing approximate subgame perfect Nash equilibria of infinitely repeated games with discounted payoffs. This uses the selection monad transformer, combined with the searchable set monad viewed as a notion of 'topologically compact' nondeterminism, and a simple model of computable real numbers. This is the first application of Escard\\\\'o and Oliva's theory of higher-order sequential games to games of imperfect information, in which (as well as its mathematical elegance) lazy evaluation does nontrivial work for us compared with a traditional game-theoretic analysis. Since a full theoretical understanding of this method is lacking (and appears to be very hard), we consider this an 'experimental' paper heavily inspired by theoretical ideas. We use the famous Iterated Prisoner's Dilemma as a worked example.\",\"PeriodicalId\":10720,\"journal\":{\"name\":\"CoRR\",\"volume\":\"27 1\",\"pages\":\"35-52\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-04-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"CoRR\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.4204/EPTCS.275.5\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"CoRR","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4204/EPTCS.275.5","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 6

摘要

提出了一种计算具有折现收益的无限重复对策的近似子对策完美纳什均衡的逆向归纳法。它使用选择单子转换器，结合可搜索集合单子(被视为“拓扑紧凑型”不确定性的概念)和可计算实数的简单模型。这是Escard\ o和Oliva的高阶序列博弈理论在不完全信息博弈中的首次应用，与传统博弈论分析相比，惰性评估(以及它的数学优雅)为我们提供了重要的工作。由于缺乏对这种方法的全面理论理解(并且似乎非常困难)，我们认为这是一篇受到理论思想启发的“实验性”论文。我们以著名的迭代囚徒困境为例。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Backward induction for repeated games

We present a method of backward induction for computing approximate subgame perfect Nash equilibria of infinitely repeated games with discounted payoffs. This uses the selection monad transformer, combined with the searchable set monad viewed as a notion of 'topologically compact' nondeterminism, and a simple model of computable real numbers. This is the first application of Escard\'o and Oliva's theory of higher-order sequential games to games of imperfect information, in which (as well as its mathematical elegance) lazy evaluation does nontrivial work for us compared with a traditional game-theoretic analysis. Since a full theoretical understanding of this method is lacking (and appears to be very hard), we consider this an 'experimental' paper heavily inspired by theoretical ideas. We use the famous Iterated Prisoner's Dilemma as a worked example.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

CoRR

自引率

0.00%

发文量