论算法串通的内在机制

Zhang Xu, Wei Zhao
{"title":"论算法串通的内在机制","authors":"Zhang Xu, Wei Zhao","doi":"arxiv-2409.01147","DOIUrl":null,"url":null,"abstract":"Two issues of algorithmic collusion are addressed in this paper. First, we\nshow that in a general class of symmetric games, including Prisoner's Dilemma,\nBertrand competition, and any (nonlinear) mixture of first and second price\nauction, only (strict) Nash Equilibrium (NE) is stochastically stable.\nTherefore, the tacit collusion is driven by failure to learn NE due to\ninsufficient learning, instead of learning some strategies to sustain collusive\noutcomes. Second, we study how algorithms adapt to collusion in real\nsimulations with insufficient learning. Extensive explorations in early stages\nand discount factors inflates the Q-value, which interrupts the sequential and\nalternative price undercut and leads to bilateral rebound. The process is\niterated, making the price curves like Edgeworth cycles. When both exploration\nrate and Q-value decrease, algorithms may bilaterally rebound to relatively\nhigh common price level by coincidence, and then get stuck. Finally, we\naccommodate our reasoning to simulation outcomes in the literature, including\noptimistic initialization, market design and algorithm design.","PeriodicalId":501188,"journal":{"name":"arXiv - ECON - Theoretical Economics","volume":"68 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"On Mechanism Underlying Algorithmic Collusion\",\"authors\":\"Zhang Xu, Wei Zhao\",\"doi\":\"arxiv-2409.01147\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Two issues of algorithmic collusion are addressed in this paper. First, we\\nshow that in a general class of symmetric games, including Prisoner's Dilemma,\\nBertrand competition, and any (nonlinear) mixture of first and second price\\nauction, only (strict) Nash Equilibrium (NE) is stochastically stable.\\nTherefore, the tacit collusion is driven by failure to learn NE due to\\ninsufficient learning, instead of learning some strategies to sustain collusive\\noutcomes. Second, we study how algorithms adapt to collusion in real\\nsimulations with insufficient learning. Extensive explorations in early stages\\nand discount factors inflates the Q-value, which interrupts the sequential and\\nalternative price undercut and leads to bilateral rebound. The process is\\niterated, making the price curves like Edgeworth cycles. When both exploration\\nrate and Q-value decrease, algorithms may bilaterally rebound to relatively\\nhigh common price level by coincidence, and then get stuck. Finally, we\\naccommodate our reasoning to simulation outcomes in the literature, including\\noptimistic initialization, market design and algorithm design.\",\"PeriodicalId\":501188,\"journal\":{\"name\":\"arXiv - ECON - Theoretical Economics\",\"volume\":\"68 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-09-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - ECON - Theoretical Economics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2409.01147\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - ECON - Theoretical Economics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.01147","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

本文探讨了算法合谋的两个问题。首先,我们证明了在一般对称博弈中,包括囚徒困境、伯特兰德竞争和任何(非线性)第一和第二价格扣除的混合博弈中,只有(严格的)纳什均衡(NE)是随机稳定的。因此,默契串通是由于学习不足而无法学习纳什均衡,而不是学习一些策略来维持串通结果。其次,我们研究了算法如何在学习不足的真实模拟中适应合谋。在早期阶段的大量探索和折扣因素会使 Q 值膨胀,从而中断连续和替代性的压价,导致双边反弹。这一过程不断重复,使得价格曲线像埃奇沃斯循环一样。当探索率和 Q 值都下降时,算法可能会巧合地双边反弹到相对较高的共同价格水平,然后陷入僵局。最后,我们将我们的推理与文献中的模拟结果相适应,包括乐观的初始化、市场设计和算法设计。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
On Mechanism Underlying Algorithmic Collusion
Two issues of algorithmic collusion are addressed in this paper. First, we show that in a general class of symmetric games, including Prisoner's Dilemma, Bertrand competition, and any (nonlinear) mixture of first and second price auction, only (strict) Nash Equilibrium (NE) is stochastically stable. Therefore, the tacit collusion is driven by failure to learn NE due to insufficient learning, instead of learning some strategies to sustain collusive outcomes. Second, we study how algorithms adapt to collusion in real simulations with insufficient learning. Extensive explorations in early stages and discount factors inflates the Q-value, which interrupts the sequential and alternative price undercut and leads to bilateral rebound. The process is iterated, making the price curves like Edgeworth cycles. When both exploration rate and Q-value decrease, algorithms may bilaterally rebound to relatively high common price level by coincidence, and then get stuck. Finally, we accommodate our reasoning to simulation outcomes in the literature, including optimistic initialization, market design and algorithm design.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信