论算法串通的内在机制

arXiv - ECON - Theoretical Economics Pub Date : 2024-09-02 DOI:arxiv-2409.01147

Zhang Xu, Wei Zhao

{"title":"论算法串通的内在机制","authors":"Zhang Xu, Wei Zhao","doi":"arxiv-2409.01147","DOIUrl":null,"url":null,"abstract":"Two issues of algorithmic collusion are addressed in this paper. First, we\nshow that in a general class of symmetric games, including Prisoner's Dilemma,\nBertrand competition, and any (nonlinear) mixture of first and second price\nauction, only (strict) Nash Equilibrium (NE) is stochastically stable.\nTherefore, the tacit collusion is driven by failure to learn NE due to\ninsufficient learning, instead of learning some strategies to sustain collusive\noutcomes. Second, we study how algorithms adapt to collusion in real\nsimulations with insufficient learning. Extensive explorations in early stages\nand discount factors inflates the Q-value, which interrupts the sequential and\nalternative price undercut and leads to bilateral rebound. The process is\niterated, making the price curves like Edgeworth cycles. When both exploration\nrate and Q-value decrease, algorithms may bilaterally rebound to relatively\nhigh common price level by coincidence, and then get stuck. Finally, we\naccommodate our reasoning to simulation outcomes in the literature, including\noptimistic initialization, market design and algorithm design.","PeriodicalId":501188,"journal":{"name":"arXiv - ECON - Theoretical Economics","volume":"68 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"On Mechanism Underlying Algorithmic Collusion\",\"authors\":\"Zhang Xu, Wei Zhao\",\"doi\":\"arxiv-2409.01147\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Two issues of algorithmic collusion are addressed in this paper. First, we\\nshow that in a general class of symmetric games, including Prisoner's Dilemma,\\nBertrand competition, and any (nonlinear) mixture of first and second price\\nauction, only (strict) Nash Equilibrium (NE) is stochastically stable.\\nTherefore, the tacit collusion is driven by failure to learn NE due to\\ninsufficient learning, instead of learning some strategies to sustain collusive\\noutcomes. Second, we study how algorithms adapt to collusion in real\\nsimulations with insufficient learning. Extensive explorations in early stages\\nand discount factors inflates the Q-value, which interrupts the sequential and\\nalternative price undercut and leads to bilateral rebound. The process is\\niterated, making the price curves like Edgeworth cycles. When both exploration\\nrate and Q-value decrease, algorithms may bilaterally rebound to relatively\\nhigh common price level by coincidence, and then get stuck. Finally, we\\naccommodate our reasoning to simulation outcomes in the literature, including\\noptimistic initialization, market design and algorithm design.\",\"PeriodicalId\":501188,\"journal\":{\"name\":\"arXiv - ECON - Theoretical Economics\",\"volume\":\"68 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-09-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - ECON - Theoretical Economics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2409.01147\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - ECON - Theoretical Economics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.01147","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

本文探讨了算法合谋的两个问题。首先，我们证明了在一般对称博弈中，包括囚徒困境、伯特兰德竞争和任何（非线性）第一和第二价格扣除的混合博弈中，只有（严格的）纳什均衡（NE）是随机稳定的。因此，默契串通是由于学习不足而无法学习纳什均衡，而不是学习一些策略来维持串通结果。其次，我们研究了算法如何在学习不足的真实模拟中适应合谋。在早期阶段的大量探索和折扣因素会使 Q 值膨胀，从而中断连续和替代性的压价，导致双边反弹。这一过程不断重复，使得价格曲线像埃奇沃斯循环一样。当探索率和 Q 值都下降时，算法可能会巧合地双边反弹到相对较高的共同价格水平，然后陷入僵局。最后，我们将我们的推理与文献中的模拟结果相适应，包括乐观的初始化、市场设计和算法设计。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

On Mechanism Underlying Algorithmic Collusion

Two issues of algorithmic collusion are addressed in this paper. First, we show that in a general class of symmetric games, including Prisoner's Dilemma, Bertrand competition, and any (nonlinear) mixture of first and second price auction, only (strict) Nash Equilibrium (NE) is stochastically stable. Therefore, the tacit collusion is driven by failure to learn NE due to insufficient learning, instead of learning some strategies to sustain collusive outcomes. Second, we study how algorithms adapt to collusion in real simulations with insufficient learning. Extensive explorations in early stages and discount factors inflates the Q-value, which interrupts the sequential and alternative price undercut and leads to bilateral rebound. The process is iterated, making the price curves like Edgeworth cycles. When both exploration rate and Q-value decrease, algorithms may bilaterally rebound to relatively high common price level by coincidence, and then get stuck. Finally, we accommodate our reasoning to simulation outcomes in the literature, including optimistic initialization, market design and algorithm design.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

arXiv - ECON - Theoretical Economics

自引率

0.00%

发文量