通过强化学习和代理建模模拟理性对经济的影响

arXiv - QuantFin - Economics Pub Date : 2024-05-03 DOI:arxiv-2405.02161

Simone Brusatin, Tommaso Padoan, Andrea Coletta, Domenico Delli Gatti, Aldo Glielmo

{"title":"通过强化学习和代理建模模拟理性对经济的影响","authors":"Simone Brusatin, Tommaso Padoan, Andrea Coletta, Domenico Delli Gatti, Aldo Glielmo","doi":"arxiv-2405.02161","DOIUrl":null,"url":null,"abstract":"Agent-based models (ABMs) are simulation models used in economics to overcome\nsome of the limitations of traditional frameworks based on general equilibrium\nassumptions. However, agents within an ABM follow predetermined, not fully\nrational, behavioural rules which can be cumbersome to design and difficult to\njustify. Here we leverage multi-agent reinforcement learning (RL) to expand the\ncapabilities of ABMs with the introduction of fully rational agents that learn\ntheir policy by interacting with the environment and maximising a reward\nfunction. Specifically, we propose a 'Rational macro ABM' (R-MABM) framework by\nextending a paradigmatic macro ABM from the economic literature. We show that\ngradually substituting ABM firms in the model with RL agents, trained to\nmaximise profits, allows for a thorough study of the impact of rationality on\nthe economy. We find that RL agents spontaneously learn three distinct\nstrategies for maximising profits, with the optimal strategy depending on the\nlevel of market competition and rationality. We also find that RL agents with\nindependent policies, and without the ability to communicate with each other,\nspontaneously learn to segregate into different strategic groups, thus\nincreasing market power and overall profits. Finally, we find that a higher\ndegree of rationality in the economy always improves the macroeconomic\nenvironment as measured by total output, depending on the specific rational\npolicy, this can come at the cost of higher instability. Our R-MABM framework\nis general, it allows for stable multi-agent learning, and represents a\nprincipled and robust direction to extend existing economic simulators.","PeriodicalId":501487,"journal":{"name":"arXiv - QuantFin - Economics","volume":"10 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-05-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Simulating the economic impact of rationality through reinforcement learning and agent-based modelling\",\"authors\":\"Simone Brusatin, Tommaso Padoan, Andrea Coletta, Domenico Delli Gatti, Aldo Glielmo\",\"doi\":\"arxiv-2405.02161\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Agent-based models (ABMs) are simulation models used in economics to overcome\\nsome of the limitations of traditional frameworks based on general equilibrium\\nassumptions. However, agents within an ABM follow predetermined, not fully\\nrational, behavioural rules which can be cumbersome to design and difficult to\\njustify. Here we leverage multi-agent reinforcement learning (RL) to expand the\\ncapabilities of ABMs with the introduction of fully rational agents that learn\\ntheir policy by interacting with the environment and maximising a reward\\nfunction. Specifically, we propose a 'Rational macro ABM' (R-MABM) framework by\\nextending a paradigmatic macro ABM from the economic literature. We show that\\ngradually substituting ABM firms in the model with RL agents, trained to\\nmaximise profits, allows for a thorough study of the impact of rationality on\\nthe economy. We find that RL agents spontaneously learn three distinct\\nstrategies for maximising profits, with the optimal strategy depending on the\\nlevel of market competition and rationality. We also find that RL agents with\\nindependent policies, and without the ability to communicate with each other,\\nspontaneously learn to segregate into different strategic groups, thus\\nincreasing market power and overall profits. Finally, we find that a higher\\ndegree of rationality in the economy always improves the macroeconomic\\nenvironment as measured by total output, depending on the specific rational\\npolicy, this can come at the cost of higher instability. Our R-MABM framework\\nis general, it allows for stable multi-agent learning, and represents a\\nprincipled and robust direction to extend existing economic simulators.\",\"PeriodicalId\":501487,\"journal\":{\"name\":\"arXiv - QuantFin - Economics\",\"volume\":\"10 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-05-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - QuantFin - Economics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2405.02161\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - QuantFin - Economics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2405.02161","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

基于代理的模型（ABMs）是经济学中用来克服基于一般均衡假设的传统框架的一些局限性的模拟模型。然而，ABM 中的代理遵循预定的、非完全理性的行为规则，这可能会给设计带来麻烦，也很难证明其合理性。在这里，我们利用多代理强化学习（RL）来扩展人工智能模型的能力，引入完全理性的代理，通过与环境互动和最大化奖励函数来学习其政策。具体来说，我们提出了一个 "理性宏观 ABM"（R-MABM）框架，扩展了经济文献中的一个典型宏观 ABM。我们的研究表明，将模型中的 ABM 公司逐步替换为 RL 代理，并对其进行利润最大化训练，可以深入研究理性对经济的影响。我们发现，RL 代理会自发学习三种不同的利润最大化策略，最佳策略取决于市场竞争程度和理性程度。我们还发现，具有独立政策且无法相互沟通的 RL 代理会自发地学习隔离成不同的战略集团，从而提高市场力量和整体利润。最后，我们发现，经济中较高程度的理性总是能改善以总产出衡量的宏观经济环境，这取决于具体的理性政策，但其代价可能是更高的不稳定性。我们的 R-MABM 框架是通用的，它允许稳定的多代理学习，是扩展现有经济模拟器的一个原则性和稳健的方向。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Simulating the economic impact of rationality through reinforcement learning and agent-based modelling

Agent-based models (ABMs) are simulation models used in economics to overcome some of the limitations of traditional frameworks based on general equilibrium assumptions. However, agents within an ABM follow predetermined, not fully rational, behavioural rules which can be cumbersome to design and difficult to justify. Here we leverage multi-agent reinforcement learning (RL) to expand the capabilities of ABMs with the introduction of fully rational agents that learn their policy by interacting with the environment and maximising a reward function. Specifically, we propose a 'Rational macro ABM' (R-MABM) framework by extending a paradigmatic macro ABM from the economic literature. We show that gradually substituting ABM firms in the model with RL agents, trained to maximise profits, allows for a thorough study of the impact of rationality on the economy. We find that RL agents spontaneously learn three distinct strategies for maximising profits, with the optimal strategy depending on the level of market competition and rationality. We also find that RL agents with independent policies, and without the ability to communicate with each other, spontaneously learn to segregate into different strategic groups, thus increasing market power and overall profits. Finally, we find that a higher degree of rationality in the economy always improves the macroeconomic environment as measured by total output, depending on the specific rational policy, this can come at the cost of higher instability. Our R-MABM framework is general, it allows for stable multi-agent learning, and represents a principled and robust direction to extend existing economic simulators.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

arXiv - QuantFin - Economics

自引率

0.00%

发文量