Multi-agent reinforcement learning via distributed MPC as a function approximator

IF 4.8 2区计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS

Automatica Pub Date : 2024-06-22 DOI:10.1016/j.automatica.2024.111803

Samuel Mallick, Filippo Airaldi, Azita Dabiri, Bart De Schutter

引用次数: 0

Abstract

This paper presents a novel approach to multi-agent reinforcement learning (RL) for linear systems with convex polytopic constraints. Existing work on RL has demonstrated the use of model predictive control (MPC) as a function approximator for the policy and value functions. The current paper is the first work to extend this idea to the multi-agent setting. We propose the use of a distributed MPC scheme as a function approximator, with a structure allowing for distributed learning and deployment. We then show that Q-learning updates can be performed distributively without introducing nonstationarity, by reconstructing a centralized learning update. The effectiveness of the approach is demonstrated on a numerical example.

查看原文本刊更多论文

通过分布式 MPC 作为函数近似器进行多代理强化学习

本文提出了一种针对具有凸多顶约束条件的线性系统的多代理强化学习（RL）新方法。现有的 RL 研究已经证明，可以使用模型预测控制 (MPC) 作为策略和价值函数的函数近似值。本文是首次将这一想法扩展到多代理环境的研究。我们提出使用分布式 MPC 方案作为函数近似器，其结构允许分布式学习和部署。然后，我们展示了 Q 学习更新可以通过重构集中学习更新，在不引入非稳态的情况下分布式地执行。我们通过一个数值示例证明了该方法的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Automatica 工程技术-工程：电子与电气

CiteScore

10.70

自引率

7.80%

发文量

617

审稿时长

5 months

期刊介绍： Automatica is a leading archival publication in the field of systems and control. The field encompasses today a broad set of areas and topics, and is thriving not only within itself but also in terms of its impact on other fields, such as communications, computers, biology, energy and economics. Since its inception in 1963, Automatica has kept abreast with the evolution of the field over the years, and has emerged as a leading publication driving the trends in the field. After being founded in 1963, Automatica became a journal of the International Federation of Automatic Control (IFAC) in 1969. It features a characteristic blend of theoretical and applied papers of archival, lasting value, reporting cutting edge research results by authors across the globe. It features articles in distinct categories, including regular, brief and survey papers, technical communiqués, correspondence items, as well as reviews on published books of interest to the readership. It occasionally publishes special issues on emerging new topics or established mature topics of interest to a broad audience. Automatica solicits original high-quality contributions in all the categories listed above, and in all areas of systems and control interpreted in a broad sense and evolving constantly. They may be submitted directly to a subject editor or to the Editor-in-Chief if not sure about the subject area. Editorial procedures in place assure careful, fair, and prompt handling of all submitted articles. Accepted papers appear in the journal in the shortest time feasible given production time constraints.

文献相关原料

公司名称	产品信息	采购帮参考价格