Multi-agent differential graphical games: Nash online adaptive learning solutions

52nd IEEE Conference on Decision and Control Pub Date : 2013-12-01 DOI:10.1109/CDC.2013.6760804

M. Abouheaf, F. Lewis

引用次数: 29

Abstract

This paper studies a class of multi-agent graphical games denoted by differential graphical games, where interactions between agents are prescribed by a communication graph structure. Ideas from cooperative control are given to achieve synchronization among the agents to a leader dynamics. New coupled Bellman and Hamilton-Jacobi-Bellman equations are developed for this class of games using Integral Reinforcement Learning. Nash solutions are given in terms of solutions to a set of coupled continuous-time Hamilton-Jacobi-Bellman equations. A multi-agent policy iteration algorithm is given to learn the Nash solution in real time without knowing the complete dynamic models of the agents. A proof of convergence for this algorithm is given. An online multi-agent method based on policy iterations is developed using a critic network to solve all the Hamilton-Jacobi-Bellman equations simultaneously for the graphical game.

查看原文本刊更多论文

多智能体微分图形游戏:纳什在线自适应学习解决方案

本文研究了一类多智能体图形对策，用微分图形对策来表示，其中智能体之间的相互作用用通信图结构来规定。给出了协作控制的思想，以实现agent之间对leader动态的同步。针对这类对策，利用积分强化学习建立了新的耦合Bellman方程和Hamilton-Jacobi-Bellman方程。Nash解是用一组连续耦合Hamilton-Jacobi-Bellman方程的解来表示的。提出了一种多智能体策略迭代算法，在不知道智能体完整动态模型的情况下实时学习纳什解。给出了该算法的收敛性证明。提出了一种基于策略迭代的在线多智能体方法，利用批评家网络同时求解图形博弈的所有Hamilton-Jacobi-Bellman方程。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

52nd IEEE Conference on Decision and Control

自引率

0.00%

发文量