A hybrid P2P and master-slave architecture for intelligent multi-agent reinforcement learning in a distributed computing environment: A case study

2010 IEEE International Conference on Computational Intelligence for Measurement Systems and Applications Pub Date : 2010-10-28 DOI:10.1109/CIMSA.2010.5611770

D. B. Megherbi, M. Madera

{"title":"A hybrid P2P and master-slave architecture for intelligent multi-agent reinforcement learning in a distributed computing environment: A case study","authors":"D. B. Megherbi, M. Madera","doi":"10.1109/CIMSA.2010.5611770","DOIUrl":null,"url":null,"abstract":"In this paper, we propose a distributed architecture for reinforcement learning in a multi-agent environment, where agents share information learned via a distributed network. Here we propose a hybrid master/slave and peer-to-peer system architecture, where a master node effectively assigns a work load (a portion of the terrain) to each node. However, this master node also manages communications between all the other system nodes, and in that sense it is a peer-to-peer architecture. It is a loosely-coupled system in that node slaves only know about the existence of the master node, and are only concerned with their work load (portion of the terrain). As part of this architecture, we show how agents are allowed to communicate with other agents in the same or different nodes and share information that pertains to all agents, including the agent obstacle barriers. In particular, one main contribution of the paper is multi-agent reenforcement learning in a distributed system, where the agents do not have complete knowledge and information of their environment, other than what is available on the computing node, the particular agent (s) is (are) running on. We show how agents, running on same or different nodes, coordinate the sharing of their respective environment states/information to collaboratively perform their respective tasks.","PeriodicalId":162890,"journal":{"name":"2010 IEEE International Conference on Computational Intelligence for Measurement Systems and Applications","volume":"24 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 IEEE International Conference on Computational Intelligence for Measurement Systems and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CIMSA.2010.5611770","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 11

Abstract

In this paper, we propose a distributed architecture for reinforcement learning in a multi-agent environment, where agents share information learned via a distributed network. Here we propose a hybrid master/slave and peer-to-peer system architecture, where a master node effectively assigns a work load (a portion of the terrain) to each node. However, this master node also manages communications between all the other system nodes, and in that sense it is a peer-to-peer architecture. It is a loosely-coupled system in that node slaves only know about the existence of the master node, and are only concerned with their work load (portion of the terrain). As part of this architecture, we show how agents are allowed to communicate with other agents in the same or different nodes and share information that pertains to all agents, including the agent obstacle barriers. In particular, one main contribution of the paper is multi-agent reenforcement learning in a distributed system, where the agents do not have complete knowledge and information of their environment, other than what is available on the computing node, the particular agent (s) is (are) running on. We show how agents, running on same or different nodes, coordinate the sharing of their respective environment states/information to collaboratively perform their respective tasks.

查看原文本刊更多论文

分布式计算环境中用于智能多代理强化学习的混合P2P和主从架构:一个案例研究

在本文中，我们提出了一种在多智能体环境中用于强化学习的分布式架构，其中智能体通过分布式网络共享学习到的信息。在这里，我们提出了一个混合的主/从和点对点系统架构，其中主节点有效地为每个节点分配工作负载(地形的一部分)。但是，这个主节点还管理所有其他系统节点之间的通信，从这个意义上说，它是一个点对点架构。它是一个松散耦合的系统，因为从节点只知道主节点的存在，并且只关心它们的工作负载(地形的一部分)。作为该体系结构的一部分，我们将展示如何允许代理与相同或不同节点中的其他代理进行通信，并共享属于所有代理的信息，包括代理障碍障碍。特别是，本文的一个主要贡献是分布式系统中的多智能体强化学习，其中智能体不具有其环境的完整知识和信息，除了计算节点上可用的信息，特定的智能体正在运行。我们将展示运行在相同或不同节点上的代理如何协调各自环境状态/信息的共享，以协同执行各自的任务。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2010 IEEE International Conference on Computational Intelligence for Measurement Systems and Applications

自引率

0.00%

发文量