协同多智能体系统中混合多线程和分层强化学习方法的实例研究

2015 Fourteenth Mexican International Conference on Artificial Intelligence (MICAI) Pub Date : 2015-10-25 DOI:10.1109/MICAI.2015.20

Hiram Ponce, Ricardo Padilla, Alan Davalos, Alvaro Herrasti, Cynthia Pichardo, Daniel Dovali

{"title":"协同多智能体系统中混合多线程和分层强化学习方法的实例研究","authors":"Hiram Ponce, Ricardo Padilla, Alan Davalos, Alvaro Herrasti, Cynthia Pichardo, Daniel Dovali","doi":"10.1109/MICAI.2015.20","DOIUrl":null,"url":null,"abstract":"This paper describes a case study about a multi-agent system for cooperative tasks, i.e. a mixing color task given three different sources of color. A reinforcement learning approach was performed by the agents, however, this type of learning exploits exponentially when the number of states in the environment is very large. In that sense, the paper proposes to use the MaxQ-Q hierarchical reinforcement learning algorithm to obtain a suitable policy for agents in order to minimize the time process to achieve the goal, and to reduce the state space. In addition, since the multi-agent system runs in a software application, a multi-threading paradigm was proposed to use. Experimental results show that this multi-agent system can reduce the time process and still maintain independence of agents.","PeriodicalId":448255,"journal":{"name":"2015 Fourteenth Mexican International Conference on Artificial Intelligence (MICAI)","volume":"128 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Case Study in Hybrid Multi-threading and Hierarchical Reinforcement Learning Approach for Cooperative Multi-agent Systems\",\"authors\":\"Hiram Ponce, Ricardo Padilla, Alan Davalos, Alvaro Herrasti, Cynthia Pichardo, Daniel Dovali\",\"doi\":\"10.1109/MICAI.2015.20\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper describes a case study about a multi-agent system for cooperative tasks, i.e. a mixing color task given three different sources of color. A reinforcement learning approach was performed by the agents, however, this type of learning exploits exponentially when the number of states in the environment is very large. In that sense, the paper proposes to use the MaxQ-Q hierarchical reinforcement learning algorithm to obtain a suitable policy for agents in order to minimize the time process to achieve the goal, and to reduce the state space. In addition, since the multi-agent system runs in a software application, a multi-threading paradigm was proposed to use. Experimental results show that this multi-agent system can reduce the time process and still maintain independence of agents.\",\"PeriodicalId\":448255,\"journal\":{\"name\":\"2015 Fourteenth Mexican International Conference on Artificial Intelligence (MICAI)\",\"volume\":\"128 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-10-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2015 Fourteenth Mexican International Conference on Artificial Intelligence (MICAI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/MICAI.2015.20\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 Fourteenth Mexican International Conference on Artificial Intelligence (MICAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MICAI.2015.20","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

本文描述了一个多智能体系统的合作任务的案例研究，即给定三种不同的颜色来源的混合颜色任务。智能体执行了一种强化学习方法，然而，当环境中的状态数量非常大时，这种类型的学习会呈指数级增长。在这种意义上，本文提出使用MaxQ-Q分层强化学习算法来获得适合agent的策略，以最小化实现目标的时间过程，并减小状态空间。此外，由于多智能体系统运行在软件应用程序中，因此提出了一种多线程模式。实验结果表明，该多智能体系统可以在减少时间过程的同时保持智能体的独立性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A Case Study in Hybrid Multi-threading and Hierarchical Reinforcement Learning Approach for Cooperative Multi-agent Systems

This paper describes a case study about a multi-agent system for cooperative tasks, i.e. a mixing color task given three different sources of color. A reinforcement learning approach was performed by the agents, however, this type of learning exploits exponentially when the number of states in the environment is very large. In that sense, the paper proposes to use the MaxQ-Q hierarchical reinforcement learning algorithm to obtain a suitable policy for agents in order to minimize the time process to achieve the goal, and to reduce the state space. In addition, since the multi-agent system runs in a software application, a multi-threading paradigm was proposed to use. Experimental results show that this multi-agent system can reduce the time process and still maintain independence of agents.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2015 Fourteenth Mexican International Conference on Artificial Intelligence (MICAI)

自引率

0.00%

发文量