Hiram Ponce, Ricardo Padilla, Alan Davalos, Alvaro Herrasti, Cynthia Pichardo, Daniel Dovali
{"title":"协同多智能体系统中混合多线程和分层强化学习方法的实例研究","authors":"Hiram Ponce, Ricardo Padilla, Alan Davalos, Alvaro Herrasti, Cynthia Pichardo, Daniel Dovali","doi":"10.1109/MICAI.2015.20","DOIUrl":null,"url":null,"abstract":"This paper describes a case study about a multi-agent system for cooperative tasks, i.e. a mixing color task given three different sources of color. A reinforcement learning approach was performed by the agents, however, this type of learning exploits exponentially when the number of states in the environment is very large. In that sense, the paper proposes to use the MaxQ-Q hierarchical reinforcement learning algorithm to obtain a suitable policy for agents in order to minimize the time process to achieve the goal, and to reduce the state space. In addition, since the multi-agent system runs in a software application, a multi-threading paradigm was proposed to use. Experimental results show that this multi-agent system can reduce the time process and still maintain independence of agents.","PeriodicalId":448255,"journal":{"name":"2015 Fourteenth Mexican International Conference on Artificial Intelligence (MICAI)","volume":"128 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Case Study in Hybrid Multi-threading and Hierarchical Reinforcement Learning Approach for Cooperative Multi-agent Systems\",\"authors\":\"Hiram Ponce, Ricardo Padilla, Alan Davalos, Alvaro Herrasti, Cynthia Pichardo, Daniel Dovali\",\"doi\":\"10.1109/MICAI.2015.20\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper describes a case study about a multi-agent system for cooperative tasks, i.e. a mixing color task given three different sources of color. A reinforcement learning approach was performed by the agents, however, this type of learning exploits exponentially when the number of states in the environment is very large. In that sense, the paper proposes to use the MaxQ-Q hierarchical reinforcement learning algorithm to obtain a suitable policy for agents in order to minimize the time process to achieve the goal, and to reduce the state space. In addition, since the multi-agent system runs in a software application, a multi-threading paradigm was proposed to use. Experimental results show that this multi-agent system can reduce the time process and still maintain independence of agents.\",\"PeriodicalId\":448255,\"journal\":{\"name\":\"2015 Fourteenth Mexican International Conference on Artificial Intelligence (MICAI)\",\"volume\":\"128 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-10-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2015 Fourteenth Mexican International Conference on Artificial Intelligence (MICAI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/MICAI.2015.20\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 Fourteenth Mexican International Conference on Artificial Intelligence (MICAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MICAI.2015.20","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Case Study in Hybrid Multi-threading and Hierarchical Reinforcement Learning Approach for Cooperative Multi-agent Systems
This paper describes a case study about a multi-agent system for cooperative tasks, i.e. a mixing color task given three different sources of color. A reinforcement learning approach was performed by the agents, however, this type of learning exploits exponentially when the number of states in the environment is very large. In that sense, the paper proposes to use the MaxQ-Q hierarchical reinforcement learning algorithm to obtain a suitable policy for agents in order to minimize the time process to achieve the goal, and to reduce the state space. In addition, since the multi-agent system runs in a software application, a multi-threading paradigm was proposed to use. Experimental results show that this multi-agent system can reduce the time process and still maintain independence of agents.