{"title":"论多智能体强化学习中利润分配的合理性","authors":"K. Miyazaki, S. Kobayashi","doi":"10.1109/ICCIMA.2001.970506","DOIUrl":null,"url":null,"abstract":"Reinforcement learning is a kind of machine learning. It aims to adapt an agent to an unknown environment according to rewards. Traditionally, from theoretical point of view, many reinforcement learning systems assume that the environment has Markovian properties. However it is important to treat non-Markovian environments in multi-agent reinforcement learning systems. In this paper, we use Profit Sharing (PS) as a reinforcement learning system and discuss the rationality of PS in multi-agent environments. Especially, we classify non-Markovian environments and discuss how to share a reward among reinforcement learning agents. Through cranes control problem, we confirm the effectiveness of PS in multi-agent environments.","PeriodicalId":232504,"journal":{"name":"Proceedings Fourth International Conference on Computational Intelligence and Multimedia Applications. ICCIMA 2001","volume":"9 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"On the rationality of profit sharing in multi-agent reinforcement learning\",\"authors\":\"K. Miyazaki, S. Kobayashi\",\"doi\":\"10.1109/ICCIMA.2001.970506\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Reinforcement learning is a kind of machine learning. It aims to adapt an agent to an unknown environment according to rewards. Traditionally, from theoretical point of view, many reinforcement learning systems assume that the environment has Markovian properties. However it is important to treat non-Markovian environments in multi-agent reinforcement learning systems. In this paper, we use Profit Sharing (PS) as a reinforcement learning system and discuss the rationality of PS in multi-agent environments. Especially, we classify non-Markovian environments and discuss how to share a reward among reinforcement learning agents. Through cranes control problem, we confirm the effectiveness of PS in multi-agent environments.\",\"PeriodicalId\":232504,\"journal\":{\"name\":\"Proceedings Fourth International Conference on Computational Intelligence and Multimedia Applications. ICCIMA 2001\",\"volume\":\"9 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1900-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings Fourth International Conference on Computational Intelligence and Multimedia Applications. ICCIMA 2001\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCIMA.2001.970506\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings Fourth International Conference on Computational Intelligence and Multimedia Applications. ICCIMA 2001","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCIMA.2001.970506","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
On the rationality of profit sharing in multi-agent reinforcement learning
Reinforcement learning is a kind of machine learning. It aims to adapt an agent to an unknown environment according to rewards. Traditionally, from theoretical point of view, many reinforcement learning systems assume that the environment has Markovian properties. However it is important to treat non-Markovian environments in multi-agent reinforcement learning systems. In this paper, we use Profit Sharing (PS) as a reinforcement learning system and discuss the rationality of PS in multi-agent environments. Especially, we classify non-Markovian environments and discuss how to share a reward among reinforcement learning agents. Through cranes control problem, we confirm the effectiveness of PS in multi-agent environments.