{"title":"论多智能体强化学习中利润分配的合理性","authors":"K. Miyazaki, S. Kobayashi","doi":"10.1109/ICCIMA.2001.970455","DOIUrl":null,"url":null,"abstract":"Reinforcement learning is a kind of machine learning. It aims to adapt an agent to an unknown environment according to rewards. Traditionally, from a theoretical point of view, many reinforcement learning systems assume that the environment has Markovian properties. However, it is important to treat non-Markovian environments in multi-agent reinforcement learning systems. The authors use Profit Sharing (PS) as a reinforcement learning system and discuss the rationality of PS in multi-agent environments. In particular, we classify non-Markovian environments and discuss how to share a reward among reinforcement learning agents. Through a crane control problem, we confirm the effectiveness of PS in multi-agent environments.","PeriodicalId":232504,"journal":{"name":"Proceedings Fourth International Conference on Computational Intelligence and Multimedia Applications. ICCIMA 2001","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2001-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"On the rationality of profit sharing in multi-agent reinforcement learning\",\"authors\":\"K. Miyazaki, S. Kobayashi\",\"doi\":\"10.1109/ICCIMA.2001.970455\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Reinforcement learning is a kind of machine learning. It aims to adapt an agent to an unknown environment according to rewards. Traditionally, from a theoretical point of view, many reinforcement learning systems assume that the environment has Markovian properties. However, it is important to treat non-Markovian environments in multi-agent reinforcement learning systems. The authors use Profit Sharing (PS) as a reinforcement learning system and discuss the rationality of PS in multi-agent environments. In particular, we classify non-Markovian environments and discuss how to share a reward among reinforcement learning agents. Through a crane control problem, we confirm the effectiveness of PS in multi-agent environments.\",\"PeriodicalId\":232504,\"journal\":{\"name\":\"Proceedings Fourth International Conference on Computational Intelligence and Multimedia Applications. ICCIMA 2001\",\"volume\":\"14 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2001-10-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings Fourth International Conference on Computational Intelligence and Multimedia Applications. ICCIMA 2001\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCIMA.2001.970455\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings Fourth International Conference on Computational Intelligence and Multimedia Applications. ICCIMA 2001","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCIMA.2001.970455","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
On the rationality of profit sharing in multi-agent reinforcement learning
Reinforcement learning is a kind of machine learning. It aims to adapt an agent to an unknown environment according to rewards. Traditionally, from a theoretical point of view, many reinforcement learning systems assume that the environment has Markovian properties. However, it is important to treat non-Markovian environments in multi-agent reinforcement learning systems. The authors use Profit Sharing (PS) as a reinforcement learning system and discuss the rationality of PS in multi-agent environments. In particular, we classify non-Markovian environments and discuss how to share a reward among reinforcement learning agents. Through a crane control problem, we confirm the effectiveness of PS in multi-agent environments.