{"title":"基于小鸡博弈的多智能体协同强化学习方法","authors":"Zihui Wang, Zhi Wang, Chunlin Chen","doi":"10.1109/ICNSC48988.2020.9238072","DOIUrl":null,"url":null,"abstract":"Sparse interaction in multiagent tasks is an important approach to reduce the exponential computational cost for multiagent reinforcement learning (MARL) systems. How to select proper equilibrium solutions is the key to find the optimal policy and to improve the learning performance when collisions occur. We propose a new MARL algorithm, Efficient Coordination based MARL with Sparse Interactions (ECoSI), using the sparse interaction framework and an efficient coordination mechanism, where equilibrium solutions are selected via Nash equilibrium and Chicken game. ECoSI not only separates the Q-value updating rule in joint states from non-joint states with sparse interactions to achieve lower computation and storage complexity, but also takes advantage of efficient coordination with equilibrium solutions to find the optimal policy. Experimental results demonstrate the effectiveness and robustness of ECoSI compared to other state-of-the-art MARL algorithms.","PeriodicalId":412290,"journal":{"name":"2020 IEEE International Conference on Networking, Sensing and Control (ICNSC)","volume":"29 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"A Coordinated Multiagent Reinforcement Learning Method Using Chicken Game\",\"authors\":\"Zihui Wang, Zhi Wang, Chunlin Chen\",\"doi\":\"10.1109/ICNSC48988.2020.9238072\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Sparse interaction in multiagent tasks is an important approach to reduce the exponential computational cost for multiagent reinforcement learning (MARL) systems. How to select proper equilibrium solutions is the key to find the optimal policy and to improve the learning performance when collisions occur. We propose a new MARL algorithm, Efficient Coordination based MARL with Sparse Interactions (ECoSI), using the sparse interaction framework and an efficient coordination mechanism, where equilibrium solutions are selected via Nash equilibrium and Chicken game. ECoSI not only separates the Q-value updating rule in joint states from non-joint states with sparse interactions to achieve lower computation and storage complexity, but also takes advantage of efficient coordination with equilibrium solutions to find the optimal policy. Experimental results demonstrate the effectiveness and robustness of ECoSI compared to other state-of-the-art MARL algorithms.\",\"PeriodicalId\":412290,\"journal\":{\"name\":\"2020 IEEE International Conference on Networking, Sensing and Control (ICNSC)\",\"volume\":\"29 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-10-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 IEEE International Conference on Networking, Sensing and Control (ICNSC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICNSC48988.2020.9238072\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE International Conference on Networking, Sensing and Control (ICNSC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICNSC48988.2020.9238072","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
摘要
多智能体任务中的稀疏交互是降低多智能体强化学习(MARL)系统指数计算成本的重要途径。如何选择合适的平衡解是在发生碰撞时找到最优策略和提高学习性能的关键。本文提出了一种新的MARL算法——基于稀疏交互的高效协调(Efficient Coordination based MARL with Sparse Interactions, ECoSI),该算法利用稀疏交互框架和高效协调机制,通过纳什均衡和Chicken game选择均衡解。ECoSI不仅将联合状态下的q值更新规则从相互作用稀疏的非联合状态中分离出来,降低了计算和存储复杂度,而且利用与平衡解的有效协调来寻找最优策略。实验结果证明了ECoSI与其他先进的MARL算法相比的有效性和鲁棒性。
A Coordinated Multiagent Reinforcement Learning Method Using Chicken Game
Sparse interaction in multiagent tasks is an important approach to reduce the exponential computational cost for multiagent reinforcement learning (MARL) systems. How to select proper equilibrium solutions is the key to find the optimal policy and to improve the learning performance when collisions occur. We propose a new MARL algorithm, Efficient Coordination based MARL with Sparse Interactions (ECoSI), using the sparse interaction framework and an efficient coordination mechanism, where equilibrium solutions are selected via Nash equilibrium and Chicken game. ECoSI not only separates the Q-value updating rule in joint states from non-joint states with sparse interactions to achieve lower computation and storage complexity, but also takes advantage of efficient coordination with equilibrium solutions to find the optimal policy. Experimental results demonstrate the effectiveness and robustness of ECoSI compared to other state-of-the-art MARL algorithms.