Semantic Communication for Partial Observation Multi-agent Reinforcement Learning

2023 IEEE Statistical Signal Processing Workshop (SSP) Pub Date : 2023-07-02 DOI:10.1109/SSP53291.2023.10207979

Hoang Khoi Do, Thi Quynh Khanh Dinh, Minh Duong Nguyen, Tien Hoa Nguyen

{"title":"Semantic Communication for Partial Observation Multi-agent Reinforcement Learning","authors":"Hoang Khoi Do, Thi Quynh Khanh Dinh, Minh Duong Nguyen, Tien Hoa Nguyen","doi":"10.1109/SSP53291.2023.10207979","DOIUrl":null,"url":null,"abstract":"Effective cooperation and coordination among agents is essential for success in many real-world scenarios, particularly in reinforcement learning challenges. However, partial observation, where agents are not aware of all the observations made by other agents, creates a significant obstacle to coordination. To overcome this challenge, we propose the Shared Online Multi-agent Knowledge Exchange (SOME) framework, which allows agents to learn to anticipate each other’s observations and improve their local learning. In SOME, agents learn to anticipate the observations of other agents to improve their local learning, allowing for better coordination and cooperation. Additionally, using knowledge generators instead of full observations reduces communication costs. Our experimental evaluation demonstrates that agents trained with SOME can not only predict the next observations and actions of opponents and collaborators but also take appropriate actions, making it a promising approach for overcoming the partial observation challenge in multi-agent reinforcement learning.","PeriodicalId":296346,"journal":{"name":"2023 IEEE Statistical Signal Processing Workshop (SSP)","volume":"36 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE Statistical Signal Processing Workshop (SSP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SSP53291.2023.10207979","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Effective cooperation and coordination among agents is essential for success in many real-world scenarios, particularly in reinforcement learning challenges. However, partial observation, where agents are not aware of all the observations made by other agents, creates a significant obstacle to coordination. To overcome this challenge, we propose the Shared Online Multi-agent Knowledge Exchange (SOME) framework, which allows agents to learn to anticipate each other’s observations and improve their local learning. In SOME, agents learn to anticipate the observations of other agents to improve their local learning, allowing for better coordination and cooperation. Additionally, using knowledge generators instead of full observations reduces communication costs. Our experimental evaluation demonstrates that agents trained with SOME can not only predict the next observations and actions of opponents and collaborators but also take appropriate actions, making it a promising approach for overcoming the partial observation challenge in multi-agent reinforcement learning.

查看原文本刊更多论文

部分观察多智能体强化学习的语义通信

在许多现实场景中，智能体之间的有效合作和协调对于成功至关重要，特别是在强化学习挑战中。然而，部分观察，即代理不知道其他代理所做的所有观察，对协调造成了重大障碍。为了克服这一挑战，我们提出了共享在线多智能体知识交换(SOME)框架，该框架允许智能体学习预测彼此的观察结果并改进其局部学习。在SOME中，智能体学会预测其他智能体的观察结果，以改善它们的局部学习，从而实现更好的协调与合作。此外，使用知识生成器而不是完整的观察可以降低沟通成本。我们的实验评估表明，使用SOME训练的智能体不仅可以预测对手和合作者的下一个观察和行动，而且还可以采取适当的行动，这使得它成为克服多智能体强化学习中部分观察挑战的一种有希望的方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2023 IEEE Statistical Signal Processing Workshop (SSP)

自引率

0.00%

发文量