Semantic Communication for Partial Observation Multi-agent Reinforcement Learning

Hoang Khoi Do, Thi Quynh Khanh Dinh, Minh Duong Nguyen, Tien Hoa Nguyen
{"title":"Semantic Communication for Partial Observation Multi-agent Reinforcement Learning","authors":"Hoang Khoi Do, Thi Quynh Khanh Dinh, Minh Duong Nguyen, Tien Hoa Nguyen","doi":"10.1109/SSP53291.2023.10207979","DOIUrl":null,"url":null,"abstract":"Effective cooperation and coordination among agents is essential for success in many real-world scenarios, particularly in reinforcement learning challenges. However, partial observation, where agents are not aware of all the observations made by other agents, creates a significant obstacle to coordination. To overcome this challenge, we propose the Shared Online Multi-agent Knowledge Exchange (SOME) framework, which allows agents to learn to anticipate each other’s observations and improve their local learning. In SOME, agents learn to anticipate the observations of other agents to improve their local learning, allowing for better coordination and cooperation. Additionally, using knowledge generators instead of full observations reduces communication costs. Our experimental evaluation demonstrates that agents trained with SOME can not only predict the next observations and actions of opponents and collaborators but also take appropriate actions, making it a promising approach for overcoming the partial observation challenge in multi-agent reinforcement learning.","PeriodicalId":296346,"journal":{"name":"2023 IEEE Statistical Signal Processing Workshop (SSP)","volume":"36 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE Statistical Signal Processing Workshop (SSP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SSP53291.2023.10207979","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Effective cooperation and coordination among agents is essential for success in many real-world scenarios, particularly in reinforcement learning challenges. However, partial observation, where agents are not aware of all the observations made by other agents, creates a significant obstacle to coordination. To overcome this challenge, we propose the Shared Online Multi-agent Knowledge Exchange (SOME) framework, which allows agents to learn to anticipate each other’s observations and improve their local learning. In SOME, agents learn to anticipate the observations of other agents to improve their local learning, allowing for better coordination and cooperation. Additionally, using knowledge generators instead of full observations reduces communication costs. Our experimental evaluation demonstrates that agents trained with SOME can not only predict the next observations and actions of opponents and collaborators but also take appropriate actions, making it a promising approach for overcoming the partial observation challenge in multi-agent reinforcement learning.
部分观察多智能体强化学习的语义通信
在许多现实场景中,智能体之间的有效合作和协调对于成功至关重要,特别是在强化学习挑战中。然而,部分观察,即代理不知道其他代理所做的所有观察,对协调造成了重大障碍。为了克服这一挑战,我们提出了共享在线多智能体知识交换(SOME)框架,该框架允许智能体学习预测彼此的观察结果并改进其局部学习。在SOME中,智能体学会预测其他智能体的观察结果,以改善它们的局部学习,从而实现更好的协调与合作。此外,使用知识生成器而不是完整的观察可以降低沟通成本。我们的实验评估表明,使用SOME训练的智能体不仅可以预测对手和合作者的下一个观察和行动,而且还可以采取适当的行动,这使得它成为克服多智能体强化学习中部分观察挑战的一种有希望的方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信