Multi-Agent Q-Learning in UAV Networks for Target Detection and Indoor Mapping

2021 International Balkan Conference on Communications and Networking (BalkanCom). Pub Date: 2021-09-20. DOI: 10.1109/BalkanCom53780.2021.9593232

Anna Guerra, Francesco Guidi, D. Dardari, P. Djurić

Citations: 6

Abstract

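We consider a network of unmanned aerial vehicles (UAVs) for search-and-rescue operations that involve both detecting multiple targets and mapping the environment, where the learning time is limited. One way to accomplish the goal while keeping the learning time short is to have the UAVs cooperate. With this objective, we adopt a multi-agent Q-learning algorithm that allows the UAVs to learn a suitable navigation policy in real time in order to complete a mission within a fixed time frame. The results show that properly combining the information gathered by the UAVs accelerates the learning process.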
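The abstract does not spell out the algorithm, so the following is only a minimal sketch of one way cooperative tabular Q-learning could look, not the authors' method. It assumes a small grid world, a position-only state, a reward for reaching cells no agent has seen yet, and cooperation through a single Q-table and visited set shared by all agents. The names `q_update`, `choose_action`, `step`, and `run_episode`, and all parameter values, are hypothetical.

```python
import random

# Assumed action set: right, left, down, up on a square grid.
ACTIONS = [(0, 1), (0, -1), (1, 0), (-1, 0)]


def q_update(q, state, action, reward, next_state, alpha=0.5, gamma=0.9):
    """Apply one tabular Q-learning update in place and return the new value."""
    best_next = max(q.get((next_state, a), 0.0) for a in range(len(ACTIONS)))
    old = q.get((state, action), 0.0)
    q[(state, action)] = old + alpha * (reward + gamma * best_next - old)
    return q[(state, action)]


def choose_action(q, state, epsilon, rng):
    """Epsilon-greedy selection over the shared Q-table."""
    if rng.random() < epsilon:
        return rng.randrange(len(ACTIONS))
    values = [q.get((state, a), 0.0) for a in range(len(ACTIONS))]
    return values.index(max(values))


def step(state, action, size):
    """Move on a size x size grid; moves into a wall leave the agent in place."""
    row, col = state
    d_row, d_col = ACTIONS[action]
    new_row = min(max(row + d_row, 0), size - 1)
    new_col = min(max(col + d_col, 0), size - 1)
    return (new_row, new_col)


def run_episode(q, starts, size=4, horizon=30, epsilon=0.2, seed=0):
    """Run all agents for a fixed number of steps (the mission time).

    Assumption: cooperation means every agent reads and writes the same
    Q-table and the same set of visited cells.
    """
    rng = random.Random(seed)
    visited = set(starts)
    positions = list(starts)
    for _ in range(horizon):
        for i, state in enumerate(positions):
            action = choose_action(q, state, epsilon, rng)
            next_state = step(state, action, size)
            # Assumed reward: bonus for a newly mapped cell, small cost otherwise.
            reward = 1.0 if next_state not in visited else -0.1
            visited.add(next_state)
            q_update(q, state, action, reward, next_state)
            positions[i] = next_state
    return len(visited)


shared_q = {}
cells_mapped = run_episode(shared_q, starts=[(0, 0), (3, 3)])
```

Sharing one table means an update made by any agent is immediately available to the others, which is one simple reading of "combining the information gathered by the UAVs". Other schemes, such as periodically merging per-agent tables, would fit the abstract equally well.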

Source publication

2021 International Balkan Conference on Communications and Networking (BalkanCom)

Self-citation rate

0.00%

Publication count