Anna Guerra, Francesco Guidi, D. Dardari, P. Djurić
{"title":"Multi-Agent Q-Learning in UAV Networks for Target Detection and Indoor Mapping","authors":"Anna Guerra, Francesco Guidi, D. Dardari, P. Djurić","doi":"10.1109/BalkanCom53780.2021.9593232","DOIUrl":null,"url":null,"abstract":"We consider a network of unmanned aerial vehicles (UAVs) for a search-and-rescue operations involving both detection of multiple targets and mapping of environment, where the learning time is limited. One possibility for accomplishing the goal while guaranteeing short learning time is to employ cooperation among UAVs. With this objective, we adopt a multi-agent Q-learning algorithm that allows the UAVs to learn a suitable navigation policy in real-time in order to complete a mission within a fixed time frame. The obtained results demonstrate that proper combination of the information gathered by the UAVs allows for an accelerated learning process.","PeriodicalId":115090,"journal":{"name":"2021 International Balkan Conference on Communications and Networking (BalkanCom)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Balkan Conference on Communications and Networking (BalkanCom)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BalkanCom53780.2021.9593232","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6
Abstract
We consider a network of unmanned aerial vehicles (UAVs) for a search-and-rescue operations involving both detection of multiple targets and mapping of environment, where the learning time is limited. One possibility for accomplishing the goal while guaranteeing short learning time is to employ cooperation among UAVs. With this objective, we adopt a multi-agent Q-learning algorithm that allows the UAVs to learn a suitable navigation policy in real-time in order to complete a mission within a fixed time frame. The obtained results demonstrate that proper combination of the information gathered by the UAVs allows for an accelerated learning process.