Jiaping Xiao, Yi Xuan Marcus Tan, Xin-qiu Zhou, M. Feroskhan
{"title":"Learning Collaborative Multi-Target Search for A Visual Drone Swarm","authors":"Jiaping Xiao, Yi Xuan Marcus Tan, Xin-qiu Zhou, M. Feroskhan","doi":"10.1109/CAI54212.2023.00012","DOIUrl":null,"url":null,"abstract":"In this paper, we propose a multi-agent reinforcement learning approach, POCA-Mix, to achieve collaborative multi-target search with a visual drone swarm. The proposed approach leverages the benefits of curriculum learning and mixed credit assignment to guide the drone swarm in per-forming the search task with only local visual perception in a constrained 3D environment. To validate the performance of the proposed approach, we conducted simulation experiments with various combinations regarding the number of drones and targets. The results demonstrate that the proposed approach outperforms other baseline methods such as PPO and MA-POCA with a higher success rate in different scenarios.","PeriodicalId":129324,"journal":{"name":"2023 IEEE Conference on Artificial Intelligence (CAI)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE Conference on Artificial Intelligence (CAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CAI54212.2023.00012","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
In this paper, we propose a multi-agent reinforcement learning approach, POCA-Mix, to achieve collaborative multi-target search with a visual drone swarm. The proposed approach leverages the benefits of curriculum learning and mixed credit assignment to guide the drone swarm in per-forming the search task with only local visual perception in a constrained 3D environment. To validate the performance of the proposed approach, we conducted simulation experiments with various combinations regarding the number of drones and targets. The results demonstrate that the proposed approach outperforms other baseline methods such as PPO and MA-POCA with a higher success rate in different scenarios.