{"title":"UAV Swarm Communication Aware Formation Control via Deep Q Network","authors":"Chengtao Xu, Kai Zhang, H. Song","doi":"10.1109/IPCCC50635.2020.9391509","DOIUrl":null,"url":null,"abstract":"We propose a DQN based reinforcement learning method in swarm communication constraint based formation control with target searching function. A decentralized communication performance indicator is applied in evaluating the UAV’s formation control in simulating a more realistic wireless communication environment between each UAV. A target searching model based on communication aware formation control is presented. The simulation results show that the trained model with state observation space and one thrust action space could be applied in the larger swarm system’s group formation and target point tracking.","PeriodicalId":226034,"journal":{"name":"2020 IEEE 39th International Performance Computing and Communications Conference (IPCCC)","volume":"194 3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE 39th International Performance Computing and Communications Conference (IPCCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IPCCC50635.2020.9391509","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
We propose a DQN based reinforcement learning method in swarm communication constraint based formation control with target searching function. A decentralized communication performance indicator is applied in evaluating the UAV’s formation control in simulating a more realistic wireless communication environment between each UAV. A target searching model based on communication aware formation control is presented. The simulation results show that the trained model with state observation space and one thrust action space could be applied in the larger swarm system’s group formation and target point tracking.