{"title":"基于深度强化学习的禁飞区数据采集场景无人机轨迹设计","authors":"Yunfei Gao, Mingliu Liu, Ziwei Mei, Yulin Hu","doi":"10.1109/ICCC56324.2022.10065712","DOIUrl":null,"url":null,"abstract":"Recently, unmanned aerial vehicle (UAV)-assisted communication system has been introduced as a promising paradigm for the future space-aerial-terrestrial integrated communications. In this paper, we investigate an UAV communication system, where the UAV is employed to assist multiple ground loT devices for data collection in the area of interest with the existence of no-fly zones. Unlike existing approaches focusing only on simplified line-of-sigh (LoS)-dominant channel model, this paper considers a more practical probability LoS channel model, which considers path loss and shadowing. On the premise of satisfying the data throughput requirements of all ground loT devices, we intend to minimize the total task completion time by jointly optimizing UAV's trajectory and communication scheduling. To tackle the non-convex and difficult intractable problem, we first transform the original problem into an Markov decision process (MDP) problem, and then we propose a trajectory design solution based on deep reinforcement learning (DRL) algorithm for completion time minimization. The UAV serves as an agent in the process of execution algorithm, interacting with the environment and constantly improving its own mobile strategy. Finally, numerical results demonstrate that the proposed design contributes to significant performance enhancement and can be applied to practical scenarios with no-fly zones.","PeriodicalId":263098,"journal":{"name":"2022 IEEE 8th International Conference on Computer and Communications (ICCC)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Deep Reinforcement Learning Based UAV Trajectory Design for Data Collection Scenario with No-Fly Zones\",\"authors\":\"Yunfei Gao, Mingliu Liu, Ziwei Mei, Yulin Hu\",\"doi\":\"10.1109/ICCC56324.2022.10065712\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Recently, unmanned aerial vehicle (UAV)-assisted communication system has been introduced as a promising paradigm for the future space-aerial-terrestrial integrated communications. In this paper, we investigate an UAV communication system, where the UAV is employed to assist multiple ground loT devices for data collection in the area of interest with the existence of no-fly zones. Unlike existing approaches focusing only on simplified line-of-sigh (LoS)-dominant channel model, this paper considers a more practical probability LoS channel model, which considers path loss and shadowing. On the premise of satisfying the data throughput requirements of all ground loT devices, we intend to minimize the total task completion time by jointly optimizing UAV's trajectory and communication scheduling. To tackle the non-convex and difficult intractable problem, we first transform the original problem into an Markov decision process (MDP) problem, and then we propose a trajectory design solution based on deep reinforcement learning (DRL) algorithm for completion time minimization. The UAV serves as an agent in the process of execution algorithm, interacting with the environment and constantly improving its own mobile strategy. Finally, numerical results demonstrate that the proposed design contributes to significant performance enhancement and can be applied to practical scenarios with no-fly zones.\",\"PeriodicalId\":263098,\"journal\":{\"name\":\"2022 IEEE 8th International Conference on Computer and Communications (ICCC)\",\"volume\":\"11 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-12-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE 8th International Conference on Computer and Communications (ICCC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCC56324.2022.10065712\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 8th International Conference on Computer and Communications (ICCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCC56324.2022.10065712","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Deep Reinforcement Learning Based UAV Trajectory Design for Data Collection Scenario with No-Fly Zones
Recently, unmanned aerial vehicle (UAV)-assisted communication system has been introduced as a promising paradigm for the future space-aerial-terrestrial integrated communications. In this paper, we investigate an UAV communication system, where the UAV is employed to assist multiple ground loT devices for data collection in the area of interest with the existence of no-fly zones. Unlike existing approaches focusing only on simplified line-of-sigh (LoS)-dominant channel model, this paper considers a more practical probability LoS channel model, which considers path loss and shadowing. On the premise of satisfying the data throughput requirements of all ground loT devices, we intend to minimize the total task completion time by jointly optimizing UAV's trajectory and communication scheduling. To tackle the non-convex and difficult intractable problem, we first transform the original problem into an Markov decision process (MDP) problem, and then we propose a trajectory design solution based on deep reinforcement learning (DRL) algorithm for completion time minimization. The UAV serves as an agent in the process of execution algorithm, interacting with the environment and constantly improving its own mobile strategy. Finally, numerical results demonstrate that the proposed design contributes to significant performance enhancement and can be applied to practical scenarios with no-fly zones.