{"title":"Path Planning of UAV Using Step-wise Deep Q-learning Algorithm","authors":"Qijia Gu, Zhen An, Lanmin Chen, Kunfu Wang","doi":"10.1109/ICNISC57059.2022.00066","DOIUrl":null,"url":null,"abstract":"With the increasing application of the Unmanned Aerial Vehicle(UAV) technology, the path planning of UAV is becoming increasing important, However, with the increasing complexity of UAV applications, the application scenario is always complex, crowded with dense obstacles, open, and dynamic. In this paper, we dedicate to deep reinforcement learning algorithms for autonomous obstacle avoidance and navigation of UAV. The navigation problem is considered as a target-driven MDP problem, in which UAV takes its next action conditioned on both its current observation and the destination, Additionally, DRL algorithm with sparse rewards is hard to convergence, we introduce a step-wise dynamic relative goal method to extract the common feature between different navigation targets.","PeriodicalId":286467,"journal":{"name":"2022 8th Annual International Conference on Network and Information Systems for Computers (ICNISC)","volume":"76 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 8th Annual International Conference on Network and Information Systems for Computers (ICNISC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICNISC57059.2022.00066","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
With the increasing application of the Unmanned Aerial Vehicle(UAV) technology, the path planning of UAV is becoming increasing important, However, with the increasing complexity of UAV applications, the application scenario is always complex, crowded with dense obstacles, open, and dynamic. In this paper, we dedicate to deep reinforcement learning algorithms for autonomous obstacle avoidance and navigation of UAV. The navigation problem is considered as a target-driven MDP problem, in which UAV takes its next action conditioned on both its current observation and the destination, Additionally, DRL algorithm with sparse rewards is hard to convergence, we introduce a step-wise dynamic relative goal method to extract the common feature between different navigation targets.