Cellular Connected UAV Anti-Interference Path Planning Based on PDS-DDPG and TOPEM

Quanxi Zhou;Yongjing Wang;Ruiyu Shen;Jin Nakazato;Manabu Tsukada;Zhenyu Guan
IEEE Journal on Miniaturization for Air and Space Systems, vol. 6, no. 1, pp. 2-18. Published 2024-11-04. DOI: 10.1109/JMASS.2024.3490762. URL: https://ieeexplore.ieee.org/document/10742197/
Citations: 0

Abstract

Due to the randomness of channel fading, communication devices, and malicious interference sources, uncrewed aerial vehicles (UAVs) face complex and ever-changing task scenarios, which pose significant communication security challenges such as transmission outages. Fortunately, these challenges can be recast as a path-planning problem that minimizes the weighted sum of UAV mission time and transmission outage time. To model the complex communication environment that UAVs face in real scenarios, this article proposes a system model covering building distribution, the communication channel, and antenna design. In addition, we introduce other UAVs with fixed flight paths and ground interference sources at random locations to ensure that mission UAVs have better anti-interference ability. However, classical search algorithms and heuristic algorithms struggle with such complex path-planning problems. We therefore propose an improved deep deterministic policy gradient (DDPG) algorithm that outperforms the basic DDPG and double deep Q-network (DDQN) algorithms. Specifically, a post-decision state (PDS) mechanism is introduced to accelerate convergence and enhance the stability of the training process. In addition, a transmission outage probability experience memory (TOPEM) is designed to quickly generate wireless communication quality maps and provide temporary experience for the post-decision process, which yields better training results. Simulation experiments show that, compared with basic DDPG, the improved algorithm increases training speed by at least 50%, significantly improves the convergence rate, and reduces the number of episodes required for convergence to 20%. It also helps UAVs choose better paths than the basic DDPG and DDQN algorithms.
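The weighted-sum objective and the idea of an outage-experience memory can be illustrated with a small sketch. This is not the authors' implementation: the class `OutageMemory`, the function `weighted_path_cost`, the grid cell size, the default outage estimate, and the weight value are all hypothetical names and parameters chosen for illustration. The sketch only shows how observed outage events might be averaged per grid cell into a coarse quality map, and how a candidate path could then be scored as mission time plus a weighted expected outage time.

```python
from collections import defaultdict


class OutageMemory:
    """Hypothetical grid memory of observed outage events (illustrative only)."""

    def __init__(self, cell_size=10.0):
        self.cell_size = cell_size
        self._outages = defaultdict(float)  # outage observations per cell
        self._visits = defaultdict(int)     # total observations per cell

    def _cell(self, pos):
        # Map a continuous (x, y) position to a discrete grid cell.
        x, y = pos
        return (int(x // self.cell_size), int(y // self.cell_size))

    def record(self, pos, outage):
        # Store one observation: True if the link was in outage at pos.
        cell = self._cell(pos)
        self._outages[cell] += 1.0 if outage else 0.0
        self._visits[cell] += 1

    def estimate(self, pos, default=0.5):
        # Empirical outage probability for the cell, or a prior if unseen.
        cell = self._cell(pos)
        visits = self._visits[cell]
        return self._outages[cell] / visits if visits else default


def weighted_path_cost(path, memory, step_time=1.0, outage_weight=2.0):
    """Mission time plus weighted expected outage time along a path.

    Assumes one fixed-duration step between consecutive waypoints.
    """
    mission_time = step_time * (len(path) - 1)
    expected_outage_time = sum(step_time * memory.estimate(p) for p in path[1:])
    return mission_time + outage_weight * expected_outage_time


memory = OutageMemory(cell_size=10.0)
memory.record((5.0, 5.0), True)    # outage observed in cell (0, 0)
memory.record((6.0, 6.0), False)   # no outage, same cell
memory.record((15.0, 5.0), False)  # no outage in cell (1, 0)

path = [(0.0, 0.0), (5.0, 5.0), (15.0, 5.0)]
cost = weighted_path_cost(path, memory)
```

In this toy setting a larger `outage_weight` makes paths through cells with a high recorded outage rate more expensive, so a planner minimizing the cost trades a longer mission time for better connectivity.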