Intelligent UAV Navigation: A DRL-QiER Solution

ICC 2022 - IEEE International Conference on Communications Pub Date : 2022-05-16 DOI:10.1109/ICC45855.2022.9838566

Yuanjian Li, H. Aghvami

引用次数: 5

Abstract

In cellular-connected unmanned aerial vehicle (UAV) network, a minimization problem on the weighted sum of time cost and expected outage duration is considered. Taking advantage of UAV’s adjustable mobility, an intelligent UAV navigation approach is formulated to achieve the aforementioned optimization goal. Specifically, after mapping the navigation task into a Markov decision process (MDP), a deep reinforcement learning (DRL) solution with novel quantum-inspired experience replay (QiER) framework is proposed to help the UAV find the optimal flying direction within each time slot. Via relating experienced transition’s importance to its associated quantum bit (qubit) and applying Grover-iteration-based amplitude amplification technique, the proposed DRL-QiER solution commits a better trade-off between sampling priority and diversity. Compared to several representative baselines, the effectiveness and supremacy of the proposed DRL-QiER solution are demonstrated and validated in numerical results.

查看原文本刊更多论文

智能无人机导航:一个DRL-QiER解决方案

在蜂窝连接的无人机网络中，考虑了时间成本和预期停机时间加权和的最小化问题。利用无人机机动性可调的特点，提出了一种无人机智能导航方法来实现上述优化目标。具体而言，在将导航任务映射到马尔可夫决策过程(MDP)之后，提出了一种基于量子启发经验重放(QiER)框架的深度强化学习(DRL)解决方案，帮助无人机在每个时槽内找到最优飞行方向。通过将经验跃迁的重要性与其相关的量子比特(qubit)联系起来，并应用基于grover迭代的幅度放大技术，所提出的DRL-QiER解决方案在采样优先级和多样性之间实现了更好的权衡。通过与几个代表性基线的比较，数值结果验证了所提出的DRL-QiER解的有效性和优越性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

ICC 2022 - IEEE International Conference on Communications

自引率

0.00%

发文量