Joint Trajectory and Power Optimization for Energy Efficient UAV Communication Using Deep Reinforcement Learning

IEEE INFOCOM 2021 - IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS) Pub Date : 2021-05-10 DOI:10.1109/INFOCOMWKSHPS51825.2021.9484490

Yuling Cui, Danhao Deng, Chaowei Wang, Weidong Wang

引用次数: 5

Abstract

In recent years, unmanned aerial vehicles (UAVs) have been widely used in wireless communication, attracting intensive attentions. UAVs can not only serve as relays, but also serve as aerial base station for ground users (GUs). However, limited energy means that they cannot work for long and cover a limited area of services. In this paper, we investigate 2D UAV trajectory design and power allocation in order to maximize the UAV's service time and downlink throughput. Based on deep reinforcement learning, we propose a deep deterministic policy gradient (DDPG) algorithm for trajectory design and power allocation (TDPA) to solve the energy efficient and communication service quality problem. The simulation results show that TDPA can extend the service time of UAV, improve the communication service quality, and realize the maximization of downlink throughput, which are significantly improved compared with existing methods.

查看原文本刊更多论文

基于深度强化学习的高能效无人机通信联合轨迹与功率优化

近年来，无人机在无线通信领域的广泛应用引起了人们的广泛关注。无人机不仅可以作为中继，还可以作为地面用户(GUs)的空中基站。然而，有限的能量意味着它们不能长时间工作，服务范围也有限。为了使无人机的服务时间和下行吞吐量最大化，本文研究了二维无人机的轨迹设计和功率分配。基于深度强化学习，提出了一种深度确定性策略梯度(DDPG)算法用于轨迹设计和功率分配(TDPA)，以解决通信服务质量和能源效率问题。仿真结果表明，TDPA可以延长无人机的服务时间，提高通信服务质量，实现下行吞吐量最大化，与现有方法相比有显著提高。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEE INFOCOM 2021 - IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS)

自引率

0.00%

发文量