Energy-Efficient UAV Trajectory Design with Information Freshness Constraint via Deep Reinforcement Learning

Mob. Inf. Syst. Pub Date : 2021-12-23 DOI:10.1155/2021/1430512

Xinmin Li, Jiahui Li, Dandan Liu

{"title":"Energy-Efficient UAV Trajectory Design with Information Freshness Constraint via Deep Reinforcement Learning","authors":"Xinmin Li, Jiahui Li, Dandan Liu","doi":"10.1155/2021/1430512","DOIUrl":null,"url":null,"abstract":"Unmanned aerial vehicle (UAV) technique with flexible deployment has enabled the development of Internet of Things (IoT) applications. However, it is difficult to guarantee the freshness of information delivery for the energy-limited UAV. Thus, we study the trajectory design in the multiple-UAV communication system, in which the massive ground devices send the individual information to mobile UAV base stations under the demand of information freshness. First, an energy-efficiency (EE) maximization optimization problem is formulated under the rest energy, safety distance, and age of information (AoI) constraints. However, it is difficult to solve the optimization problem due to the nonconvex objective function and unknown dynamic environment. Second, a trajectory design based on the deep Q-network method is proposed, in which the state space considering energy efficiency, rest energy, and AoI and the efficient reward function related with EE performance are constructed, respectively. Furthermore, to avoid the dependency of training data for the neural network, the experience replay and random sampling for batch are adopted. Finally, we validate the system performance of the proposed scheme. Simulation results show that the proposed scheme can achieve a better EE performance compared with the benchmark scheme.","PeriodicalId":18790,"journal":{"name":"Mob. Inf. Syst.","volume":"17 1","pages":"1430512:1-1430512:9"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Mob. Inf. Syst.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1155/2021/1430512","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 8

Abstract

Unmanned aerial vehicle (UAV) technique with flexible deployment has enabled the development of Internet of Things (IoT) applications. However, it is difficult to guarantee the freshness of information delivery for the energy-limited UAV. Thus, we study the trajectory design in the multiple-UAV communication system, in which the massive ground devices send the individual information to mobile UAV base stations under the demand of information freshness. First, an energy-efficiency (EE) maximization optimization problem is formulated under the rest energy, safety distance, and age of information (AoI) constraints. However, it is difficult to solve the optimization problem due to the nonconvex objective function and unknown dynamic environment. Second, a trajectory design based on the deep Q-network method is proposed, in which the state space considering energy efficiency, rest energy, and AoI and the efficient reward function related with EE performance are constructed, respectively. Furthermore, to avoid the dependency of training data for the neural network, the experience replay and random sampling for batch are adopted. Finally, we validate the system performance of the proposed scheme. Simulation results show that the proposed scheme can achieve a better EE performance compared with the benchmark scheme.

查看原文本刊更多论文

基于深度强化学习的信息新鲜度约束节能无人机轨迹设计

灵活部署的无人机(UAV)技术使物联网(IoT)应用的发展成为可能。然而，对于能量有限的无人机来说，很难保证信息传递的新鲜度。因此，我们研究了多无人机通信系统中大量地面设备在信息新鲜度要求下向移动无人机基站发送个体信息的轨迹设计。首先，在剩余能量、安全距离和信息年龄(AoI)约束下，建立了能效最大化优化问题。然而，由于目标函数的非凸性和动态环境的未知，使得优化问题难以求解。其次，提出了基于深度q -网络方法的轨迹设计，分别构建了考虑能量效率、休息能量和AoI的状态空间以及与EE绩效相关的有效奖励函数。此外，为了避免神经网络对训练数据的依赖性，采用了经验重放和随机抽样的方法。最后，对所提方案的系统性能进行了验证。仿真结果表明，与基准方案相比，该方案可以获得更好的EE性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Mob. Inf. Syst.

自引率

0.00%

发文量