Optimizing Drone Energy Use for Emergency Communications in Disasters via Deep Reinforcement Learning

Future Internet Pub Date : 2024-07-11 DOI:10.3390/fi16070245

Wen Qiu, Xun Shao, Hiroshi Masui, William Liu

{"title":"Optimizing Drone Energy Use for Emergency Communications in Disasters via Deep Reinforcement Learning","authors":"Wen Qiu, Xun Shao, Hiroshi Masui, William Liu","doi":"10.3390/fi16070245","DOIUrl":null,"url":null,"abstract":"For a communication control system in a disaster area where drones (also called unmanned aerial vehicles (UAVs)) are used as aerial base stations (ABSs), the reliability of communication is a key challenge for drones to provide emergency communication services. However, the effective configuration of UAVs remains a major challenge due to limitations in their communication range and energy capacity. In addition, the relatively high cost of drones and the issue of mutual communication interference make it impractical to deploy an unlimited number of drones in a given area. To maximize the communication services provided by a limited number of drones to the ground user equipment (UE) within a certain time frame while minimizing the drone energy consumption, we propose a multi-agent proximal policy optimization (MAPPO) algorithm. Considering the dynamic nature of the environment, we analyze diverse observation data structures and design novel objective functions to enhance the drone performance. We find that, when drone energy consumption is used as a penalty term in the objective function, the drones—acting as agents—can identify the optimal trajectory that maximizes the UE coverage while minimizing the energy consumption. At the same time, the experimental results reveal that, without considering the machine computing power required for training and convergence time, the proposed key algorithm demonstrates better performance in communication coverage and energy saving as compared with other methods. The average coverage performance is 10–45% higher than that of the other three methods, and it can save up to 3% more energy.","PeriodicalId":509567,"journal":{"name":"Future Internet","volume":"120 24","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Future Internet","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/fi16070245","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

For a communication control system in a disaster area where drones (also called unmanned aerial vehicles (UAVs)) are used as aerial base stations (ABSs), the reliability of communication is a key challenge for drones to provide emergency communication services. However, the effective configuration of UAVs remains a major challenge due to limitations in their communication range and energy capacity. In addition, the relatively high cost of drones and the issue of mutual communication interference make it impractical to deploy an unlimited number of drones in a given area. To maximize the communication services provided by a limited number of drones to the ground user equipment (UE) within a certain time frame while minimizing the drone energy consumption, we propose a multi-agent proximal policy optimization (MAPPO) algorithm. Considering the dynamic nature of the environment, we analyze diverse observation data structures and design novel objective functions to enhance the drone performance. We find that, when drone energy consumption is used as a penalty term in the objective function, the drones—acting as agents—can identify the optimal trajectory that maximizes the UE coverage while minimizing the energy consumption. At the same time, the experimental results reveal that, without considering the machine computing power required for training and convergence time, the proposed key algorithm demonstrates better performance in communication coverage and energy saving as compared with other methods. The average coverage performance is 10–45% higher than that of the other three methods, and it can save up to 3% more energy.

查看原文本刊更多论文

通过深度强化学习优化无人机在灾难中的应急通信能源使用

在灾区，无人机（也称为无人驾驶飞行器（UAV））被用作空中基站（ABS），对于灾区的通信控制系统来说，通信的可靠性是无人机提供应急通信服务的关键挑战。然而，由于无人机通信距离和能源容量的限制，如何有效配置无人机仍是一大挑战。此外，无人机的成本相对较高，且存在相互通信干扰的问题，因此在特定区域部署无限数量的无人机并不现实。为了在一定时间内使有限数量的无人机向地面用户设备（UE）提供的通信服务最大化，同时使无人机的能耗最小化，我们提出了一种多代理近端策略优化（MAPPO）算法。考虑到环境的动态特性，我们分析了多样化的观测数据结构，并设计了新颖的目标函数来提高无人机性能。我们发现，当无人机能耗作为目标函数中的惩罚项时，作为代理的无人机可以识别出最优轨迹，在最大化 UE 覆盖范围的同时最小化能耗。同时，实验结果表明，在不考虑训练所需的机器计算能力和收敛时间的情况下，与其他方法相比，所提出的关键算法在通信覆盖和节能方面表现出更好的性能。其平均覆盖性能比其他三种方法高出 10-45%，并能节省多达 3% 的能源。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Future Internet

自引率

0.00%

发文量