MAHTD-DDPG-Based Multiobjective Resource Allocation for UAV-Assisted Wireless Network

IF 2.1

IEEE Journal on Miniaturization for Air and Space Systems Pub Date : 2024-07-01 DOI:10.1109/JMASS.2024.3420893

Wentao Sun;Zan Li;Jia Shi;Zixuan Bai;Feng Wang;Tony Q. S. Quek

{"title":"MAHTD-DDPG-Based Multiobjective Resource Allocation for UAV-Assisted Wireless Network","authors":"Wentao Sun;Zan Li;Jia Shi;Zixuan Bai;Feng Wang;Tony Q. S. Quek","doi":"10.1109/JMASS.2024.3420893","DOIUrl":null,"url":null,"abstract":"As an aerial base station (BS), uncrewed aerial vehicle (UAV) has been considered as a promising platform to provide wireless data service in future networks due to its flexible, swift, and low-cost features. However, since the suddenness and randomness of ground users’ (GUs’) data requirements, it is challenging for the UAV BSs to dynamically make decisions to provide real-time data services to GUs. In a multimode UAV-assisted wireless network, we formulate a multiobjective optimization problem to minimize the average peak age of information (APAoI) and energy consumption of UAVs and to maximize the accumulated service data (ASD) for GUs. Therefore, this article proposes the multiagent hybrid twin delayed deep deterministic policy gradient (MAHTD-DDPG) algorithm with hybrid action space design, which is empowered by the centralized training and distributed execution (CTDE) framework. In the proposed algorithm, the UAVs can cooperatively make decisions by sharing the GU status information, in a result of jointly optimizing the UAV trajectory, mode selection, and transmit power. Simulation results demonstrate that our proposed approach achieves 79.6% and 120.4% higher rewards than the multiagent DDPG algorithm and HTD-DDPG algorithm, respectively.","PeriodicalId":100624,"journal":{"name":"IEEE Journal on Miniaturization for Air and Space Systems","volume":"6 2","pages":"70-81"},"PeriodicalIF":2.1000,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Journal on Miniaturization for Air and Space Systems","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10578051/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

As an aerial base station (BS), uncrewed aerial vehicle (UAV) has been considered as a promising platform to provide wireless data service in future networks due to its flexible, swift, and low-cost features. However, since the suddenness and randomness of ground users’ (GUs’) data requirements, it is challenging for the UAV BSs to dynamically make decisions to provide real-time data services to GUs. In a multimode UAV-assisted wireless network, we formulate a multiobjective optimization problem to minimize the average peak age of information (APAoI) and energy consumption of UAVs and to maximize the accumulated service data (ASD) for GUs. Therefore, this article proposes the multiagent hybrid twin delayed deep deterministic policy gradient (MAHTD-DDPG) algorithm with hybrid action space design, which is empowered by the centralized training and distributed execution (CTDE) framework. In the proposed algorithm, the UAVs can cooperatively make decisions by sharing the GU status information, in a result of jointly optimizing the UAV trajectory, mode selection, and transmit power. Simulation results demonstrate that our proposed approach achieves 79.6% and 120.4% higher rewards than the multiagent DDPG algorithm and HTD-DDPG algorithm, respectively.

查看原文本刊更多论文

基于mahtd - ddpg的无人机辅助无线网络多目标资源分配

无人机（UAV）作为一种空中基站（BS），以其灵活、快速和低成本的特点，被认为是未来网络中提供无线数据服务的一个很有前途的平台。然而，由于地面用户数据需求的突发性和随机性，无人机导航系统如何动态决策为地面用户提供实时数据服务是一个挑战。在多模无人机辅助的无线网络中，以无人机的平均峰值信息年龄（APAoI）和能耗最小，GUs的累计服务数据（ASD）最大为目标，建立了多目标优化问题。为此，本文提出了基于集中训练和分布式执行（CTDE）框架的混合动作空间设计的多智能体混合双延迟深度确定性策略梯度（MAHTD-DDPG）算法。该算法通过共享GU状态信息，实现无人机协同决策，共同优化无人机的飞行轨迹、模式选择和发射功率。仿真结果表明，该方法比多智能体DDPG算法和HTD-DDPG算法分别高出79.6%和120.4%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEE Journal on Miniaturization for Air and Space Systems

CiteScore

4.40

自引率

0.00%

发文量