Wentao Sun;Zan Li;Jia Shi;Zixuan Bai;Feng Wang;Tony Q. S. Quek
{"title":"MAHTD-DDPG-Based Multiobjective Resource Allocation for UAV-Assisted Wireless Network","authors":"Wentao Sun;Zan Li;Jia Shi;Zixuan Bai;Feng Wang;Tony Q. S. Quek","doi":"10.1109/JMASS.2024.3420893","DOIUrl":null,"url":null,"abstract":"As an aerial base station (BS), uncrewed aerial vehicle (UAV) has been considered as a promising platform to provide wireless data service in future networks due to its flexible, swift, and low-cost features. However, since the suddenness and randomness of ground users’ (GUs’) data requirements, it is challenging for the UAV BSs to dynamically make decisions to provide real-time data services to GUs. In a multimode UAV-assisted wireless network, we formulate a multiobjective optimization problem to minimize the average peak age of information (APAoI) and energy consumption of UAVs and to maximize the accumulated service data (ASD) for GUs. Therefore, this article proposes the multiagent hybrid twin delayed deep deterministic policy gradient (MAHTD-DDPG) algorithm with hybrid action space design, which is empowered by the centralized training and distributed execution (CTDE) framework. In the proposed algorithm, the UAVs can cooperatively make decisions by sharing the GU status information, in a result of jointly optimizing the UAV trajectory, mode selection, and transmit power. Simulation results demonstrate that our proposed approach achieves 79.6% and 120.4% higher rewards than the multiagent DDPG algorithm and HTD-DDPG algorithm, respectively.","PeriodicalId":100624,"journal":{"name":"IEEE Journal on Miniaturization for Air and Space Systems","volume":"6 2","pages":"70-81"},"PeriodicalIF":2.1000,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Journal on Miniaturization for Air and Space Systems","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10578051/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
As an aerial base station (BS), uncrewed aerial vehicle (UAV) has been considered as a promising platform to provide wireless data service in future networks due to its flexible, swift, and low-cost features. However, since the suddenness and randomness of ground users’ (GUs’) data requirements, it is challenging for the UAV BSs to dynamically make decisions to provide real-time data services to GUs. In a multimode UAV-assisted wireless network, we formulate a multiobjective optimization problem to minimize the average peak age of information (APAoI) and energy consumption of UAVs and to maximize the accumulated service data (ASD) for GUs. Therefore, this article proposes the multiagent hybrid twin delayed deep deterministic policy gradient (MAHTD-DDPG) algorithm with hybrid action space design, which is empowered by the centralized training and distributed execution (CTDE) framework. In the proposed algorithm, the UAVs can cooperatively make decisions by sharing the GU status information, in a result of jointly optimizing the UAV trajectory, mode selection, and transmit power. Simulation results demonstrate that our proposed approach achieves 79.6% and 120.4% higher rewards than the multiagent DDPG algorithm and HTD-DDPG algorithm, respectively.