Aerial Reliable Collaborative Communications for Terrestrial Mobile Users via Evolutionary Multi-Objective Deep Reinforcement Learning

IF 7.7 2区计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS

IEEE Transactions on Mobile Computing Pub Date : 2025-01-29 DOI:10.1109/TMC.2025.3536093

Geng Sun;Jian Xiao;Jiahui Li;Jiacheng Wang;Jiawen Kang;Dusit Niyato;Shiwen Mao

{"title":"Aerial Reliable Collaborative Communications for Terrestrial Mobile Users via Evolutionary Multi-Objective Deep Reinforcement Learning","authors":"Geng Sun;Jian Xiao;Jiahui Li;Jiacheng Wang;Jiawen Kang;Dusit Niyato;Shiwen Mao","doi":"10.1109/TMC.2025.3536093","DOIUrl":null,"url":null,"abstract":"Autonomous aerial vehicles (AAVs) have emerged as the potential aerial base stations (BSs) to improve terrestrial communications. However, the limited onboard energy and antenna power of a AAV restrict its communication range and transmission capability. To address these limitations, this work employs collaborative beamforming through a AAV-enabled virtual antenna array to improve transmission performance from the AAV to terrestrial mobile users, under interference from non-associated BSs and dynamic channel conditions. Specifically, we introduce a memory-based random walk model to more accurately depict the mobility patterns of terrestrial mobile users. Following this, we formulate a multi-objective optimization problem (MOP) focused on maximizing the transmission rate while minimizing the flight energy consumption of the AAV swarm. Given the NP-hard nature of the formulated MOP and the highly dynamic environment, we transform this problem into a multi-objective Markov decision process and propose an improved evolutionary multi-objective reinforcement learning algorithm. Specifically, this algorithm introduces an evolutionary learning approach to obtain the approximate Pareto set for the formulated MOP. Moreover, the algorithm incorporates a long short-term memory network and hyper-sphere-based task selection method to discern the movement patterns of terrestrial mobile users and improve the diversity of the obtained Pareto set. Simulation results demonstrate that the proposed method effectively generates a diverse range of non-dominated policies and outperforms existing methods. Additional simulations demonstrate the scalability and robustness of the proposed CB-based method under different system parameters and various unexpected circumstances.","PeriodicalId":50389,"journal":{"name":"IEEE Transactions on Mobile Computing","volume":"24 7","pages":"5731-5748"},"PeriodicalIF":7.7000,"publicationDate":"2025-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Mobile Computing","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10857476/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

Autonomous aerial vehicles (AAVs) have emerged as the potential aerial base stations (BSs) to improve terrestrial communications. However, the limited onboard energy and antenna power of a AAV restrict its communication range and transmission capability. To address these limitations, this work employs collaborative beamforming through a AAV-enabled virtual antenna array to improve transmission performance from the AAV to terrestrial mobile users, under interference from non-associated BSs and dynamic channel conditions. Specifically, we introduce a memory-based random walk model to more accurately depict the mobility patterns of terrestrial mobile users. Following this, we formulate a multi-objective optimization problem (MOP) focused on maximizing the transmission rate while minimizing the flight energy consumption of the AAV swarm. Given the NP-hard nature of the formulated MOP and the highly dynamic environment, we transform this problem into a multi-objective Markov decision process and propose an improved evolutionary multi-objective reinforcement learning algorithm. Specifically, this algorithm introduces an evolutionary learning approach to obtain the approximate Pareto set for the formulated MOP. Moreover, the algorithm incorporates a long short-term memory network and hyper-sphere-based task selection method to discern the movement patterns of terrestrial mobile users and improve the diversity of the obtained Pareto set. Simulation results demonstrate that the proposed method effectively generates a diverse range of non-dominated policies and outperforms existing methods. Additional simulations demonstrate the scalability and robustness of the proposed CB-based method under different system parameters and various unexpected circumstances.

查看原文本刊更多论文

基于进化多目标深度强化学习的地面移动用户空中可靠协同通信

自主飞行器（aav）已成为改善地面通信的潜在空中基站（BSs）。然而，机载机载无人机有限的能量和天线功率限制了其通信范围和传输能力。为了解决这些限制，这项工作通过支持AAV的虚拟天线阵列采用协作波束形成，以提高在非相关BSs和动态信道条件下从AAV到地面移动用户的传输性能。具体来说，我们引入了一种基于记忆的随机漫步模型来更准确地描述地面移动用户的移动模式。在此基础上，提出了以最大传输速率和最小飞行能耗为目标的多目标优化问题（MOP）。考虑到拟定MOP的NP-hard性质和高度动态的环境，我们将该问题转化为一个多目标马尔可夫决策过程，并提出了一种改进的进化多目标强化学习算法。具体来说，该算法引入了一种进化学习方法来获得公式化MOP的近似帕累托集。此外，该算法结合长短期记忆网络和基于超球的任务选择方法来识别地面移动用户的运动模式，提高了得到的Pareto集的多样性。仿真结果表明，该方法有效地生成了多种非支配策略，并优于现有方法。仿真验证了该方法在不同系统参数和各种意外情况下的可扩展性和鲁棒性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEE Transactions on Mobile Computing 工程技术-电信学

CiteScore

12.90

自引率

2.50%

发文量

403

审稿时长

6.6 months

期刊介绍： IEEE Transactions on Mobile Computing addresses key technical issues related to various aspects of mobile computing. This includes (a) architectures, (b) support services, (c) algorithm/protocol design and analysis, (d) mobile environments, (e) mobile communication systems, (f) applications, and (g) emerging technologies. Topics of interest span a wide range, covering aspects like mobile networks and hosts, mobility management, multimedia, operating system support, power management, online and mobile environments, security, scalability, reliability, and emerging technologies such as wearable computers, body area networks, and wireless sensor networks. The journal serves as a comprehensive platform for advancements in mobile computing research.