Multiagent Reinforcement Learning-Based Resource Sharing in Multi-UAV Wireless Networks

IEEE Journal on Miniaturization for Air and Space Systems Pub Date : 2024-12-04 DOI:10.1109/JMASS.2024.3510808

Yaxiu Zhang;Mingan Luan;Zheng Chang;Timo Hämäläinen

Volume 6, Issue 2, Pages 103-112. Full text: https://ieeexplore.ieee.org/document/10777085/

Citations: 0

Abstract



This article investigates resource sharing in a multi-UAV (uncrewed aerial vehicle) wireless network using multiagent reinforcement learning (MARL). The considered system involves two transmission modes, UAV-to-device (U2D) and UAV-to-network (U2N), in which U2D links are allowed to reuse the spectrum of U2N links to improve spectrum efficiency. We formulate an optimization problem that maximizes the throughput of the U2D links by jointly optimizing channel allocation, power level selection, and UAV trajectory, while ensuring the communication quality of the U2N links. Because of its highly complex and dynamic nature and its nonconvex objective function and constraints, the resulting problem is hard to solve. Accordingly, we propose a novel multiagent deep deterministic policy gradient (MADDPG)-based policy for resource allocation and multi-UAV trajectory optimization. Simulation results illustrate the efficacy of our method in improving the system transmission rate.
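To make the spectrum-reuse trade-off in the abstract concrete, the following is a minimal illustrative sketch, not the authors' formulation. It assumes a single U2D link sharing one channel with a single U2N link, Shannon-capacity rates, and a fixed penalty when the U2N link falls below a minimum SINR. The function names, channel gains, noise power, threshold, and penalty value are all hypothetical; the abstract does not specify how the paper defines its reward or constraints.

```python
import math


def sinr(signal_w, interference_w, noise_w):
    """Linear signal-to-interference-plus-noise ratio."""
    return signal_w / (interference_w + noise_w)


def shannon_rate(bandwidth_hz, sinr_linear):
    """Achievable rate in bit/s for a given linear SINR (Shannon formula)."""
    return bandwidth_hz * math.log2(1.0 + sinr_linear)


def u2d_reward(p_u2d, g_u2d, p_u2n, g_u2n, g_u2d_to_bs, g_u2n_to_dev,
               noise_w=1e-13, bandwidth_hz=1e6,
               sinr_min_u2n=2.0, penalty=-1.0):
    """Toy per-step reward for one U2D agent (all parameters are assumptions).

    p_*          transmit powers in watts
    g_u2d        gain of the desired U2D link
    g_u2n        gain of the desired U2N link
    g_u2d_to_bs  gain from the U2D transmitter to the U2N receiver (interference)
    g_u2n_to_dev gain from the U2N transmitter to the U2D receiver (interference)
    """
    # Both links use the same channel, so each one interferes with the other.
    sinr_u2n = sinr(p_u2n * g_u2n, p_u2d * g_u2d_to_bs, noise_w)
    sinr_u2d = sinr(p_u2d * g_u2d, p_u2n * g_u2n_to_dev, noise_w)

    # Assumed constraint handling: penalize any violation of the U2N quality floor.
    if sinr_u2n < sinr_min_u2n:
        return penalty

    # Otherwise the reward is the U2D throughput in Mbit/s.
    return shannon_rate(bandwidth_hz, sinr_u2d) / 1e6


if __name__ == "__main__":
    clean = u2d_reward(0.1, 1e-9, 0.2, 1e-9, 0.0, 0.0)
    harmful = u2d_reward(0.1, 1e-9, 0.2, 1e-9, 1e-8, 0.0)
    print(f"no cross-interference: {clean:.3f} Mbit/s")
    print(f"U2N constraint violated: {harmful}")
```

In a MADDPG-style setup, a reward of this kind would be computed per agent at each step from the chosen channel, power level, and UAV position; how the paper actually shapes its reward is not stated in the abstract.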


Source journal

IEEE Journal on Miniaturization for Air and Space Systems

CiteScore

4.40

Self-citation rate

0.00%

Number of publications