{"title":"基于多智能体强化学习的多无人机无线网络资源共享","authors":"Yaxiu Zhang;Mingan Luan;Zheng Chang;Timo Hämäläinen","doi":"10.1109/JMASS.2024.3510808","DOIUrl":null,"url":null,"abstract":"This article investigates the resource sharing problem in a multiuncrewed aerial vehicle (UAV) wireless network by utilizing the multiagent reinforcement learning (MARL) method. Specifically, the considered multi-UAV system involves two transmission modes, i.e., UAV-to-device (U2D) mode and UAV-to-network (U2N) mode, in which the U2D mode is allowed to reuse the spectrum of U2N mode to improve the spectrum efficiency. Then, we formulate an optimization problem to maximize the throughput of U2D links by jointly optimizing the channel allocation, power level selection, and UAV trajectory, while ensuring the communication quality of U2N links. Due to the highly complex and dynamic nature, as well as the challenging nonconvex objective function and constraints, the resulting problem is hard to address. Accordingly, we propose a novel multiagent deep deterministic policy gradient (MADDPG)-based resource allocation and multi-UAV trajectory optimization policy. Simulation results illustrate the efficacy of our method in improving the system transmission rate.","PeriodicalId":100624,"journal":{"name":"IEEE Journal on Miniaturization for Air and Space Systems","volume":"6 2","pages":"103-112"},"PeriodicalIF":0.0000,"publicationDate":"2024-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Multiagent Reinforcement Learning-Based Resource Sharing in Multi-UAV Wireless Networks\",\"authors\":\"Yaxiu Zhang;Mingan Luan;Zheng Chang;Timo Hämäläinen\",\"doi\":\"10.1109/JMASS.2024.3510808\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This article investigates the resource sharing problem in a multiuncrewed aerial vehicle (UAV) wireless network by utilizing the multiagent reinforcement learning (MARL) method. Specifically, the considered multi-UAV system involves two transmission modes, i.e., UAV-to-device (U2D) mode and UAV-to-network (U2N) mode, in which the U2D mode is allowed to reuse the spectrum of U2N mode to improve the spectrum efficiency. Then, we formulate an optimization problem to maximize the throughput of U2D links by jointly optimizing the channel allocation, power level selection, and UAV trajectory, while ensuring the communication quality of U2N links. Due to the highly complex and dynamic nature, as well as the challenging nonconvex objective function and constraints, the resulting problem is hard to address. Accordingly, we propose a novel multiagent deep deterministic policy gradient (MADDPG)-based resource allocation and multi-UAV trajectory optimization policy. Simulation results illustrate the efficacy of our method in improving the system transmission rate.\",\"PeriodicalId\":100624,\"journal\":{\"name\":\"IEEE Journal on Miniaturization for Air and Space Systems\",\"volume\":\"6 2\",\"pages\":\"103-112\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-12-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Journal on Miniaturization for Air and Space Systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10777085/\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Journal on Miniaturization for Air and Space Systems","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10777085/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Multiagent Reinforcement Learning-Based Resource Sharing in Multi-UAV Wireless Networks
This article investigates the resource sharing problem in a multiuncrewed aerial vehicle (UAV) wireless network by utilizing the multiagent reinforcement learning (MARL) method. Specifically, the considered multi-UAV system involves two transmission modes, i.e., UAV-to-device (U2D) mode and UAV-to-network (U2N) mode, in which the U2D mode is allowed to reuse the spectrum of U2N mode to improve the spectrum efficiency. Then, we formulate an optimization problem to maximize the throughput of U2D links by jointly optimizing the channel allocation, power level selection, and UAV trajectory, while ensuring the communication quality of U2N links. Due to the highly complex and dynamic nature, as well as the challenging nonconvex objective function and constraints, the resulting problem is hard to address. Accordingly, we propose a novel multiagent deep deterministic policy gradient (MADDPG)-based resource allocation and multi-UAV trajectory optimization policy. Simulation results illustrate the efficacy of our method in improving the system transmission rate.