Chiya Zhang , Xinjie Li , Chunlong He , Xingquan Li , Dongping Lin
{"title":"基于强化学习的无人机中继轨迹优化","authors":"Chiya Zhang , Xinjie Li , Chunlong He , Xingquan Li , Dongping Lin","doi":"10.1016/j.dcan.2023.07.006","DOIUrl":null,"url":null,"abstract":"<div><div>In this paper, we investigate the application of the Unmanned Aerial Vehicle (UAV)-enabled relaying system in emergency communications, where one UAV is applied as a relay to help transmit information from ground users to a Base Station (BS). We maximize the total transmitted data from the users to the BS, by optimizing the user communication scheduling and association along with the power allocation and the trajectory of the UAV. To solve this non-convex optimization problem, we propose the traditional Convex Optimization (CO) and the Reinforcement Learning (RL)-based approaches. Specifically, we apply the block coordinate descent and successive convex approximation techniques in the CO approach, while applying the soft actor-critic algorithm in the RL approach. The simulation results show that both approaches can solve the proposed optimization problem and obtain good results. Moreover, the RL approach establishes emergency communications more rapidly than the CO approach once the training process has been completed.</div></div>","PeriodicalId":48631,"journal":{"name":"Digital Communications and Networks","volume":"11 1","pages":"Pages 200-209"},"PeriodicalIF":7.5000,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Trajectory optimization for UAV-enabled relaying with reinforcement learning\",\"authors\":\"Chiya Zhang , Xinjie Li , Chunlong He , Xingquan Li , Dongping Lin\",\"doi\":\"10.1016/j.dcan.2023.07.006\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>In this paper, we investigate the application of the Unmanned Aerial Vehicle (UAV)-enabled relaying system in emergency communications, where one UAV is applied as a relay to help transmit information from ground users to a Base Station (BS). We maximize the total transmitted data from the users to the BS, by optimizing the user communication scheduling and association along with the power allocation and the trajectory of the UAV. To solve this non-convex optimization problem, we propose the traditional Convex Optimization (CO) and the Reinforcement Learning (RL)-based approaches. Specifically, we apply the block coordinate descent and successive convex approximation techniques in the CO approach, while applying the soft actor-critic algorithm in the RL approach. The simulation results show that both approaches can solve the proposed optimization problem and obtain good results. Moreover, the RL approach establishes emergency communications more rapidly than the CO approach once the training process has been completed.</div></div>\",\"PeriodicalId\":48631,\"journal\":{\"name\":\"Digital Communications and Networks\",\"volume\":\"11 1\",\"pages\":\"Pages 200-209\"},\"PeriodicalIF\":7.5000,\"publicationDate\":\"2025-02-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Digital Communications and Networks\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2352864823001311\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"TELECOMMUNICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Digital Communications and Networks","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2352864823001311","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"TELECOMMUNICATIONS","Score":null,"Total":0}
Trajectory optimization for UAV-enabled relaying with reinforcement learning
In this paper, we investigate the application of the Unmanned Aerial Vehicle (UAV)-enabled relaying system in emergency communications, where one UAV is applied as a relay to help transmit information from ground users to a Base Station (BS). We maximize the total transmitted data from the users to the BS, by optimizing the user communication scheduling and association along with the power allocation and the trajectory of the UAV. To solve this non-convex optimization problem, we propose the traditional Convex Optimization (CO) and the Reinforcement Learning (RL)-based approaches. Specifically, we apply the block coordinate descent and successive convex approximation techniques in the CO approach, while applying the soft actor-critic algorithm in the RL approach. The simulation results show that both approaches can solve the proposed optimization problem and obtain good results. Moreover, the RL approach establishes emergency communications more rapidly than the CO approach once the training process has been completed.
期刊介绍:
Digital Communications and Networks is a prestigious journal that emphasizes on communication systems and networks. We publish only top-notch original articles and authoritative reviews, which undergo rigorous peer-review. We are proud to announce that all our articles are fully Open Access and can be accessed on ScienceDirect. Our journal is recognized and indexed by eminent databases such as the Science Citation Index Expanded (SCIE) and Scopus.
In addition to regular articles, we may also consider exceptional conference papers that have been significantly expanded. Furthermore, we periodically release special issues that focus on specific aspects of the field.
In conclusion, Digital Communications and Networks is a leading journal that guarantees exceptional quality and accessibility for researchers and scholars in the field of communication systems and networks.