Jiansong Miao;Shanling Bai;Shahid Mumtaz;Qian Zhang;Junsheng Mu
{"title":"无人机辅助 MEC 网络视频流的实用性优化:DRL 方法","authors":"Jiansong Miao;Shanling Bai;Shahid Mumtaz;Qian Zhang;Junsheng Mu","doi":"10.1109/TGCN.2024.3352173","DOIUrl":null,"url":null,"abstract":"The integration of unmanned aerial vehicles (UAVs) in future communication networks has received great attention, and it plays an essential role in many applications, such as military reconnaissance, fire monitoring, etc. In this paper, we consider a UAV-aided video transmission system based on mobile edge computing (MEC). Considering the short latency requirements, the UAV acts as a MEC server to transcode the videos and as a relay to forward the transcoded videos to the ground base station. Subject to constraints on discrete variables and short latency, we aim to maximize the cumulative utility by jointly optimizing the power allocation, video transcoding policy, computational resources allocation, and UAV flight trajectory. The above non-convex optimization problem is modeled as a Markov decision process (MDP) and solved by a deep deterministic policy gradient (DDPG) algorithm to realize continuous action control by policy iteration. Simulation results show that the DDPG algorithm performs better than deep Q-learning network algorithm (DQN) and actor-critic (AC) algorithm.","PeriodicalId":13052,"journal":{"name":"IEEE Transactions on Green Communications and Networking","volume":"8 2","pages":"878-889"},"PeriodicalIF":5.3000,"publicationDate":"2024-01-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Utility-Oriented Optimization for Video Streaming in UAV-Aided MEC Network: A DRL Approach\",\"authors\":\"Jiansong Miao;Shanling Bai;Shahid Mumtaz;Qian Zhang;Junsheng Mu\",\"doi\":\"10.1109/TGCN.2024.3352173\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The integration of unmanned aerial vehicles (UAVs) in future communication networks has received great attention, and it plays an essential role in many applications, such as military reconnaissance, fire monitoring, etc. In this paper, we consider a UAV-aided video transmission system based on mobile edge computing (MEC). Considering the short latency requirements, the UAV acts as a MEC server to transcode the videos and as a relay to forward the transcoded videos to the ground base station. Subject to constraints on discrete variables and short latency, we aim to maximize the cumulative utility by jointly optimizing the power allocation, video transcoding policy, computational resources allocation, and UAV flight trajectory. The above non-convex optimization problem is modeled as a Markov decision process (MDP) and solved by a deep deterministic policy gradient (DDPG) algorithm to realize continuous action control by policy iteration. Simulation results show that the DDPG algorithm performs better than deep Q-learning network algorithm (DQN) and actor-critic (AC) algorithm.\",\"PeriodicalId\":13052,\"journal\":{\"name\":\"IEEE Transactions on Green Communications and Networking\",\"volume\":\"8 2\",\"pages\":\"878-889\"},\"PeriodicalIF\":5.3000,\"publicationDate\":\"2024-01-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Green Communications and Networking\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10388042/\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"TELECOMMUNICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Green Communications and Networking","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10388042/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"TELECOMMUNICATIONS","Score":null,"Total":0}
Utility-Oriented Optimization for Video Streaming in UAV-Aided MEC Network: A DRL Approach
The integration of unmanned aerial vehicles (UAVs) in future communication networks has received great attention, and it plays an essential role in many applications, such as military reconnaissance, fire monitoring, etc. In this paper, we consider a UAV-aided video transmission system based on mobile edge computing (MEC). Considering the short latency requirements, the UAV acts as a MEC server to transcode the videos and as a relay to forward the transcoded videos to the ground base station. Subject to constraints on discrete variables and short latency, we aim to maximize the cumulative utility by jointly optimizing the power allocation, video transcoding policy, computational resources allocation, and UAV flight trajectory. The above non-convex optimization problem is modeled as a Markov decision process (MDP) and solved by a deep deterministic policy gradient (DDPG) algorithm to realize continuous action control by policy iteration. Simulation results show that the DDPG algorithm performs better than deep Q-learning network algorithm (DQN) and actor-critic (AC) algorithm.