Medhat Elsayed;Roghayeh Joda;Fahime Khoramnejad;David Chan;Akram Bin Sediq;Gary Boudreau;Melike Erol-Kantarci
{"title":"Energy-Efficient Carrier Aggregation in 5G Using Constrained Multi-Agent MDP","authors":"Medhat Elsayed;Roghayeh Joda;Fahime Khoramnejad;David Chan;Akram Bin Sediq;Gary Boudreau;Melike Erol-Kantarci","doi":"10.1109/TGCN.2024.3386066","DOIUrl":null,"url":null,"abstract":"Carrier Aggregation (CA) is a promising technology in LTE and 5G networks that enhances the throughput of the users. However, since each User Equipment (UE) has to continuously monitor the activated Component Carriers (CCs) in CA, the UE energy consumption increases. To reduce the energy consumption while maximizing the throughput of UEs, we propose a dynamic and proactive CC management scheme for 5G, using a Q-Learning algorithm. To address our problem, we first model the corresponding Constrained Multi-agent Markov Decision Process (CMMDP) model and then utilize the Q-Learning algorithm to solve it. The time inter-arrival and the size of the next incoming bursts of data are proactively predicted and, along with the data in the buffer, are considered in the state space and the reward function of the machine learning model. Our proposed scheme is compared to three baseline schemes. In the first and second baseline algorithms, all CCs and only single CC are activated for each UE, respectively. For the last baseline algorithm, we simplify our Reinforcement Learning (RL) algorithm, in which the remaining data in the scheduling buffer of users is not considered and also the throughput and the number of activated CCs is balanced in the low traffic load. Simulation results reveal that our proposed Q-Learning algorithm outperforms the baselines. It achieves the same throughput as the all CC activation algorithm while reducing the UE power consumption by about 20%. These benefits are achieved by dynamically activating and deactivating CCs according to the UE traffic pattern.","PeriodicalId":13052,"journal":{"name":"IEEE Transactions on Green Communications and Networking","volume":"8 4","pages":"1595-1606"},"PeriodicalIF":5.3000,"publicationDate":"2024-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Green Communications and Networking","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10494403/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"TELECOMMUNICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
Carrier Aggregation (CA) is a promising technology in LTE and 5G networks that enhances the throughput of the users. However, since each User Equipment (UE) has to continuously monitor the activated Component Carriers (CCs) in CA, the UE energy consumption increases. To reduce the energy consumption while maximizing the throughput of UEs, we propose a dynamic and proactive CC management scheme for 5G, using a Q-Learning algorithm. To address our problem, we first model the corresponding Constrained Multi-agent Markov Decision Process (CMMDP) model and then utilize the Q-Learning algorithm to solve it. The time inter-arrival and the size of the next incoming bursts of data are proactively predicted and, along with the data in the buffer, are considered in the state space and the reward function of the machine learning model. Our proposed scheme is compared to three baseline schemes. In the first and second baseline algorithms, all CCs and only single CC are activated for each UE, respectively. For the last baseline algorithm, we simplify our Reinforcement Learning (RL) algorithm, in which the remaining data in the scheduling buffer of users is not considered and also the throughput and the number of activated CCs is balanced in the low traffic load. Simulation results reveal that our proposed Q-Learning algorithm outperforms the baselines. It achieves the same throughput as the all CC activation algorithm while reducing the UE power consumption by about 20%. These benefits are achieved by dynamically activating and deactivating CCs according to the UE traffic pattern.