{"title":"A Multi-Agent DRL-Based Power Allocation Mechanism for Multi-Cell NOMA Networks","authors":"MohammadAmin Lotfolahi, H. Ferng","doi":"10.1109/APWCS60142.2023.10234069","DOIUrl":null,"url":null,"abstract":"To effectively address the power allocation (PA) problem to maximize the energy efficiency (EE) for a non-orthogonal multiple access (NOMA) system, a novel action mapper alongside a multi-agent deep reinforcement learning (MADRL)-based algorithm is designed in this paper. The action mapper discretizes and merges similar actions into one action so that the action space can be significantly reduced. Then, it is integrated with a multi-agent proximal policy optimization (MAPPO) algorithm to efficiently perform the PA task. Our proposed MADRL-based algorithm with the reduced action mapper is able to find the sub-optimal solution according to the current environmental condition. Supported by our simulation results, our proposed mechanism can significantly improve the EE as compared to the closely related approaches.","PeriodicalId":375211,"journal":{"name":"2023 VTS Asia Pacific Wireless Communications Symposium (APWCS)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 VTS Asia Pacific Wireless Communications Symposium (APWCS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/APWCS60142.2023.10234069","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
To effectively address the power allocation (PA) problem to maximize the energy efficiency (EE) for a non-orthogonal multiple access (NOMA) system, a novel action mapper alongside a multi-agent deep reinforcement learning (MADRL)-based algorithm is designed in this paper. The action mapper discretizes and merges similar actions into one action so that the action space can be significantly reduced. Then, it is integrated with a multi-agent proximal policy optimization (MAPPO) algorithm to efficiently perform the PA task. Our proposed MADRL-based algorithm with the reduced action mapper is able to find the sub-optimal solution according to the current environmental condition. Supported by our simulation results, our proposed mechanism can significantly improve the EE as compared to the closely related approaches.