{"title":"无小区毫米波大规模MIMO中的功率分配:基于深度确定性策略梯度","authors":"Yu Zhao, Fengming Zhang, Yangjun Gao, Chaoqi Fu","doi":"10.1109/WCCCT56755.2023.10052585","DOIUrl":null,"url":null,"abstract":"This paper studies the weighted sum-spectral efficiency (SE) power allocation problem in cell-free (CF) mmWave massive multiple-input multiple-output (MIMO) with mobile user equipments (UEs). This is a non-convex problem that needs to be solved within a time constraint of the channel state information (CSI) variation. Although several heuristic methods, e.g., the weighted minimum mean square error (WMMSE) algorithm, have been developed to solve this problem, these methods entail considerable computational complexity therefore hardly meet the time constraint. To address this issue, we propose a deep reinforcement learning (DRL) solution based on the deep deterministic policy gradient (DDPG) method. It only needs several layers of neural network to perform the power allocation. The numerical results, obtained from a particular 3GPP scenario, show that the proposed DDPG method outperforms the existing algorithms.","PeriodicalId":112978,"journal":{"name":"2023 6th World Conference on Computing and Communication Technologies (WCCCT)","volume":"69 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Power Allocation in Cell-Free mmWave Massive MIMO: Using Deep Deterministic Policy Gradient\",\"authors\":\"Yu Zhao, Fengming Zhang, Yangjun Gao, Chaoqi Fu\",\"doi\":\"10.1109/WCCCT56755.2023.10052585\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper studies the weighted sum-spectral efficiency (SE) power allocation problem in cell-free (CF) mmWave massive multiple-input multiple-output (MIMO) with mobile user equipments (UEs). This is a non-convex problem that needs to be solved within a time constraint of the channel state information (CSI) variation. Although several heuristic methods, e.g., the weighted minimum mean square error (WMMSE) algorithm, have been developed to solve this problem, these methods entail considerable computational complexity therefore hardly meet the time constraint. To address this issue, we propose a deep reinforcement learning (DRL) solution based on the deep deterministic policy gradient (DDPG) method. It only needs several layers of neural network to perform the power allocation. The numerical results, obtained from a particular 3GPP scenario, show that the proposed DDPG method outperforms the existing algorithms.\",\"PeriodicalId\":112978,\"journal\":{\"name\":\"2023 6th World Conference on Computing and Communication Technologies (WCCCT)\",\"volume\":\"69 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-01-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 6th World Conference on Computing and Communication Technologies (WCCCT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/WCCCT56755.2023.10052585\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 6th World Conference on Computing and Communication Technologies (WCCCT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WCCCT56755.2023.10052585","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Power Allocation in Cell-Free mmWave Massive MIMO: Using Deep Deterministic Policy Gradient
This paper studies the weighted sum-spectral efficiency (SE) power allocation problem in cell-free (CF) mmWave massive multiple-input multiple-output (MIMO) with mobile user equipments (UEs). This is a non-convex problem that needs to be solved within a time constraint of the channel state information (CSI) variation. Although several heuristic methods, e.g., the weighted minimum mean square error (WMMSE) algorithm, have been developed to solve this problem, these methods entail considerable computational complexity therefore hardly meet the time constraint. To address this issue, we propose a deep reinforcement learning (DRL) solution based on the deep deterministic policy gradient (DDPG) method. It only needs several layers of neural network to perform the power allocation. The numerical results, obtained from a particular 3GPP scenario, show that the proposed DDPG method outperforms the existing algorithms.