{"title":"5G及以上毫米波系统的双深度q -学习和基于SAC的混合波束成形","authors":"Youness Arjoune, S. Faruque","doi":"10.1109/EIT51626.2021.9491918","DOIUrl":null,"url":null,"abstract":"Fully digital beamforming techniques are costly and power hungry when employed with massive multiple input multiple output (MIMO) systems at the millimeter-Wave (mmWave) bands. Hybrid beamforming, which uses a very few number of radio frequency chain with a network of phase shifters, is a cost- and an energy-efficient alternative beamforming solution. However, the unit modulus constraint imposed by the phase shifters makes this problem inherently nonconvex, therefore may induce unaffordable computational complexity. Hence, designing hybrid beamforming algorithms which further improve the spectral efficiency, hardware efficiency, and computational efficiency is of crucial importance. Therefore, in this paper, we aim at solving hybrid beamforming using the theory of deep reinforcement learning, which has been successful in solving several high-dimensional nonconvex problems. Although deep reinforcement learning has been previously proposed, nearly all prior studies focus on Q-learning, which is known to suffer from overestimation bias. Different from these previous studies, we propose to solve hybrid beamforming for a single-user massive MIMO (SU-MIMO) using two methods 1) a double deep Q- learning with replay experience and soft target network updates method for discrete action space setting; 2) a soft actor critic method for continuous action space setting. These models are tested using numerical simulations and evaluated using the spectral efficiency and computational efficiency. This paper shows that these models can achieve a near-optimal performance in terms of the spectral efficiency while reducing the computational complexity.","PeriodicalId":162816,"journal":{"name":"2021 IEEE International Conference on Electro Information Technology (EIT)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-05-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Double Deep Q-Learning and SAC Based Hybrid Beamforming for 5G and Beyond Millimeter-Wave Systems\",\"authors\":\"Youness Arjoune, S. Faruque\",\"doi\":\"10.1109/EIT51626.2021.9491918\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Fully digital beamforming techniques are costly and power hungry when employed with massive multiple input multiple output (MIMO) systems at the millimeter-Wave (mmWave) bands. Hybrid beamforming, which uses a very few number of radio frequency chain with a network of phase shifters, is a cost- and an energy-efficient alternative beamforming solution. However, the unit modulus constraint imposed by the phase shifters makes this problem inherently nonconvex, therefore may induce unaffordable computational complexity. Hence, designing hybrid beamforming algorithms which further improve the spectral efficiency, hardware efficiency, and computational efficiency is of crucial importance. Therefore, in this paper, we aim at solving hybrid beamforming using the theory of deep reinforcement learning, which has been successful in solving several high-dimensional nonconvex problems. Although deep reinforcement learning has been previously proposed, nearly all prior studies focus on Q-learning, which is known to suffer from overestimation bias. Different from these previous studies, we propose to solve hybrid beamforming for a single-user massive MIMO (SU-MIMO) using two methods 1) a double deep Q- learning with replay experience and soft target network updates method for discrete action space setting; 2) a soft actor critic method for continuous action space setting. These models are tested using numerical simulations and evaluated using the spectral efficiency and computational efficiency. This paper shows that these models can achieve a near-optimal performance in terms of the spectral efficiency while reducing the computational complexity.\",\"PeriodicalId\":162816,\"journal\":{\"name\":\"2021 IEEE International Conference on Electro Information Technology (EIT)\",\"volume\":\"18 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-05-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE International Conference on Electro Information Technology (EIT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/EIT51626.2021.9491918\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE International Conference on Electro Information Technology (EIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/EIT51626.2021.9491918","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Double Deep Q-Learning and SAC Based Hybrid Beamforming for 5G and Beyond Millimeter-Wave Systems
Fully digital beamforming techniques are costly and power hungry when employed with massive multiple input multiple output (MIMO) systems at the millimeter-Wave (mmWave) bands. Hybrid beamforming, which uses a very few number of radio frequency chain with a network of phase shifters, is a cost- and an energy-efficient alternative beamforming solution. However, the unit modulus constraint imposed by the phase shifters makes this problem inherently nonconvex, therefore may induce unaffordable computational complexity. Hence, designing hybrid beamforming algorithms which further improve the spectral efficiency, hardware efficiency, and computational efficiency is of crucial importance. Therefore, in this paper, we aim at solving hybrid beamforming using the theory of deep reinforcement learning, which has been successful in solving several high-dimensional nonconvex problems. Although deep reinforcement learning has been previously proposed, nearly all prior studies focus on Q-learning, which is known to suffer from overestimation bias. Different from these previous studies, we propose to solve hybrid beamforming for a single-user massive MIMO (SU-MIMO) using two methods 1) a double deep Q- learning with replay experience and soft target network updates method for discrete action space setting; 2) a soft actor critic method for continuous action space setting. These models are tested using numerical simulations and evaluated using the spectral efficiency and computational efficiency. This paper shows that these models can achieve a near-optimal performance in terms of the spectral efficiency while reducing the computational complexity.