5G及以上毫米波系统的双深度q -学习和基于SAC的混合波束成形

2021 IEEE International Conference on Electro Information Technology (EIT) Pub Date : 2021-05-14 DOI:10.1109/EIT51626.2021.9491918

Youness Arjoune, S. Faruque

{"title":"5G及以上毫米波系统的双深度q -学习和基于SAC的混合波束成形","authors":"Youness Arjoune, S. Faruque","doi":"10.1109/EIT51626.2021.9491918","DOIUrl":null,"url":null,"abstract":"Fully digital beamforming techniques are costly and power hungry when employed with massive multiple input multiple output (MIMO) systems at the millimeter-Wave (mmWave) bands. Hybrid beamforming, which uses a very few number of radio frequency chain with a network of phase shifters, is a cost- and an energy-efficient alternative beamforming solution. However, the unit modulus constraint imposed by the phase shifters makes this problem inherently nonconvex, therefore may induce unaffordable computational complexity. Hence, designing hybrid beamforming algorithms which further improve the spectral efficiency, hardware efficiency, and computational efficiency is of crucial importance. Therefore, in this paper, we aim at solving hybrid beamforming using the theory of deep reinforcement learning, which has been successful in solving several high-dimensional nonconvex problems. Although deep reinforcement learning has been previously proposed, nearly all prior studies focus on Q-learning, which is known to suffer from overestimation bias. Different from these previous studies, we propose to solve hybrid beamforming for a single-user massive MIMO (SU-MIMO) using two methods 1) a double deep Q- learning with replay experience and soft target network updates method for discrete action space setting; 2) a soft actor critic method for continuous action space setting. These models are tested using numerical simulations and evaluated using the spectral efficiency and computational efficiency. This paper shows that these models can achieve a near-optimal performance in terms of the spectral efficiency while reducing the computational complexity.","PeriodicalId":162816,"journal":{"name":"2021 IEEE International Conference on Electro Information Technology (EIT)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-05-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Double Deep Q-Learning and SAC Based Hybrid Beamforming for 5G and Beyond Millimeter-Wave Systems\",\"authors\":\"Youness Arjoune, S. Faruque\",\"doi\":\"10.1109/EIT51626.2021.9491918\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Fully digital beamforming techniques are costly and power hungry when employed with massive multiple input multiple output (MIMO) systems at the millimeter-Wave (mmWave) bands. Hybrid beamforming, which uses a very few number of radio frequency chain with a network of phase shifters, is a cost- and an energy-efficient alternative beamforming solution. However, the unit modulus constraint imposed by the phase shifters makes this problem inherently nonconvex, therefore may induce unaffordable computational complexity. Hence, designing hybrid beamforming algorithms which further improve the spectral efficiency, hardware efficiency, and computational efficiency is of crucial importance. Therefore, in this paper, we aim at solving hybrid beamforming using the theory of deep reinforcement learning, which has been successful in solving several high-dimensional nonconvex problems. Although deep reinforcement learning has been previously proposed, nearly all prior studies focus on Q-learning, which is known to suffer from overestimation bias. Different from these previous studies, we propose to solve hybrid beamforming for a single-user massive MIMO (SU-MIMO) using two methods 1) a double deep Q- learning with replay experience and soft target network updates method for discrete action space setting; 2) a soft actor critic method for continuous action space setting. These models are tested using numerical simulations and evaluated using the spectral efficiency and computational efficiency. This paper shows that these models can achieve a near-optimal performance in terms of the spectral efficiency while reducing the computational complexity.\",\"PeriodicalId\":162816,\"journal\":{\"name\":\"2021 IEEE International Conference on Electro Information Technology (EIT)\",\"volume\":\"18 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-05-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE International Conference on Electro Information Technology (EIT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/EIT51626.2021.9491918\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE International Conference on Electro Information Technology (EIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/EIT51626.2021.9491918","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 4

摘要

当使用毫米波(mmWave)频段的大规模多输入多输出(MIMO)系统时，全数字波束形成技术成本高，功耗大。混合波束形成，使用很少数量的射频链和移相器网络，是一种成本低且节能的替代波束形成解决方案。然而，由移相器施加的单位模量约束使得这个问题本质上是非凸的，因此可能会导致难以承受的计算复杂性。因此，设计能够进一步提高频谱效率、硬件效率和计算效率的混合波束形成算法至关重要。因此，在本文中，我们的目标是利用深度强化学习理论来解决混合波束形成问题，该理论已经成功地解决了几个高维非凸问题。虽然深度强化学习之前已经被提出，但几乎所有先前的研究都集中在q学习上，而q学习被认为存在高估偏差。与以往的研究不同，我们提出了两种方法来解决单用户大规模MIMO (SU-MIMO)的混合波束形成问题:1)具有重放经验的双深度Q学习和离散动作空间设置的软目标网络更新方法;2)连续动作空间设置的软演员评价方法。利用数值模拟对这些模型进行了测试，并利用光谱效率和计算效率对这些模型进行了评估。本文表明，这些模型在降低计算复杂度的同时，在频谱效率方面达到了接近最优的性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Double Deep Q-Learning and SAC Based Hybrid Beamforming for 5G and Beyond Millimeter-Wave Systems

Fully digital beamforming techniques are costly and power hungry when employed with massive multiple input multiple output (MIMO) systems at the millimeter-Wave (mmWave) bands. Hybrid beamforming, which uses a very few number of radio frequency chain with a network of phase shifters, is a cost- and an energy-efficient alternative beamforming solution. However, the unit modulus constraint imposed by the phase shifters makes this problem inherently nonconvex, therefore may induce unaffordable computational complexity. Hence, designing hybrid beamforming algorithms which further improve the spectral efficiency, hardware efficiency, and computational efficiency is of crucial importance. Therefore, in this paper, we aim at solving hybrid beamforming using the theory of deep reinforcement learning, which has been successful in solving several high-dimensional nonconvex problems. Although deep reinforcement learning has been previously proposed, nearly all prior studies focus on Q-learning, which is known to suffer from overestimation bias. Different from these previous studies, we propose to solve hybrid beamforming for a single-user massive MIMO (SU-MIMO) using two methods 1) a double deep Q- learning with replay experience and soft target network updates method for discrete action space setting; 2) a soft actor critic method for continuous action space setting. These models are tested using numerical simulations and evaluated using the spectral efficiency and computational efficiency. This paper shows that these models can achieve a near-optimal performance in terms of the spectral efficiency while reducing the computational complexity.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2021 IEEE International Conference on Electro Information Technology (EIT)

自引率

0.00%

发文量