5G及以上毫米波系统的双深度q -学习和基于SAC的混合波束成形

Youness Arjoune, S. Faruque
{"title":"5G及以上毫米波系统的双深度q -学习和基于SAC的混合波束成形","authors":"Youness Arjoune, S. Faruque","doi":"10.1109/EIT51626.2021.9491918","DOIUrl":null,"url":null,"abstract":"Fully digital beamforming techniques are costly and power hungry when employed with massive multiple input multiple output (MIMO) systems at the millimeter-Wave (mmWave) bands. Hybrid beamforming, which uses a very few number of radio frequency chain with a network of phase shifters, is a cost- and an energy-efficient alternative beamforming solution. However, the unit modulus constraint imposed by the phase shifters makes this problem inherently nonconvex, therefore may induce unaffordable computational complexity. Hence, designing hybrid beamforming algorithms which further improve the spectral efficiency, hardware efficiency, and computational efficiency is of crucial importance. Therefore, in this paper, we aim at solving hybrid beamforming using the theory of deep reinforcement learning, which has been successful in solving several high-dimensional nonconvex problems. Although deep reinforcement learning has been previously proposed, nearly all prior studies focus on Q-learning, which is known to suffer from overestimation bias. Different from these previous studies, we propose to solve hybrid beamforming for a single-user massive MIMO (SU-MIMO) using two methods 1) a double deep Q- learning with replay experience and soft target network updates method for discrete action space setting; 2) a soft actor critic method for continuous action space setting. These models are tested using numerical simulations and evaluated using the spectral efficiency and computational efficiency. This paper shows that these models can achieve a near-optimal performance in terms of the spectral efficiency while reducing the computational complexity.","PeriodicalId":162816,"journal":{"name":"2021 IEEE International Conference on Electro Information Technology (EIT)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-05-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Double Deep Q-Learning and SAC Based Hybrid Beamforming for 5G and Beyond Millimeter-Wave Systems\",\"authors\":\"Youness Arjoune, S. Faruque\",\"doi\":\"10.1109/EIT51626.2021.9491918\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Fully digital beamforming techniques are costly and power hungry when employed with massive multiple input multiple output (MIMO) systems at the millimeter-Wave (mmWave) bands. Hybrid beamforming, which uses a very few number of radio frequency chain with a network of phase shifters, is a cost- and an energy-efficient alternative beamforming solution. However, the unit modulus constraint imposed by the phase shifters makes this problem inherently nonconvex, therefore may induce unaffordable computational complexity. Hence, designing hybrid beamforming algorithms which further improve the spectral efficiency, hardware efficiency, and computational efficiency is of crucial importance. Therefore, in this paper, we aim at solving hybrid beamforming using the theory of deep reinforcement learning, which has been successful in solving several high-dimensional nonconvex problems. Although deep reinforcement learning has been previously proposed, nearly all prior studies focus on Q-learning, which is known to suffer from overestimation bias. Different from these previous studies, we propose to solve hybrid beamforming for a single-user massive MIMO (SU-MIMO) using two methods 1) a double deep Q- learning with replay experience and soft target network updates method for discrete action space setting; 2) a soft actor critic method for continuous action space setting. These models are tested using numerical simulations and evaluated using the spectral efficiency and computational efficiency. This paper shows that these models can achieve a near-optimal performance in terms of the spectral efficiency while reducing the computational complexity.\",\"PeriodicalId\":162816,\"journal\":{\"name\":\"2021 IEEE International Conference on Electro Information Technology (EIT)\",\"volume\":\"18 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-05-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE International Conference on Electro Information Technology (EIT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/EIT51626.2021.9491918\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE International Conference on Electro Information Technology (EIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/EIT51626.2021.9491918","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

摘要

当使用毫米波(mmWave)频段的大规模多输入多输出(MIMO)系统时,全数字波束形成技术成本高,功耗大。混合波束形成,使用很少数量的射频链和移相器网络,是一种成本低且节能的替代波束形成解决方案。然而,由移相器施加的单位模量约束使得这个问题本质上是非凸的,因此可能会导致难以承受的计算复杂性。因此,设计能够进一步提高频谱效率、硬件效率和计算效率的混合波束形成算法至关重要。因此,在本文中,我们的目标是利用深度强化学习理论来解决混合波束形成问题,该理论已经成功地解决了几个高维非凸问题。虽然深度强化学习之前已经被提出,但几乎所有先前的研究都集中在q学习上,而q学习被认为存在高估偏差。与以往的研究不同,我们提出了两种方法来解决单用户大规模MIMO (SU-MIMO)的混合波束形成问题:1)具有重放经验的双深度Q学习和离散动作空间设置的软目标网络更新方法;2)连续动作空间设置的软演员评价方法。利用数值模拟对这些模型进行了测试,并利用光谱效率和计算效率对这些模型进行了评估。本文表明,这些模型在降低计算复杂度的同时,在频谱效率方面达到了接近最优的性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Double Deep Q-Learning and SAC Based Hybrid Beamforming for 5G and Beyond Millimeter-Wave Systems
Fully digital beamforming techniques are costly and power hungry when employed with massive multiple input multiple output (MIMO) systems at the millimeter-Wave (mmWave) bands. Hybrid beamforming, which uses a very few number of radio frequency chain with a network of phase shifters, is a cost- and an energy-efficient alternative beamforming solution. However, the unit modulus constraint imposed by the phase shifters makes this problem inherently nonconvex, therefore may induce unaffordable computational complexity. Hence, designing hybrid beamforming algorithms which further improve the spectral efficiency, hardware efficiency, and computational efficiency is of crucial importance. Therefore, in this paper, we aim at solving hybrid beamforming using the theory of deep reinforcement learning, which has been successful in solving several high-dimensional nonconvex problems. Although deep reinforcement learning has been previously proposed, nearly all prior studies focus on Q-learning, which is known to suffer from overestimation bias. Different from these previous studies, we propose to solve hybrid beamforming for a single-user massive MIMO (SU-MIMO) using two methods 1) a double deep Q- learning with replay experience and soft target network updates method for discrete action space setting; 2) a soft actor critic method for continuous action space setting. These models are tested using numerical simulations and evaluated using the spectral efficiency and computational efficiency. This paper shows that these models can achieve a near-optimal performance in terms of the spectral efficiency while reducing the computational complexity.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信