Jianzhong Yi, Chao Dong, K. Niu, Qiulin Xue, Junping Zhang
{"title":"基于强化学习的毫米波系统自适应波束切换算法","authors":"Jianzhong Yi, Chao Dong, K. Niu, Qiulin Xue, Junping Zhang","doi":"10.1109/ICCC56324.2022.10065827","DOIUrl":null,"url":null,"abstract":"In millimeter wave (mmWave) frequency band, the link quality is greatly affected by the environment. Coupled with the dense deployment of mmWave access points (APs) and the using of beamforming technology, beam handovers frequently occur in mobile communication systems. This paper optimizes the mmWave beam handover process using the Q-Learning method. Our proposed algorithm learns and senses the complex communication environment by the beam reporting information from the users. We consider the effects of handover cost and historical beam quality during the handover process. The adaptive handover threshold is obtained by querying the Q table according to the current state. Simulation results show that our proposed algorithm reduces the number of beam handover times and improves the system performance compared with the original scheme in the 3GPP protocol.","PeriodicalId":263098,"journal":{"name":"2022 IEEE 8th International Conference on Computer and Communications (ICCC)","volume":"121 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Adaptive Beam Handover Algorithm Based on Reinforcement Learning for mmWave System\",\"authors\":\"Jianzhong Yi, Chao Dong, K. Niu, Qiulin Xue, Junping Zhang\",\"doi\":\"10.1109/ICCC56324.2022.10065827\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In millimeter wave (mmWave) frequency band, the link quality is greatly affected by the environment. Coupled with the dense deployment of mmWave access points (APs) and the using of beamforming technology, beam handovers frequently occur in mobile communication systems. This paper optimizes the mmWave beam handover process using the Q-Learning method. Our proposed algorithm learns and senses the complex communication environment by the beam reporting information from the users. We consider the effects of handover cost and historical beam quality during the handover process. The adaptive handover threshold is obtained by querying the Q table according to the current state. Simulation results show that our proposed algorithm reduces the number of beam handover times and improves the system performance compared with the original scheme in the 3GPP protocol.\",\"PeriodicalId\":263098,\"journal\":{\"name\":\"2022 IEEE 8th International Conference on Computer and Communications (ICCC)\",\"volume\":\"121 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-12-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE 8th International Conference on Computer and Communications (ICCC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCC56324.2022.10065827\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 8th International Conference on Computer and Communications (ICCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCC56324.2022.10065827","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Adaptive Beam Handover Algorithm Based on Reinforcement Learning for mmWave System
In millimeter wave (mmWave) frequency band, the link quality is greatly affected by the environment. Coupled with the dense deployment of mmWave access points (APs) and the using of beamforming technology, beam handovers frequently occur in mobile communication systems. This paper optimizes the mmWave beam handover process using the Q-Learning method. Our proposed algorithm learns and senses the complex communication environment by the beam reporting information from the users. We consider the effects of handover cost and historical beam quality during the handover process. The adaptive handover threshold is obtained by querying the Q table according to the current state. Simulation results show that our proposed algorithm reduces the number of beam handover times and improves the system performance compared with the original scheme in the 3GPP protocol.