Multiagent Multi-Armed Bandit Techniques for Millimeter Wave Concurrent Beamforming
S. Hashima, Kohei Hatano, H. Kasban, M. Rihan, E. M. Mohamed
2020 8th International Japan-Africa Conference on Electronics, Communications, and Computations (JAC-ECC), published 2020-12-14
DOI: 10.1109/JAC-ECC51597.2020.9355899 (https://doi.org/10.1109/JAC-ECC51597.2020.9355899)
Citations: 2
Abstract
This paper leverages multiagent multi-armed bandit (MA-MAB) schemes to efficiently handle the millimeter wave (mmWave) concurrent transmission problem. MA-MAB selects, from the concurrent links, the beams with the maximum long-term reward, typically the data rate. The mmWave access points (APs) are the agents of the bandit game, and the arms are the available beam directions. To this end, MA-KLUCB and MA-EXP3 are proposed and tested by running them within each AP in a selfish concurrent beamforming scenario. The proposed algorithms achieve near-optimal performance compared with an exhaustive search over all concurrent beam combinations, and they converge reasonably fast.
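The agent/arm mapping described in the abstract can be illustrated with a minimal sketch. It assumes the textbook EXP3 update run independently inside each AP, which may differ from the paper's MA-EXP3 in its details. The reward function `toy_rate`, the collision penalty, the exploration rate `gamma`, and all other names and parameter values are hypothetical and are not taken from the paper.

```python
import math
import random


class Exp3Agent:
    """One AP running textbook EXP3 over its own beam directions (arms)."""

    def __init__(self, n_beams, gamma=0.1, rng=None):
        self.n_beams = n_beams
        self.gamma = gamma              # exploration rate (assumed value)
        self.weights = [1.0] * n_beams
        self.rng = rng or random.Random()

    def probabilities(self):
        # Mix the weight-proportional distribution with uniform exploration.
        total = sum(self.weights)
        return [(1 - self.gamma) * w / total + self.gamma / self.n_beams
                for w in self.weights]

    def select_beam(self):
        probs = self.probabilities()
        beam = self.rng.choices(range(self.n_beams), weights=probs)[0]
        return beam, probs[beam]

    def update(self, beam, prob, reward):
        # Importance-weighted reward estimate; reward must lie in [0, 1].
        estimate = reward / prob
        self.weights[beam] *= math.exp(self.gamma * estimate / self.n_beams)
        # Rescale by the largest weight; probabilities are unchanged.
        largest = max(self.weights)
        self.weights = [w / largest for w in self.weights]


def toy_rate(ap, beams, best_beams):
    """Hypothetical normalised data rate in [0, 1] for one AP."""
    signal = 1.0 if beams[ap] == best_beams[ap] else 0.2
    # Toy interference: other APs that picked the same beam index.
    collisions = sum(1 for other, b in enumerate(beams)
                     if other != ap and b == beams[ap])
    return signal / (1.0 + 0.5 * collisions)


def run_selfish_game(n_aps=2, n_beams=4, rounds=3000, seed=0):
    """Each AP learns selfishly from its own observed rate only."""
    rng = random.Random(seed)
    best_beams = [ap % n_beams for ap in range(n_aps)]  # assumed ground truth
    agents = [Exp3Agent(n_beams, rng=rng) for _ in range(n_aps)]
    for _ in range(rounds):
        picks = [agent.select_beam() for agent in agents]
        beams = [beam for beam, _ in picks]
        for ap, (agent, (beam, prob)) in enumerate(zip(agents, picks)):
            agent.update(beam, prob, toy_rate(ap, beams, best_beams))
    return agents, best_beams
```

Calling `run_selfish_game()` and inspecting each agent's `probabilities()` shows how the selection distribution concentrates on one beam per AP. Because no AP observes the others' choices, the coupling between agents enters only through the reward, which is the sense of "selfish" assumed here. A KL-UCB agent could be swapped in behind the same `select_beam`/`update` interface.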