{"title":"Joint Beam Management and Relay Selection Using Deep Reinforcement Learning for MmWave UAV Relay Networks","authors":"Dohyun Kim, Miguel R. Castellanos, R. Heath","doi":"10.1109/MILCOM55135.2022.10017754","DOIUrl":null,"url":null,"abstract":"Unmanned aerial vehicle (UAV) relays are useful in tactical millimeter wave (mmWave) networks to overcome blockages and improve link resilience. Getting the most benefits from relays, though, requires efficient strategies for relay selection and for beam management. In this paper, we use deep reinforcement learning (DRL) to jointly select unblocked UAV relays and to perform beam management. The proposed DRL-based algorithm uses rate feedback from the receiver to learn a policy that adapts to the dynamic channel conditions. We show with numerical evaluation that the proposed method outperforms baselines without prior channel knowledge. Moreover, the DRL-based algorithm can maintain high spectral efficiency even under frequent blockages.","PeriodicalId":239804,"journal":{"name":"MILCOM 2022 - 2022 IEEE Military Communications Conference (MILCOM)","volume":"98 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"MILCOM 2022 - 2022 IEEE Military Communications Conference (MILCOM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MILCOM55135.2022.10017754","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Unmanned aerial vehicle (UAV) relays are useful in tactical millimeter wave (mmWave) networks to overcome blockages and improve link resilience. Getting the most benefits from relays, though, requires efficient strategies for relay selection and for beam management. In this paper, we use deep reinforcement learning (DRL) to jointly select unblocked UAV relays and to perform beam management. The proposed DRL-based algorithm uses rate feedback from the receiver to learn a policy that adapts to the dynamic channel conditions. We show with numerical evaluation that the proposed method outperforms baselines without prior channel knowledge. Moreover, the DRL-based algorithm can maintain high spectral efficiency even under frequent blockages.