Distributed opportunistic spectrum access with spatial reuse in cognitive radio networks
Yi Zhang, Wee Peng Tay, K. H. Li, M. Esseghir, D. Gaïti
2014 IEEE Global Conference on Signal and Information Processing (GlobalSIP), 2014-12-01
DOI: 10.1109/GlobalSIP.2014.7032321
Citations: 2
Abstract
We formulate and study a multi-user multi-armed bandit (MAB) problem for opportunistic spectrum access (OSA) that exploits the temporal-spatial reuse of primary user (PU) channels, so that secondary users (SUs) who do not interfere with each other can share the same PU channel. We propose a three-stage distributed channel allocation policy for OSA in which SUs collaboratively find an optimal channel access grouping and independently learn the channel availability statistics to maximize the total expected number of successful SU transmissions. We adopt a distributed synchronous greedy graph coloring algorithm to cluster SUs into maximal independent sets, and a distributed average consensus algorithm to learn the sizes of the independent sets; SUs belonging to a larger set are assigned a smaller access rank. Each SU then independently learns the PU channel statistics using a revised ε-greedy policy based on its assigned access rank. We provide a theoretical upper bound on the regret, and simulations suggest that the proposed policy has a significantly smaller regret than a random access policy and an adaptive randomization policy.
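
The grouping, ranking, and channel-selection steps described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' algorithm: the coloring here is a centralized, sequential greedy pass rather than the distributed synchronous version, set sizes are counted directly instead of being learned by average consensus, and the rule "an SU with rank k exploits the k-th best channel by estimated availability" is an assumption about how the access rank might enter the ε-greedy step. The function names and the 5-SU interference graph are hypothetical.

```python
import random
from collections import defaultdict


def greedy_coloring(interference):
    """Sequential greedy coloring: each SU takes the smallest color
    not already used by one of its interfering neighbors."""
    colors = {}
    for su in sorted(interference):
        taken = {colors[n] for n in interference[su] if n in colors}
        color = 0
        while color in taken:
            color += 1
        colors[su] = color
    return colors


def access_ranks(colors):
    """Assign ranks to color classes: a larger class gets a smaller rank
    (rank 1 is the largest class; ties are broken by smallest SU id)."""
    groups = defaultdict(list)
    for su, color in colors.items():
        groups[color].append(su)
    ordered = sorted(groups.values(), key=lambda g: (-len(g), min(g)))
    return {su: rank
            for rank, group in enumerate(ordered, start=1)
            for su in group}


def epsilon_greedy_choice(estimates, rank, epsilon, rng):
    """With probability epsilon explore a uniformly random channel;
    otherwise exploit the rank-th best channel (assumed rule)."""
    if rng.random() < epsilon:
        return rng.randrange(len(estimates))
    by_estimate = sorted(range(len(estimates)), key=lambda ch: -estimates[ch])
    return by_estimate[(rank - 1) % len(estimates)]


# Hypothetical interference graph: an edge joins two SUs that interfere.
interference = {0: {1, 2}, 1: {0, 2}, 2: {0, 1, 3}, 3: {2, 4}, 4: {3}}
colors = greedy_coloring(interference)
ranks = access_ranks(colors)

# Hypothetical availability estimates for three PU channels.
rng = random.Random(0)
choices = {su: epsilon_greedy_choice([0.2, 0.9, 0.5], ranks[su], 0.1, rng)
           for su in sorted(ranks)}
print(colors, ranks, choices)
```

SUs that share a color do not interfere, so they receive the same rank and may target the same channel, which is where the spatial reuse comes from. Note that a sequential greedy coloring only guarantees independent sets; it does not by itself guarantee that every color class is a maximal independent set.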