{"title":"Channel probing for opportunistic access with multi-channel sensing","authors":"Keqin Liu, Qing Zhao","doi":"10.1109/ACSSC.2008.5074369","DOIUrl":null,"url":null,"abstract":"We consider an opportunistic communication system consisting of multiple independent channels with time-varying states. We formulate the problem of optimal sequential channel selection as a restless multi-armed bandit process, for which a powerful policy-Whittle's index policy-can be implemented based on the indexability of the system. We obtain Whittle's index in closed-form under the average reward criterion, which leads to the direct implementation of Whittle's index policy. To evaluate the performance of Whittle's index policy, we provide simple algorithms to calculate an upper bound of the optimal performance. The tightness of the upper bound and the near-optimal performance of Whittle's index policy are illustrated with simulation examples. When channels are stochastically identical, we show that Whittle's index policy is equivalent to the myopic policy, which has a simple and robust structure. Based on this structure, we establish the approximation factors of the performance of Whittle's index policy. Furthermore, we show that Whittle's index policy is optimal under certain conditions.","PeriodicalId":416114,"journal":{"name":"2008 42nd Asilomar Conference on Signals, Systems and Computers","volume":"75 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"17","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 42nd Asilomar Conference on Signals, Systems and Computers","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ACSSC.2008.5074369","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 17
Abstract
We consider an opportunistic communication system consisting of multiple independent channels with time-varying states. We formulate the problem of optimal sequential channel selection as a restless multi-armed bandit process, for which a powerful policy-Whittle's index policy-can be implemented based on the indexability of the system. We obtain Whittle's index in closed-form under the average reward criterion, which leads to the direct implementation of Whittle's index policy. To evaluate the performance of Whittle's index policy, we provide simple algorithms to calculate an upper bound of the optimal performance. The tightness of the upper bound and the near-optimal performance of Whittle's index policy are illustrated with simulation examples. When channels are stochastically identical, we show that Whittle's index policy is equivalent to the myopic policy, which has a simple and robust structure. Based on this structure, we establish the approximation factors of the performance of Whittle's index policy. Furthermore, we show that Whittle's index policy is optimal under certain conditions.