Efficient Learning in Stationary and Non-stationary OSA Scenario with QoS Guaranty

Navikkumar Modi, P. Mary, C. Moy
{"title":"Efficient Learning in Stationary and Non-stationary OSA Scenario with QoS Guaranty","authors":"Navikkumar Modi, P. Mary, C. Moy","doi":"10.4108/eai.9-1-2017.152098","DOIUrl":null,"url":null,"abstract":"In this work, the opportunistic spectrum access (OSA) problem is addressed with stationary and nonstationary Markov multi-armed bandit (MAB) frameworks. We propose a novel index based algorithm named QoS-UCB that balances exploration in terms of occupancy and quality, e.g. signal to noise ratio (SNR) for transmission, for stationary environments. Furthermore, we propose another learning policy, named discounted QoS-UCB (DQoS-UCB), for the non-stationary case. Our contribution in terms of numerical analysis is twofold: i) In stationary OSA scenario, we numerically compare our QoS-UCB policy with an existing UCB1 and also show that QoS-UCB outperforms UCB1 in terms of regret and ii) in non-stationary OSA scenario, numerical results state that proposed DQoS-UCB policy outperforms other simple UCBs and also QoS-UCB policy. To the best of our knowledge, this is the first learning algorithm which adapts to nonstationary Markov MAB framework and also quantifies channel quality information. Received on XXXX; accepted on XXXX; published on XXXX","PeriodicalId":288158,"journal":{"name":"EAI Endorsed Trans. Wirel. Spectr.","volume":"2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-01-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"EAI Endorsed Trans. Wirel. Spectr.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4108/eai.9-1-2017.152098","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

In this work, the opportunistic spectrum access (OSA) problem is addressed with stationary and nonstationary Markov multi-armed bandit (MAB) frameworks. We propose a novel index based algorithm named QoS-UCB that balances exploration in terms of occupancy and quality, e.g. signal to noise ratio (SNR) for transmission, for stationary environments. Furthermore, we propose another learning policy, named discounted QoS-UCB (DQoS-UCB), for the non-stationary case. Our contribution in terms of numerical analysis is twofold: i) In stationary OSA scenario, we numerically compare our QoS-UCB policy with an existing UCB1 and also show that QoS-UCB outperforms UCB1 in terms of regret and ii) in non-stationary OSA scenario, numerical results state that proposed DQoS-UCB policy outperforms other simple UCBs and also QoS-UCB policy. To the best of our knowledge, this is the first learning algorithm which adapts to nonstationary Markov MAB framework and also quantifies channel quality information. Received on XXXX; accepted on XXXX; published on XXXX
具有QoS保证的平稳和非平稳OSA场景下的高效学习
在这项工作中,利用平稳和非平稳马尔可夫多臂强盗(MAB)框架解决了机会性频谱接入(OSA)问题。我们提出了一种新的基于索引的算法,称为QoS-UCB,它在占用和质量方面平衡了探索,例如在固定环境中传输的信噪比(SNR)。此外,针对非平稳情况,我们提出了另一种学习策略,称为折扣QoS-UCB (DQoS-UCB)。我们在数值分析方面的贡献是双重的:i)在平稳OSA场景中,我们将我们的QoS-UCB策略与现有的UCB1进行了数值比较,并表明QoS-UCB在遗憾方面优于UCB1; ii)在非平稳OSA场景中,数值结果表明,提出的DQoS-UCB策略优于其他简单的ucb和QoS-UCB策略。据我们所知,这是第一个适应非平稳马尔可夫MAB框架并量化信道质量信息的学习算法。XXXX年收到;XXXX日验收;发表于XXXX
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信