非平稳多臂强盗的动态频谱接入

2008 IEEE 9th Workshop on Signal Processing Advances in Wireless Communications Pub Date : 2008-07-06 DOI:10.1109/SPAWC.2008.4641641

Ben Hadj Alaya-Feki, É. Moulines, Alain LeCornec

{"title":"非平稳多臂强盗的动态频谱接入","authors":"Ben Hadj Alaya-Feki, É. Moulines, Alain LeCornec","doi":"10.1109/SPAWC.2008.4641641","DOIUrl":null,"url":null,"abstract":"Dynamic spectrum access (DSA) is an emerging notion in cognitive radio, aiming to improve the spectrum usage with reliable secondary access to the spectral resources. The main challenge in DSA is the detection of spectral opportunities and their efficient utilization without causing interference to the primary users. For this goal, we propose to make use of a reinforcement learning approach: the Multi Armed Bandit (MAB). The MAB approach provides the secondary users with the rules and policies necessary to achieve a tradeoff between exploitation and exploration in DSA. Different MAB strategies are tested on an IEEE802.11 medium access model and evaluated in dynamic environment. Our study shows that the MAB constitute a viable solution for the DSA. Adding to that, the performances of the MAB algorithms can be improved with a finite tuning of the internal parameters.","PeriodicalId":197154,"journal":{"name":"2008 IEEE 9th Workshop on Signal Processing Advances in Wireless Communications","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"25","resultStr":"{\"title\":\"Dynamic spectrum access with non-stationary Multi-Armed Bandit\",\"authors\":\"Ben Hadj Alaya-Feki, É. Moulines, Alain LeCornec\",\"doi\":\"10.1109/SPAWC.2008.4641641\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Dynamic spectrum access (DSA) is an emerging notion in cognitive radio, aiming to improve the spectrum usage with reliable secondary access to the spectral resources. The main challenge in DSA is the detection of spectral opportunities and their efficient utilization without causing interference to the primary users. For this goal, we propose to make use of a reinforcement learning approach: the Multi Armed Bandit (MAB). The MAB approach provides the secondary users with the rules and policies necessary to achieve a tradeoff between exploitation and exploration in DSA. Different MAB strategies are tested on an IEEE802.11 medium access model and evaluated in dynamic environment. Our study shows that the MAB constitute a viable solution for the DSA. Adding to that, the performances of the MAB algorithms can be improved with a finite tuning of the internal parameters.\",\"PeriodicalId\":197154,\"journal\":{\"name\":\"2008 IEEE 9th Workshop on Signal Processing Advances in Wireless Communications\",\"volume\":\"10 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2008-07-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"25\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2008 IEEE 9th Workshop on Signal Processing Advances in Wireless Communications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SPAWC.2008.4641641\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 IEEE 9th Workshop on Signal Processing Advances in Wireless Communications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SPAWC.2008.4641641","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 25

摘要

动态频谱接入(DSA)是认知无线电领域的一个新兴概念，旨在通过对频谱资源的可靠二次接入来提高频谱利用率。DSA的主要挑战是在不对主要用户造成干扰的情况下检测频谱机会并有效利用它们。为了实现这个目标，我们建议使用一种强化学习方法:Multi - Armed Bandit (MAB)。MAB方法为次要用户提供必要的规则和策略，以实现DSA中开发和探索之间的权衡。在IEEE802.11介质访问模型上对不同的MAB策略进行了测试，并在动态环境中进行了评估。我们的研究表明，MAB是一种可行的DSA解决方案。此外，MAB算法的性能可以通过有限的内部参数调整来提高。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Dynamic spectrum access with non-stationary Multi-Armed Bandit

Dynamic spectrum access (DSA) is an emerging notion in cognitive radio, aiming to improve the spectrum usage with reliable secondary access to the spectral resources. The main challenge in DSA is the detection of spectral opportunities and their efficient utilization without causing interference to the primary users. For this goal, we propose to make use of a reinforcement learning approach: the Multi Armed Bandit (MAB). The MAB approach provides the secondary users with the rules and policies necessary to achieve a tradeoff between exploitation and exploration in DSA. Different MAB strategies are tested on an IEEE802.11 medium access model and evaluated in dynamic environment. Our study shows that the MAB constitute a viable solution for the DSA. Adding to that, the performances of the MAB algorithms can be improved with a finite tuning of the internal parameters.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2008 IEEE 9th Workshop on Signal Processing Advances in Wireless Communications

自引率

0.00%

发文量