{"title":"非平稳多臂强盗的动态频谱接入","authors":"Ben Hadj Alaya-Feki, É. Moulines, Alain LeCornec","doi":"10.1109/SPAWC.2008.4641641","DOIUrl":null,"url":null,"abstract":"Dynamic spectrum access (DSA) is an emerging notion in cognitive radio, aiming to improve the spectrum usage with reliable secondary access to the spectral resources. The main challenge in DSA is the detection of spectral opportunities and their efficient utilization without causing interference to the primary users. For this goal, we propose to make use of a reinforcement learning approach: the Multi Armed Bandit (MAB). The MAB approach provides the secondary users with the rules and policies necessary to achieve a tradeoff between exploitation and exploration in DSA. Different MAB strategies are tested on an IEEE802.11 medium access model and evaluated in dynamic environment. Our study shows that the MAB constitute a viable solution for the DSA. Adding to that, the performances of the MAB algorithms can be improved with a finite tuning of the internal parameters.","PeriodicalId":197154,"journal":{"name":"2008 IEEE 9th Workshop on Signal Processing Advances in Wireless Communications","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"25","resultStr":"{\"title\":\"Dynamic spectrum access with non-stationary Multi-Armed Bandit\",\"authors\":\"Ben Hadj Alaya-Feki, É. Moulines, Alain LeCornec\",\"doi\":\"10.1109/SPAWC.2008.4641641\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Dynamic spectrum access (DSA) is an emerging notion in cognitive radio, aiming to improve the spectrum usage with reliable secondary access to the spectral resources. The main challenge in DSA is the detection of spectral opportunities and their efficient utilization without causing interference to the primary users. For this goal, we propose to make use of a reinforcement learning approach: the Multi Armed Bandit (MAB). The MAB approach provides the secondary users with the rules and policies necessary to achieve a tradeoff between exploitation and exploration in DSA. Different MAB strategies are tested on an IEEE802.11 medium access model and evaluated in dynamic environment. Our study shows that the MAB constitute a viable solution for the DSA. Adding to that, the performances of the MAB algorithms can be improved with a finite tuning of the internal parameters.\",\"PeriodicalId\":197154,\"journal\":{\"name\":\"2008 IEEE 9th Workshop on Signal Processing Advances in Wireless Communications\",\"volume\":\"10 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2008-07-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"25\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2008 IEEE 9th Workshop on Signal Processing Advances in Wireless Communications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SPAWC.2008.4641641\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 IEEE 9th Workshop on Signal Processing Advances in Wireless Communications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SPAWC.2008.4641641","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Dynamic spectrum access with non-stationary Multi-Armed Bandit
Dynamic spectrum access (DSA) is an emerging notion in cognitive radio, aiming to improve the spectrum usage with reliable secondary access to the spectral resources. The main challenge in DSA is the detection of spectral opportunities and their efficient utilization without causing interference to the primary users. For this goal, we propose to make use of a reinforcement learning approach: the Multi Armed Bandit (MAB). The MAB approach provides the secondary users with the rules and policies necessary to achieve a tradeoff between exploitation and exploration in DSA. Different MAB strategies are tested on an IEEE802.11 medium access model and evaluated in dynamic environment. Our study shows that the MAB constitute a viable solution for the DSA. Adding to that, the performances of the MAB algorithms can be improved with a finite tuning of the internal parameters.