用Epsilon-Greedy动作选择评价无线电信道效用

Journal of Telecommunictions and Information Technology Pub Date : 2021-09-30 DOI:10.26636/jtit.2021.153621

Krzysztof Malon

{"title":"用Epsilon-Greedy动作选择评价无线电信道效用","authors":"Krzysztof Malon","doi":"10.26636/jtit.2021.153621","DOIUrl":null,"url":null,"abstract":"This paper presents an algorithm that supports the dynamic spectrum access process in cognitive radio networks by generating a sorted list of best radio channels or by identifying those frequency ranges that are not in use temporarily. The concept is based on the reinforcement learning technique named Q-learning. To evaluate the utility of individual radio channels, spectrum monitoring is performed. In the presented solution, the epsilon-greedy action selection method is used to indicate which channel should be monitored next. The article includes a description of the proposed algorithm, scenarios, metrics, and simulation results showing the correct operation of the approach relied upon to evaluate the utility of radio channels and the epsilon-greedy action selection method. Based on the performed tests, it is possible to determine algorithm parameters that should be used in this proposed deployment. The paper also presents a comparison of the results with two other action selection methods. Keywords—cognitive radio, dynamic spectrum access, spectrum monitoring, machine learning, Q-learning.","PeriodicalId":227678,"journal":{"name":"Journal of Telecommunictions and Information Technology","volume":"52 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Evaluation of Radio Channel Utility using Epsilon-Greedy Action Selection\",\"authors\":\"Krzysztof Malon\",\"doi\":\"10.26636/jtit.2021.153621\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper presents an algorithm that supports the dynamic spectrum access process in cognitive radio networks by generating a sorted list of best radio channels or by identifying those frequency ranges that are not in use temporarily. The concept is based on the reinforcement learning technique named Q-learning. To evaluate the utility of individual radio channels, spectrum monitoring is performed. In the presented solution, the epsilon-greedy action selection method is used to indicate which channel should be monitored next. The article includes a description of the proposed algorithm, scenarios, metrics, and simulation results showing the correct operation of the approach relied upon to evaluate the utility of radio channels and the epsilon-greedy action selection method. Based on the performed tests, it is possible to determine algorithm parameters that should be used in this proposed deployment. The paper also presents a comparison of the results with two other action selection methods. Keywords—cognitive radio, dynamic spectrum access, spectrum monitoring, machine learning, Q-learning.\",\"PeriodicalId\":227678,\"journal\":{\"name\":\"Journal of Telecommunictions and Information Technology\",\"volume\":\"52 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-09-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Telecommunictions and Information Technology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.26636/jtit.2021.153621\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Telecommunictions and Information Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.26636/jtit.2021.153621","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

摘要

本文提出了一种算法，该算法通过生成最佳无线电信道的排序列表或识别暂时不使用的频率范围来支持认知无线电网络中的动态频谱访问过程。这个概念是基于被称为Q-learning的强化学习技术。为了评估单个无线电信道的效用，进行了频谱监测。在提出的解决方案中，使用贪心动作选择方法来指示下一步应该监视哪个通道。本文包括对所提出的算法、场景、指标的描述，以及显示该方法的正确操作的仿真结果，该方法依赖于评估无线电信道的效用和epsilon贪婪动作选择方法。根据执行的测试，可以确定在此建议部署中应使用的算法参数。本文还将结果与另外两种动作选择方法进行了比较。关键词:认知无线电，动态频谱接入，频谱监测，机器学习，q -学习。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Evaluation of Radio Channel Utility using Epsilon-Greedy Action Selection

This paper presents an algorithm that supports the dynamic spectrum access process in cognitive radio networks by generating a sorted list of best radio channels or by identifying those frequency ranges that are not in use temporarily. The concept is based on the reinforcement learning technique named Q-learning. To evaluate the utility of individual radio channels, spectrum monitoring is performed. In the presented solution, the epsilon-greedy action selection method is used to indicate which channel should be monitored next. The article includes a description of the proposed algorithm, scenarios, metrics, and simulation results showing the correct operation of the approach relied upon to evaluate the utility of radio channels and the epsilon-greedy action selection method. Based on the performed tests, it is possible to determine algorithm parameters that should be used in this proposed deployment. The paper also presents a comparison of the results with two other action selection methods. Keywords—cognitive radio, dynamic spectrum access, spectrum monitoring, machine learning, Q-learning.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Journal of Telecommunictions and Information Technology

自引率

0.00%

发文量