Evaluation of Radio Channel Utility using Epsilon-Greedy Action Selection

Krzysztof Malon
{"title":"Evaluation of Radio Channel Utility using Epsilon-Greedy Action Selection","authors":"Krzysztof Malon","doi":"10.26636/jtit.2021.153621","DOIUrl":null,"url":null,"abstract":"This paper presents an algorithm that supports the dynamic spectrum access process in cognitive radio networks by generating a sorted list of best radio channels or by identifying those frequency ranges that are not in use temporarily. The concept is based on the reinforcement learning technique named Q-learning. To evaluate the utility of individual radio channels, spectrum monitoring is performed. In the presented solution, the epsilon-greedy action selection method is used to indicate which channel should be monitored next. The article includes a description of the proposed algorithm, scenarios, metrics, and simulation results showing the correct operation of the approach relied upon to evaluate the utility of radio channels and the epsilon-greedy action selection method. Based on the performed tests, it is possible to determine algorithm parameters that should be used in this proposed deployment. The paper also presents a comparison of the results with two other action selection methods. Keywords—cognitive radio, dynamic spectrum access, spectrum monitoring, machine learning, Q-learning.","PeriodicalId":227678,"journal":{"name":"Journal of Telecommunictions and Information Technology","volume":"52 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Telecommunictions and Information Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.26636/jtit.2021.153621","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

This paper presents an algorithm that supports the dynamic spectrum access process in cognitive radio networks by generating a sorted list of best radio channels or by identifying those frequency ranges that are not in use temporarily. The concept is based on the reinforcement learning technique named Q-learning. To evaluate the utility of individual radio channels, spectrum monitoring is performed. In the presented solution, the epsilon-greedy action selection method is used to indicate which channel should be monitored next. The article includes a description of the proposed algorithm, scenarios, metrics, and simulation results showing the correct operation of the approach relied upon to evaluate the utility of radio channels and the epsilon-greedy action selection method. Based on the performed tests, it is possible to determine algorithm parameters that should be used in this proposed deployment. The paper also presents a comparison of the results with two other action selection methods. Keywords—cognitive radio, dynamic spectrum access, spectrum monitoring, machine learning, Q-learning.
用Epsilon-Greedy动作选择评价无线电信道效用
本文提出了一种算法,该算法通过生成最佳无线电信道的排序列表或识别暂时不使用的频率范围来支持认知无线电网络中的动态频谱访问过程。这个概念是基于被称为Q-learning的强化学习技术。为了评估单个无线电信道的效用,进行了频谱监测。在提出的解决方案中,使用贪心动作选择方法来指示下一步应该监视哪个通道。本文包括对所提出的算法、场景、指标的描述,以及显示该方法的正确操作的仿真结果,该方法依赖于评估无线电信道的效用和epsilon贪婪动作选择方法。根据执行的测试,可以确定在此建议部署中应使用的算法参数。本文还将结果与另外两种动作选择方法进行了比较。关键词:认知无线电,动态频谱接入,频谱监测,机器学习,q -学习。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信