混合合作/竞争动态频谱接入的强化学习

2019 IEEE International Symposium on Dynamic Spectrum Access Networks (DySPAN) Pub Date : 2019-11-01 DOI:10.1109/DySPAN.2019.8935725

C. Bowyer, David Greene, Tyler Ward, Marco Menéndez, J. Shea, T. Wong

{"title":"混合合作/竞争动态频谱接入的强化学习","authors":"C. Bowyer, David Greene, Tyler Ward, Marco Menéndez, J. Shea, T. Wong","doi":"10.1109/DySPAN.2019.8935725","DOIUrl":null,"url":null,"abstract":"A dynamic spectrum sharing problem with a mixed collaborative/competitive objective and partial information about peers’ performances that arises from the DARPA Spectrum Collaboration Challenge is considered. Because of the very high complexity of the problem and the enormous size of the state space, it is broken down into the subproblems of channel selection, flow admission control, and transmission schedule assignment. The channel selection problem is the focus of this paper. A reinforcement learning algorithm based on a reduced state is developed to select channels, and a neural network is used as a function approximator to fill in missing values in the resulting input-action matrix. The performance is compared with that obtained by a hand-tuned expert system.","PeriodicalId":278172,"journal":{"name":"2019 IEEE International Symposium on Dynamic Spectrum Access Networks (DySPAN)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"14","resultStr":"{\"title\":\"Reinforcement Learning for Mixed Cooperative/Competitive Dynamic Spectrum Access\",\"authors\":\"C. Bowyer, David Greene, Tyler Ward, Marco Menéndez, J. Shea, T. Wong\",\"doi\":\"10.1109/DySPAN.2019.8935725\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A dynamic spectrum sharing problem with a mixed collaborative/competitive objective and partial information about peers’ performances that arises from the DARPA Spectrum Collaboration Challenge is considered. Because of the very high complexity of the problem and the enormous size of the state space, it is broken down into the subproblems of channel selection, flow admission control, and transmission schedule assignment. The channel selection problem is the focus of this paper. A reinforcement learning algorithm based on a reduced state is developed to select channels, and a neural network is used as a function approximator to fill in missing values in the resulting input-action matrix. The performance is compared with that obtained by a hand-tuned expert system.\",\"PeriodicalId\":278172,\"journal\":{\"name\":\"2019 IEEE International Symposium on Dynamic Spectrum Access Networks (DySPAN)\",\"volume\":\"10 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"14\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 IEEE International Symposium on Dynamic Spectrum Access Networks (DySPAN)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/DySPAN.2019.8935725\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE International Symposium on Dynamic Spectrum Access Networks (DySPAN)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DySPAN.2019.8935725","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 14

摘要

考虑了DARPA频谱协作挑战中出现的具有混合协作/竞争目标和部分同行性能信息的动态频谱共享问题。由于该问题的复杂性和状态空间的巨大，将其分解为通道选择、流量接纳控制和传输调度分配等子问题。信道选择问题是本文研究的重点。提出了一种基于简化状态的强化学习算法来选择通道，并使用神经网络作为函数逼近器来填充结果输入-动作矩阵中的缺失值。并与手工调优专家系统的性能进行了比较。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Reinforcement Learning for Mixed Cooperative/Competitive Dynamic Spectrum Access

A dynamic spectrum sharing problem with a mixed collaborative/competitive objective and partial information about peers’ performances that arises from the DARPA Spectrum Collaboration Challenge is considered. Because of the very high complexity of the problem and the enormous size of the state space, it is broken down into the subproblems of channel selection, flow admission control, and transmission schedule assignment. The channel selection problem is the focus of this paper. A reinforcement learning algorithm based on a reduced state is developed to select channels, and a neural network is used as a function approximator to fill in missing values in the resulting input-action matrix. The performance is compared with that obtained by a hand-tuned expert system.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2019 IEEE International Symposium on Dynamic Spectrum Access Networks (DySPAN)

自引率

0.00%

发文量