J. Lundén, V. Koivunen, S. Kulkarni, H. Poor, Smarad CoE
{"title":"基于强化学习的认知无线网络分布式多智能体感知策略","authors":"J. Lundén, V. Koivunen, S. Kulkarni, H. Poor, Smarad CoE","doi":"10.1109/DYSPAN.2011.5936261","DOIUrl":null,"url":null,"abstract":"In this paper a distributed multiagent, multiband reinforcement learning based sensing policy for cognitive radio ad hoc networks is proposed. The proposed sensing policy employs secondary user (SU) collaboration through local interactions. The goal is to maximize the amount of available spectrum found for secondary use given a desired diversity order, i.e. a desired number of SUs sensing simultaneously each frequency band. The SUs in the cognitive radio network make local decisions based on their own and their neighbors' local test statistics or decisions to identify unused spectrum locally. Thus, the network builds a locally available map of spectrum occupancy of its geographical area. Simulation results show that the proposed sensing policy provides a significant increase in the amount of available spectrum found for secondary use compared to a random sensing policy.","PeriodicalId":119856,"journal":{"name":"2011 IEEE International Symposium on Dynamic Spectrum Access Networks (DySPAN)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-05-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"48","resultStr":"{\"title\":\"Reinforcement learning based distributed multiagent sensing policy for cognitive radio networks\",\"authors\":\"J. Lundén, V. Koivunen, S. Kulkarni, H. Poor, Smarad CoE\",\"doi\":\"10.1109/DYSPAN.2011.5936261\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper a distributed multiagent, multiband reinforcement learning based sensing policy for cognitive radio ad hoc networks is proposed. The proposed sensing policy employs secondary user (SU) collaboration through local interactions. The goal is to maximize the amount of available spectrum found for secondary use given a desired diversity order, i.e. a desired number of SUs sensing simultaneously each frequency band. The SUs in the cognitive radio network make local decisions based on their own and their neighbors' local test statistics or decisions to identify unused spectrum locally. Thus, the network builds a locally available map of spectrum occupancy of its geographical area. Simulation results show that the proposed sensing policy provides a significant increase in the amount of available spectrum found for secondary use compared to a random sensing policy.\",\"PeriodicalId\":119856,\"journal\":{\"name\":\"2011 IEEE International Symposium on Dynamic Spectrum Access Networks (DySPAN)\",\"volume\":\"16 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2011-05-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"48\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2011 IEEE International Symposium on Dynamic Spectrum Access Networks (DySPAN)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/DYSPAN.2011.5936261\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 IEEE International Symposium on Dynamic Spectrum Access Networks (DySPAN)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DYSPAN.2011.5936261","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Reinforcement learning based distributed multiagent sensing policy for cognitive radio networks
In this paper a distributed multiagent, multiband reinforcement learning based sensing policy for cognitive radio ad hoc networks is proposed. The proposed sensing policy employs secondary user (SU) collaboration through local interactions. The goal is to maximize the amount of available spectrum found for secondary use given a desired diversity order, i.e. a desired number of SUs sensing simultaneously each frequency band. The SUs in the cognitive radio network make local decisions based on their own and their neighbors' local test statistics or decisions to identify unused spectrum locally. Thus, the network builds a locally available map of spectrum occupancy of its geographical area. Simulation results show that the proposed sensing policy provides a significant increase in the amount of available spectrum found for secondary use compared to a random sensing policy.