{"title":"Decentralized cognitive MAC protocol design based on POMDP and Q-Learning","authors":"Zhongli Lan, Hong Jiang, Xiaoli Wu","doi":"10.1109/ChinaCom.2012.6417543","DOIUrl":null,"url":null,"abstract":"A decentralized cognitive MAC protocol is proposed in this paper, whose core depends on the partially observed Markov process (POMDP) and Q-Learning. Limited by the hardware and environment, a secondary user may not be able to have ability to sense the entire spectrum space. Therefore, the POMDP is exploited to model the secondary network. In this paper, Q-Learning is applied to solve the POMDP because it can make full advantage of the past observation and decision experiences to optimize current action, and needs not transfer the POMDP into a belief MDP. The numeral simulation results show that Q-Learning-based decentralized cognitive MAC protocol improves the overall performance of the networks.","PeriodicalId":143739,"journal":{"name":"7th International Conference on Communications and Networking in China","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"7th International Conference on Communications and Networking in China","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ChinaCom.2012.6417543","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 11
Abstract
A decentralized cognitive MAC protocol is proposed in this paper, whose core depends on the partially observed Markov process (POMDP) and Q-Learning. Limited by the hardware and environment, a secondary user may not be able to have ability to sense the entire spectrum space. Therefore, the POMDP is exploited to model the secondary network. In this paper, Q-Learning is applied to solve the POMDP because it can make full advantage of the past observation and decision experiences to optimize current action, and needs not transfer the POMDP into a belief MDP. The numeral simulation results show that Q-Learning-based decentralized cognitive MAC protocol improves the overall performance of the networks.