{"title":"一种改进的基于强化学习的多媒体无线网络呼叫接纳控制方案","authors":"Yueyun Chen, Cuixia Jia","doi":"10.1109/WNIS.2009.91","DOIUrl":null,"url":null,"abstract":"This paper presents an improved call admission control scheme to optimize the network operators’ revenue while guarantying the quality of service (QoS) to the mobile terminals. The problem of call admission control (CAC) is modeled as a Semi-Markov decision process (SMDP), and the SMDP is solved by a reinforcement learning (RL) algorithm known as Q-learning. In the Q-learning algorithm, the reward functions for the acceptance and the rejection of new calls for each class of service not only depend on used bandwidth, new call arrival rate, average service time and price, but also the ratio of new call load and the handoff call load and the requested bandwidth of each class of traffic. The CAC scheme would be well performed through the reward functions. Simulations results show that the CAC scheme can obtain high revenue while greatly reducing handoff call dropping probability when the traffic loads are heavy.","PeriodicalId":280001,"journal":{"name":"2009 International Conference on Wireless Networks and Information Systems","volume":"25 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-12-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"An Improved Call Admission Control Scheme Based on Reinforcement Learning for Multimedia Wireless Networks\",\"authors\":\"Yueyun Chen, Cuixia Jia\",\"doi\":\"10.1109/WNIS.2009.91\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper presents an improved call admission control scheme to optimize the network operators’ revenue while guarantying the quality of service (QoS) to the mobile terminals. The problem of call admission control (CAC) is modeled as a Semi-Markov decision process (SMDP), and the SMDP is solved by a reinforcement learning (RL) algorithm known as Q-learning. In the Q-learning algorithm, the reward functions for the acceptance and the rejection of new calls for each class of service not only depend on used bandwidth, new call arrival rate, average service time and price, but also the ratio of new call load and the handoff call load and the requested bandwidth of each class of traffic. The CAC scheme would be well performed through the reward functions. Simulations results show that the CAC scheme can obtain high revenue while greatly reducing handoff call dropping probability when the traffic loads are heavy.\",\"PeriodicalId\":280001,\"journal\":{\"name\":\"2009 International Conference on Wireless Networks and Information Systems\",\"volume\":\"25 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2009-12-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2009 International Conference on Wireless Networks and Information Systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/WNIS.2009.91\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 International Conference on Wireless Networks and Information Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WNIS.2009.91","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
An Improved Call Admission Control Scheme Based on Reinforcement Learning for Multimedia Wireless Networks
This paper presents an improved call admission control scheme to optimize the network operators’ revenue while guarantying the quality of service (QoS) to the mobile terminals. The problem of call admission control (CAC) is modeled as a Semi-Markov decision process (SMDP), and the SMDP is solved by a reinforcement learning (RL) algorithm known as Q-learning. In the Q-learning algorithm, the reward functions for the acceptance and the rejection of new calls for each class of service not only depend on used bandwidth, new call arrival rate, average service time and price, but also the ratio of new call load and the handoff call load and the requested bandwidth of each class of traffic. The CAC scheme would be well performed through the reward functions. Simulations results show that the CAC scheme can obtain high revenue while greatly reducing handoff call dropping probability when the traffic loads are heavy.