一种改进的基于强化学习的多媒体无线网络呼叫接纳控制方案

2009 International Conference on Wireless Networks and Information Systems Pub Date : 2009-12-28 DOI:10.1109/WNIS.2009.91

Yueyun Chen, Cuixia Jia

{"title":"一种改进的基于强化学习的多媒体无线网络呼叫接纳控制方案","authors":"Yueyun Chen, Cuixia Jia","doi":"10.1109/WNIS.2009.91","DOIUrl":null,"url":null,"abstract":"This paper presents an improved call admission control scheme to optimize the network operators’ revenue while guarantying the quality of service (QoS) to the mobile terminals. The problem of call admission control (CAC) is modeled as a Semi-Markov decision process (SMDP), and the SMDP is solved by a reinforcement learning (RL) algorithm known as Q-learning. In the Q-learning algorithm, the reward functions for the acceptance and the rejection of new calls for each class of service not only depend on used bandwidth, new call arrival rate, average service time and price, but also the ratio of new call load and the handoff call load and the requested bandwidth of each class of traffic. The CAC scheme would be well performed through the reward functions. Simulations results show that the CAC scheme can obtain high revenue while greatly reducing handoff call dropping probability when the traffic loads are heavy.","PeriodicalId":280001,"journal":{"name":"2009 International Conference on Wireless Networks and Information Systems","volume":"25 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-12-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"An Improved Call Admission Control Scheme Based on Reinforcement Learning for Multimedia Wireless Networks\",\"authors\":\"Yueyun Chen, Cuixia Jia\",\"doi\":\"10.1109/WNIS.2009.91\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper presents an improved call admission control scheme to optimize the network operators’ revenue while guarantying the quality of service (QoS) to the mobile terminals. The problem of call admission control (CAC) is modeled as a Semi-Markov decision process (SMDP), and the SMDP is solved by a reinforcement learning (RL) algorithm known as Q-learning. In the Q-learning algorithm, the reward functions for the acceptance and the rejection of new calls for each class of service not only depend on used bandwidth, new call arrival rate, average service time and price, but also the ratio of new call load and the handoff call load and the requested bandwidth of each class of traffic. The CAC scheme would be well performed through the reward functions. Simulations results show that the CAC scheme can obtain high revenue while greatly reducing handoff call dropping probability when the traffic loads are heavy.\",\"PeriodicalId\":280001,\"journal\":{\"name\":\"2009 International Conference on Wireless Networks and Information Systems\",\"volume\":\"25 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2009-12-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2009 International Conference on Wireless Networks and Information Systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/WNIS.2009.91\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 International Conference on Wireless Networks and Information Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WNIS.2009.91","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

摘要

本文提出了一种改进的呼叫接纳控制方案，在保证移动终端服务质量的同时，优化网络运营商的收入。将呼叫接纳控制(CAC)问题建模为一个半马尔可夫决策过程(SMDP)，并采用Q-learning强化学习(RL)算法求解SMDP。在q -学习算法中，每一类业务接受和拒绝新呼叫的奖励函数不仅取决于已使用的带宽、新呼叫到达率、平均服务时间和价格，还取决于每一类流量的新呼叫负载与切换呼叫负载的比率和请求带宽。通过奖励函数可以很好地执行CAC方案。仿真结果表明，在业务量较大的情况下，CAC方案可以获得较高的收益，同时大大降低切换呼叫掉线概率。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

An Improved Call Admission Control Scheme Based on Reinforcement Learning for Multimedia Wireless Networks

This paper presents an improved call admission control scheme to optimize the network operators’ revenue while guarantying the quality of service (QoS) to the mobile terminals. The problem of call admission control (CAC) is modeled as a Semi-Markov decision process (SMDP), and the SMDP is solved by a reinforcement learning (RL) algorithm known as Q-learning. In the Q-learning algorithm, the reward functions for the acceptance and the rejection of new calls for each class of service not only depend on used bandwidth, new call arrival rate, average service time and price, but also the ratio of new call load and the handoff call load and the requested bandwidth of each class of traffic. The CAC scheme would be well performed through the reward functions. Simulations results show that the CAC scheme can obtain high revenue while greatly reducing handoff call dropping probability when the traffic loads are heavy.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2009 International Conference on Wireless Networks and Information Systems

自引率

0.00%

发文量