An Index Policy for Dynamic Fading-Channel Allocation to Heterogeneous Mobile Users with Partial Observations

J. Nio-Mora
{"title":"An Index Policy for Dynamic Fading-Channel Allocation to Heterogeneous Mobile Users with Partial Observations","authors":"J. Nio-Mora","doi":"10.1109/NGI.2008.38","DOIUrl":null,"url":null,"abstract":"This paper addresses a system model where multiple heterogeneous mobile users share a common wireless channel to a base station, extending work of Koole et al. (2001) which considered the corresponding homogeneous-user case. The channel's bandwidth is to be dynamically allocated to different users, based on partial information on their connectivity status. During each time slot, a user may or may not be connected to the base station, where the Gilbert-Elliott model is assumed that a user's connectivity evolves as a two-state (on-off) Markov chain. Transmitting from/to a user reveals its true connectivity status, while those of other users are only partially known by tracking their probabilities of being connected. The goal is to design a tractable dynamic channel allocation policy that comes close to maximizing the infinite-horizon discounted or long-run average value of the through put minus transmission costs. The paper exploits a restless bandit problem formulation drawing on and extending to the Partially Observed Markov Decision Process (POMDP) setting the powerful indexation theory introduced by Whittle (1988) and developed by the author, to obtain a new dynamic priority-index policy that is readily implementable. Computational results are presented showing that the proposed policy can substantially outperform the conventional greedy policy in instances with two heterogeneous users.","PeriodicalId":182496,"journal":{"name":"2008 Next Generation Internet Networks","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"26","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 Next Generation Internet Networks","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NGI.2008.38","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 26

Abstract

This paper addresses a system model where multiple heterogeneous mobile users share a common wireless channel to a base station, extending work of Koole et al. (2001) which considered the corresponding homogeneous-user case. The channel's bandwidth is to be dynamically allocated to different users, based on partial information on their connectivity status. During each time slot, a user may or may not be connected to the base station, where the Gilbert-Elliott model is assumed that a user's connectivity evolves as a two-state (on-off) Markov chain. Transmitting from/to a user reveals its true connectivity status, while those of other users are only partially known by tracking their probabilities of being connected. The goal is to design a tractable dynamic channel allocation policy that comes close to maximizing the infinite-horizon discounted or long-run average value of the through put minus transmission costs. The paper exploits a restless bandit problem formulation drawing on and extending to the Partially Observed Markov Decision Process (POMDP) setting the powerful indexation theory introduced by Whittle (1988) and developed by the author, to obtain a new dynamic priority-index policy that is readily implementable. Computational results are presented showing that the proposed policy can substantially outperform the conventional greedy policy in instances with two heterogeneous users.
基于部分观测的异构移动用户动态衰落信道分配索引策略
本文讨论了一个系统模型,其中多个异构移动用户共享到基站的公共无线信道,扩展了Koole等人(2001)的工作,该工作考虑了相应的同质用户情况。信道的带宽将根据用户连接状态的部分信息动态地分配给不同的用户。在每个时隙中,用户可能连接到基站,也可能不连接到基站,其中Gilbert-Elliott模型假设用户的连接演变为两状态(开-关)马尔可夫链。从用户发送/发送到用户显示其真实连接状态,而其他用户只能通过跟踪其连接概率来部分了解其连接状态。目标是设计一种易于处理的动态信道分配策略,该策略接近于最大化通过投入减去传输成本的无限地平线贴现或长期平均值。本文利用Whittle(1988)提出并由作者发展的强大指数化理论,在部分可观察马尔可夫决策过程(POMDP)的基础上,利用一个不安分的强盗问题公式,得到了一种易于实现的新的动态优先级-指数化策略。计算结果表明,在两个异构用户的情况下,该策略的性能明显优于传统的贪婪策略。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信