Call Admission Control in Wireless DS-CDMA Systems using Actor-Critic Reinforcement Learning

2007 2nd International Symposium on Wireless Pervasive Computing. Pub Date: 2007-04-16. DOI: 10.1109/ISWPC.2007.342590

Pitipong Chanloha, W. Usaha

Citations: 1

Abstract

This paper addresses the call admission control (CAC) problem for multiple services in the uplink of a cellular system using direct-sequence code division multiple access (DS-CDMA), taking into account the physical-layer channel and the receiver structure at the base station. The problem is formulated as a semi-Markov decision process (SMDP) with constraints on the blocking probabilities and the signal-to-interference ratio (SIR). The objective is to find a CAC policy that maximizes throughput while still satisfying these quality-of-service (QoS) constraints. To solve for a near-optimal CAC policy, an existing online decision-making algorithm based on actor-critic with temporal-difference learning is modified by parameterizing the reward signal to deal with the QoS constraints. The proposed algorithm circumvents the computational complexity experienced in conventional dynamic programming techniques.
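As a rough illustration of the idea in the abstract, the sketch below runs a tabular average-reward actor-critic with TD(0) updates on a toy two-class admission problem. It is not the authors' algorithm: the rates, the linear capacity rule standing in for the SIR constraint, the penalty weights `LAMBDA`, and all function names are assumptions made for illustration. The sketch also treats decision epochs as discrete steps and ignores the sojourn times that a full SMDP treatment would use.

```python
import math
import random

# All numbers below are illustrative assumptions, not values from the paper.
ARRIVAL = (1.0, 0.5)   # arrival rate per service class
SERVICE = (1.0, 0.5)   # per-call departure rate per class
LOAD = (1, 2)          # resource units per call (stand-in for the SIR constraint)
CAPACITY = 6           # admissible region: sum(LOAD[k] * n[k]) <= CAPACITY
REVENUE = (1.0, 2.0)   # throughput reward for admitting a class-k call
LAMBDA = (0.5, 0.5)    # penalty per blocked class-k call (the reward parameter)


def feasible(n, k):
    """True if admitting a class-k call keeps the system in the admissible region."""
    used = sum(LOAD[j] * n[j] for j in range(2))
    return used + LOAD[k] <= CAPACITY


def next_event(n, rng):
    """Sample the next event of the embedded chain: ('arr', k) or ('dep', k)."""
    rates = [ARRIVAL[0], ARRIVAL[1], n[0] * SERVICE[0], n[1] * SERVICE[1]]
    u = rng.random() * sum(rates)
    for i, r in enumerate(rates):
        if u < r:
            return ("arr" if i < 2 else "dep", i % 2)
        u -= r
    return ("arr", 0)  # fallback against floating-point rounding


def admit_prob(theta, key):
    """Actor: logistic admit/reject policy, one preference per (state, class)."""
    return 1.0 / (1.0 + math.exp(-theta.get(key, 0.0)))


def train(steps=20000, alpha=0.05, beta=0.01, seed=0):
    """Run the learner; return actor table, critic table, blocking estimates."""
    rng = random.Random(seed)
    theta, value = {}, {}      # actor preferences, critic values
    avg_reward = 0.0           # running average-reward estimate
    n = [0, 0]                 # active calls per class
    prev = None                # (key, grad of log-policy, reward) of last decision
    arrivals, blocked = [0, 0], [0, 0]
    for _ in range(steps):
        kind, k = next_event(n, rng)
        if kind == "dep":
            n[k] -= 1
            continue
        key = (n[0], n[1], k)
        if prev is not None:
            p_key, grad, reward = prev
            # Average-reward TD(0) error between consecutive decision epochs.
            delta = reward - avg_reward + value.get(key, 0.0) - value.get(p_key, 0.0)
            avg_reward += alpha * (reward - avg_reward)
            value[p_key] = value.get(p_key, 0.0) + alpha * delta      # critic step
            theta[p_key] = theta.get(p_key, 0.0) + beta * delta * grad  # actor step
        arrivals[k] += 1
        # Infeasible admissions are forced rejections (probability 0, zero gradient).
        p = admit_prob(theta, key) if feasible(n, k) else 0.0
        if rng.random() < p:
            n[k] += 1
            prev = (key, 1.0 - p, REVENUE[k])
        else:
            blocked[k] += 1
            prev = (key, -p, -LAMBDA[k])   # parameterized blocking penalty
    blocking = [b / max(a, 1) for a, b in zip(arrivals, blocked)]
    return theta, value, blocking


if __name__ == "__main__":
    _, _, blocking = train()
    print("estimated blocking probability per class:", blocking)
```

In this sketch, raising an entry of `LAMBDA` makes rejecting that class more costly, which is one simple way a parameterized reward can steer the learned policy toward a blocking-probability target.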


