Age Minimization of Multiple Flows using Reinforcement Learning

2019 International Conference on Computing, Networking and Communications (ICNC) Pub Date : 2019-02-01 DOI:10.1109/ICCNC.2019.8685524

Hasan Burhan Beytur, E. Uysal-Biyikoglu

引用次数: 28

Abstract

Age of Information (AoI) is a recently proposed performance metric measuring the freshness of data at the receiving side of a flow. This metric is particularly suited to status-update type information flows, like those occurring in machine-type communication (MTC), remote monitoring and similar applications. In this paper, we consider the problem of AoI-optimal scheduling of multiple flows served by a single server. The performance of scheduling algorithms proposed in previous literature has been shown under limited assumptions, due to the analytical intractability of the problem. The goal of this paper is to apply reinforcement learning methods to achieve scheduling decisions that are resilient to network conditions and packet arrival processes. Specifically, Policy Gradients and Deep Q-Learning methods are employed. These can adapt to the network without a priori knowledge of its parameters. We study the resulting performance relative to a benchmark, the MAF algorithm, which is known to be optimal under certain conditions.

查看原文本刊更多论文

使用强化学习的多流年龄最小化

信息时代(Age of Information, AoI)是最近提出的一种性能指标，用于衡量流接收端数据的新鲜度。这个度量特别适合于状态更新类型的信息流，比如发生在机器类型通信(MTC)、远程监控和类似应用程序中的信息流。本文研究了由单个服务器服务的多流的aoi最优调度问题。由于问题的分析难解性，以往文献中提出的调度算法的性能在有限的假设下得到了证明。本文的目标是应用强化学习方法来实现对网络条件和数据包到达过程具有弹性的调度决策。具体来说，使用了策略梯度和深度Q-Learning方法。它们可以适应网络，而不需要先验的参数知识。我们研究了相对于基准的结果性能，MAF算法，已知在某些条件下是最优的。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2019 International Conference on Computing, Networking and Communications (ICNC)

自引率

0.00%

发文量