Delay-aware TDMA Scheduling with Deep Reinforcement Learning in Tactical MANET

2020 International Conference on Information and Communication Technology Convergence (ICTC) Pub Date : 2020-10-21 DOI:10.1109/ICTC49870.2020.9289080

Gwan-sik Wi, Sunghwa Son, Kyung-Joon Park

引用次数: 5

Abstract

In tactical networks, traffic should be delivered in a timely manner satisfying the quality of service (QoS) requirements for survivability and mission success. In this paper, we propose a centralized TDMA slot scheduling based on deep reinforcement learning (DRL) to guarantee the QoS requirements by minimizing end-to-end delay. We consider situations in which mission criticality of tactical traffic is dynamically changing. We introduce a DRL actor-critic algorithm to find a TDMA scheduling policy to minimize the weighted end-to-end delay which is a new metric reflecting the mission criticality of tactical traffic. The simulation results verify that the proposed scheduling policy can guarantee QoS requirements in tactical networks.

查看原文本刊更多论文

战术自组网中基于深度强化学习的延迟感知TDMA调度

在战术网络中，通信量应及时交付，以满足生存性和任务成功的服务质量(QoS)要求。在本文中，我们提出了一种基于深度强化学习(DRL)的集中式TDMA时隙调度，通过最小化端到端延迟来保证QoS要求。我们考虑了战术交通任务关键度动态变化的情况。提出了一种DRL actor- critical算法来寻找一种TDMA调度策略，使加权端到端延迟最小化，这是一种反映战术通信量任务临界性的新度量。仿真结果验证了所提出的调度策略能够保证战术网络的QoS要求。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2020 International Conference on Information and Communication Technology Convergence (ICTC)

自引率

0.00%

发文量