战术自组网中基于深度强化学习的延迟感知TDMA调度

2020 International Conference on Information and Communication Technology Convergence (ICTC) Pub Date : 2020-10-21 DOI:10.1109/ICTC49870.2020.9289080

Gwan-sik Wi, Sunghwa Son, Kyung-Joon Park

{"title":"战术自组网中基于深度强化学习的延迟感知TDMA调度","authors":"Gwan-sik Wi, Sunghwa Son, Kyung-Joon Park","doi":"10.1109/ICTC49870.2020.9289080","DOIUrl":null,"url":null,"abstract":"In tactical networks, traffic should be delivered in a timely manner satisfying the quality of service (QoS) requirements for survivability and mission success. In this paper, we propose a centralized TDMA slot scheduling based on deep reinforcement learning (DRL) to guarantee the QoS requirements by minimizing end-to-end delay. We consider situations in which mission criticality of tactical traffic is dynamically changing. We introduce a DRL actor-critic algorithm to find a TDMA scheduling policy to minimize the weighted end-to-end delay which is a new metric reflecting the mission criticality of tactical traffic. The simulation results verify that the proposed scheduling policy can guarantee QoS requirements in tactical networks.","PeriodicalId":282243,"journal":{"name":"2020 International Conference on Information and Communication Technology Convergence (ICTC)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Delay-aware TDMA Scheduling with Deep Reinforcement Learning in Tactical MANET\",\"authors\":\"Gwan-sik Wi, Sunghwa Son, Kyung-Joon Park\",\"doi\":\"10.1109/ICTC49870.2020.9289080\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In tactical networks, traffic should be delivered in a timely manner satisfying the quality of service (QoS) requirements for survivability and mission success. In this paper, we propose a centralized TDMA slot scheduling based on deep reinforcement learning (DRL) to guarantee the QoS requirements by minimizing end-to-end delay. We consider situations in which mission criticality of tactical traffic is dynamically changing. We introduce a DRL actor-critic algorithm to find a TDMA scheduling policy to minimize the weighted end-to-end delay which is a new metric reflecting the mission criticality of tactical traffic. The simulation results verify that the proposed scheduling policy can guarantee QoS requirements in tactical networks.\",\"PeriodicalId\":282243,\"journal\":{\"name\":\"2020 International Conference on Information and Communication Technology Convergence (ICTC)\",\"volume\":\"19 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-10-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 International Conference on Information and Communication Technology Convergence (ICTC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICTC49870.2020.9289080\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Conference on Information and Communication Technology Convergence (ICTC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICTC49870.2020.9289080","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 5

摘要

在战术网络中，通信量应及时交付，以满足生存性和任务成功的服务质量(QoS)要求。在本文中，我们提出了一种基于深度强化学习(DRL)的集中式TDMA时隙调度，通过最小化端到端延迟来保证QoS要求。我们考虑了战术交通任务关键度动态变化的情况。提出了一种DRL actor- critical算法来寻找一种TDMA调度策略，使加权端到端延迟最小化，这是一种反映战术通信量任务临界性的新度量。仿真结果验证了所提出的调度策略能够保证战术网络的QoS要求。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Delay-aware TDMA Scheduling with Deep Reinforcement Learning in Tactical MANET

In tactical networks, traffic should be delivered in a timely manner satisfying the quality of service (QoS) requirements for survivability and mission success. In this paper, we propose a centralized TDMA slot scheduling based on deep reinforcement learning (DRL) to guarantee the QoS requirements by minimizing end-to-end delay. We consider situations in which mission criticality of tactical traffic is dynamically changing. We introduce a DRL actor-critic algorithm to find a TDMA scheduling policy to minimize the weighted end-to-end delay which is a new metric reflecting the mission criticality of tactical traffic. The simulation results verify that the proposed scheduling policy can guarantee QoS requirements in tactical networks.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2020 International Conference on Information and Communication Technology Convergence (ICTC)

自引率

0.00%

发文量