A 5G-TSN joint resource scheduling algorithm based on optimized deep reinforcement learning model for industrial networks

IF 4.4 3区计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS

Ad Hoc Networks Pub Date : 2025-01-26 DOI:10.1016/j.adhoc.2025.103783

Yang Zhang , Lei Sun , Zhangchao Ma , Jianquan Wang , Meixia Fu , Jinoo Joung

{"title":"A 5G-TSN joint resource scheduling algorithm based on optimized deep reinforcement learning model for industrial networks","authors":"Yang Zhang , Lei Sun , Zhangchao Ma , Jianquan Wang , Meixia Fu , Jinoo Joung","doi":"10.1016/j.adhoc.2025.103783","DOIUrl":null,"url":null,"abstract":"<div><div>As the Industrial Internet of Things (IIoT) evolves, the rapid growth of connected devices in industrial networks generates massive amounts of data. These transmissions impose stringent requirements on network communications, including reliable bounded latency and high throughput. To address these challenges, the integration of the fifth-generation (5G) mobile cellular networks and Time-Sensitive Networking (TSN) has emerged as a prominent solution for scheduling diverse traffic flows. While Deep Reinforcement Learning (DRL) algorithms have been widely employed to tackle scheduling issues within the 5G-TSN architecture, existing approaches often neglect throughput optimization in multi-user scenarios and the impact of Channel Quality Indicators (CQI) on resource allocation. To overcome these limitations, this study introduces ME-DDPG, a novel joint resource scheduling algorithm. ME-DDPG extends the Deep Deterministic Policy Gradient (DDPG) model by embedding a Modulation and Coding Scheme (MCS)-based priority scheme. This improvement in computational efficiency is critical for real-time scheduling in IIoT environments. Specifically, ME-DDPG provides latency guarantees for time-triggered applications, ensures throughput for video applications, and maximizes overall system throughput across 5 G and TSN domains. Simulation results demonstrate that the proposed ME-DDPG achieves 100 % latency reliability for time-triggered flows and improves system throughput by 10.84 % over existing algorithms under varying Gate Control List (GCL) configurations and user ratios. Furthermore, due to the combination of MCS-based resource allocation scheme with DDPG model, the proposed ME-DDPG achieves faster convergence speed of the reward function compared to the original DDPG method.</div></div>","PeriodicalId":55555,"journal":{"name":"Ad Hoc Networks","volume":"170 ","pages":"Article 103783"},"PeriodicalIF":4.4000,"publicationDate":"2025-01-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ad Hoc Networks","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1570870525000319","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

As the Industrial Internet of Things (IIoT) evolves, the rapid growth of connected devices in industrial networks generates massive amounts of data. These transmissions impose stringent requirements on network communications, including reliable bounded latency and high throughput. To address these challenges, the integration of the fifth-generation (5G) mobile cellular networks and Time-Sensitive Networking (TSN) has emerged as a prominent solution for scheduling diverse traffic flows. While Deep Reinforcement Learning (DRL) algorithms have been widely employed to tackle scheduling issues within the 5G-TSN architecture, existing approaches often neglect throughput optimization in multi-user scenarios and the impact of Channel Quality Indicators (CQI) on resource allocation. To overcome these limitations, this study introduces ME-DDPG, a novel joint resource scheduling algorithm. ME-DDPG extends the Deep Deterministic Policy Gradient (DDPG) model by embedding a Modulation and Coding Scheme (MCS)-based priority scheme. This improvement in computational efficiency is critical for real-time scheduling in IIoT environments. Specifically, ME-DDPG provides latency guarantees for time-triggered applications, ensures throughput for video applications, and maximizes overall system throughput across 5 G and TSN domains. Simulation results demonstrate that the proposed ME-DDPG achieves 100 % latency reliability for time-triggered flows and improves system throughput by 10.84 % over existing algorithms under varying Gate Control List (GCL) configurations and user ratios. Furthermore, due to the combination of MCS-based resource allocation scheme with DDPG model, the proposed ME-DDPG achieves faster convergence speed of the reward function compared to the original DDPG method.

查看原文本刊更多论文

基于优化深度强化学习模型的工业网络5G-TSN联合资源调度算法

随着工业物联网（IIoT）的发展，工业网络中连接设备的快速增长产生了大量数据。这些传输对网络通信提出了严格的要求，包括可靠的有界延迟和高吞吐量。为了应对这些挑战，第五代（5G）移动蜂窝网络和时间敏感网络（TSN）的集成已经成为调度各种流量的重要解决方案。虽然深度强化学习（DRL）算法已被广泛用于解决5G-TSN架构中的调度问题，但现有方法往往忽略了多用户场景下的吞吐量优化以及信道质量指标（CQI）对资源分配的影响。为了克服这些局限性，本研究引入了一种新的联合资源调度算法ME-DDPG。ME-DDPG通过嵌入一个基于调制和编码方案（MCS）的优先级方案扩展了深度确定性策略梯度（DDPG）模型。计算效率的提高对于工业物联网环境中的实时调度至关重要。具体来说，ME-DDPG为时间触发型应用提供时延保障，为视频应用提供吞吐量保障，并在5g和TSN域实现系统整体吞吐量最大化。仿真结果表明，在不同的门控制列表（GCL）配置和用户比例下，所提出的ME-DDPG算法对时间触发流实现了100%的延迟可靠性，并将系统吞吐量提高了10.84%。此外，由于将基于mcs的资源分配方案与DDPG模型相结合，所提出的ME-DDPG比原DDPG方法实现了更快的奖励函数收敛速度。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Ad Hoc Networks 工程技术-电信学

CiteScore

10.20

自引率

4.20%

发文量

131

审稿时长

4.8 months

期刊介绍： The Ad Hoc Networks is an international and archival journal providing a publication vehicle for complete coverage of all topics of interest to those involved in ad hoc and sensor networking areas. The Ad Hoc Networks considers original, high quality and unpublished contributions addressing all aspects of ad hoc and sensor networks. Specific areas of interest include, but are not limited to: Mobile and Wireless Ad Hoc Networks Sensor Networks Wireless Local and Personal Area Networks Home Networks Ad Hoc Networks of Autonomous Intelligent Systems Novel Architectures for Ad Hoc and Sensor Networks Self-organizing Network Architectures and Protocols Transport Layer Protocols Routing protocols (unicast, multicast, geocast, etc.) Media Access Control Techniques Error Control Schemes Power-Aware, Low-Power and Energy-Efficient Designs Synchronization and Scheduling Issues Mobility Management Mobility-Tolerant Communication Protocols Location Tracking and Location-based Services Resource and Information Management Security and Fault-Tolerance Issues Hardware and Software Platforms, Systems, and Testbeds Experimental and Prototype Results Quality-of-Service Issues Cross-Layer Interactions Scalability Issues Performance Analysis and Simulation of Protocols.