{"title":"基于强化学习的时间敏感网络的可靠路由和调度","authors":"Hao Cheng;Lei Yang;Qingfeng Zhang;Weiping Zhu","doi":"10.1109/TNSE.2025.3546100","DOIUrl":null,"url":null,"abstract":"Time Sensitive Network (TSN) provides strict low latency and bounded jitter requirements for applications such as industrial systems, autonomous driving, etc. One of the important problems in TSN is to achieve high reliability and low latency by effectively routing and scheduling time-sensitive data flows. Existing work applies heuristic or integer programming to address flow routing and scheduling, yet often fail to achieve optimal solutions quickly. In this paper, we propose a new Reinforcement Learning (RL) based approach for routing and scheduling of redundant data flows, aiming to achieve load balancing on the network links as well as meeting the reliability and delay constraints. Our approach first leverages a simple heuristic algorithm to decide the redundant path candidate set, and then incorporates Proximal Policy Optimization (PPO) method to choose the most suitable multi-routing flows from the candidates, which can be aware of the network status dynamically to reduce the load on the bottleneck link of the network. On this basis, we further retrain the RL model by fine-tuning to adapt to the online environment. The simulation results show that our proposed solution outperforms the benchmark algorithms in terms of the degree of network balance by 38.7% in offline network environments and in terms of average delay by 14.0% in online network environments.","PeriodicalId":54229,"journal":{"name":"IEEE Transactions on Network Science and Engineering","volume":"12 4","pages":"2415-2427"},"PeriodicalIF":6.7000,"publicationDate":"2025-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Reliable Routing and Scheduling in Time Sensitive Networks Based on Reinforcement Learning\",\"authors\":\"Hao Cheng;Lei Yang;Qingfeng Zhang;Weiping Zhu\",\"doi\":\"10.1109/TNSE.2025.3546100\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Time Sensitive Network (TSN) provides strict low latency and bounded jitter requirements for applications such as industrial systems, autonomous driving, etc. One of the important problems in TSN is to achieve high reliability and low latency by effectively routing and scheduling time-sensitive data flows. Existing work applies heuristic or integer programming to address flow routing and scheduling, yet often fail to achieve optimal solutions quickly. In this paper, we propose a new Reinforcement Learning (RL) based approach for routing and scheduling of redundant data flows, aiming to achieve load balancing on the network links as well as meeting the reliability and delay constraints. Our approach first leverages a simple heuristic algorithm to decide the redundant path candidate set, and then incorporates Proximal Policy Optimization (PPO) method to choose the most suitable multi-routing flows from the candidates, which can be aware of the network status dynamically to reduce the load on the bottleneck link of the network. On this basis, we further retrain the RL model by fine-tuning to adapt to the online environment. The simulation results show that our proposed solution outperforms the benchmark algorithms in terms of the degree of network balance by 38.7% in offline network environments and in terms of average delay by 14.0% in online network environments.\",\"PeriodicalId\":54229,\"journal\":{\"name\":\"IEEE Transactions on Network Science and Engineering\",\"volume\":\"12 4\",\"pages\":\"2415-2427\"},\"PeriodicalIF\":6.7000,\"publicationDate\":\"2025-03-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Network Science and Engineering\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10914503/\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Network Science and Engineering","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10914503/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}
Reliable Routing and Scheduling in Time Sensitive Networks Based on Reinforcement Learning
Time Sensitive Network (TSN) provides strict low latency and bounded jitter requirements for applications such as industrial systems, autonomous driving, etc. One of the important problems in TSN is to achieve high reliability and low latency by effectively routing and scheduling time-sensitive data flows. Existing work applies heuristic or integer programming to address flow routing and scheduling, yet often fail to achieve optimal solutions quickly. In this paper, we propose a new Reinforcement Learning (RL) based approach for routing and scheduling of redundant data flows, aiming to achieve load balancing on the network links as well as meeting the reliability and delay constraints. Our approach first leverages a simple heuristic algorithm to decide the redundant path candidate set, and then incorporates Proximal Policy Optimization (PPO) method to choose the most suitable multi-routing flows from the candidates, which can be aware of the network status dynamically to reduce the load on the bottleneck link of the network. On this basis, we further retrain the RL model by fine-tuning to adapt to the online environment. The simulation results show that our proposed solution outperforms the benchmark algorithms in terms of the degree of network balance by 38.7% in offline network environments and in terms of average delay by 14.0% in online network environments.
期刊介绍:
The proposed journal, called the IEEE Transactions on Network Science and Engineering (TNSE), is committed to timely publishing of peer-reviewed technical articles that deal with the theory and applications of network science and the interconnections among the elements in a system that form a network. In particular, the IEEE Transactions on Network Science and Engineering publishes articles on understanding, prediction, and control of structures and behaviors of networks at the fundamental level. The types of networks covered include physical or engineered networks, information networks, biological networks, semantic networks, economic networks, social networks, and ecological networks. Aimed at discovering common principles that govern network structures, network functionalities and behaviors of networks, the journal seeks articles on understanding, prediction, and control of structures and behaviors of networks. Another trans-disciplinary focus of the IEEE Transactions on Network Science and Engineering is the interactions between and co-evolution of different genres of networks.