Near-real-time 6G service operation enabled by distributed intelligence and in-band telemetry

IF 4 2区计算机科学 Q1 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE

Journal of Optical Communications and Networking Pub Date : 2025-02-26 DOI:10.1364/JOCN.533789

P. Gonzalez;F. Alhamed;H. Shakespear-Miles;S. Barzegar;F. Paolucci;A. Sgambelluri;J. J. Vegas Olmos;M. Ruiz;L. Velasco

{"title":"Near-real-time 6G service operation enabled by distributed intelligence and in-band telemetry","authors":"P. Gonzalez;F. Alhamed;H. Shakespear-Miles;S. Barzegar;F. Paolucci;A. Sgambelluri;J. J. Vegas Olmos;M. Ruiz;L. Velasco","doi":"10.1364/JOCN.533789","DOIUrl":null,"url":null,"abstract":"The combination of highly dynamic network services requiring stringent quality of service (QoS), especially in terms of end-to-end (e2e) delay, together with capital and operational cost reduction cannot be faced using centralized software-defined networking (SDN) solutions only. In particular, such expected dynamicity requires autonomous near-real-time operation fed with pervasive telemetry to make per-service decisions that ensure the committed QoS, while reducing overprovisioning as much as possible. In this paper, we propose a distributed control architecture based on multi-agent systems (MASs) to assist the SDN controller in the control of network services near-real-time. Per-traffic flow telemetry data are collected from the packet nodes, distributed through the agents in the control plane, and analyzed to assure performance and to anticipate any degradation. Measurements feed flow agents, which are based on deep reinforcement learning (DRL) models, to make routing decisions aiming at ensuring flow performance. In the case when QoS degradation is detected, we propose algorithms to analyze its cause, which can be a result of some bottleneck in the network. We show how the latter is detected and additional capacity is requested of the SDN controller, which in turn creates an optical bypass to provide additional capacity. The proposed solution is demonstrated experimentally on a federated testbed connecting UPC and CNIT premises. Focused first on the control plane, the feasibility of the proposed architecture and workflows is experimentally assessed. After that, the performance of the near-real-time operation is evaluated at the data plane to verify that the maximum e2e delay is not exceeded for multiple flows, showing the effectiveness of predictive QoS evaluation together with infrastructure and service reconfiguration.","PeriodicalId":50103,"journal":{"name":"Journal of Optical Communications and Networking","volume":"17 3","pages":"A247-A258"},"PeriodicalIF":4.0000,"publicationDate":"2025-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Optical Communications and Networking","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10906307/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}

引用次数: 0

Abstract

The combination of highly dynamic network services requiring stringent quality of service (QoS), especially in terms of end-to-end (e2e) delay, together with capital and operational cost reduction cannot be faced using centralized software-defined networking (SDN) solutions only. In particular, such expected dynamicity requires autonomous near-real-time operation fed with pervasive telemetry to make per-service decisions that ensure the committed QoS, while reducing overprovisioning as much as possible. In this paper, we propose a distributed control architecture based on multi-agent systems (MASs) to assist the SDN controller in the control of network services near-real-time. Per-traffic flow telemetry data are collected from the packet nodes, distributed through the agents in the control plane, and analyzed to assure performance and to anticipate any degradation. Measurements feed flow agents, which are based on deep reinforcement learning (DRL) models, to make routing decisions aiming at ensuring flow performance. In the case when QoS degradation is detected, we propose algorithms to analyze its cause, which can be a result of some bottleneck in the network. We show how the latter is detected and additional capacity is requested of the SDN controller, which in turn creates an optical bypass to provide additional capacity. The proposed solution is demonstrated experimentally on a federated testbed connecting UPC and CNIT premises. Focused first on the control plane, the feasibility of the proposed architecture and workflows is experimentally assessed. After that, the performance of the near-real-time operation is evaluated at the data plane to verify that the maximum e2e delay is not exceeded for multiple flows, showing the effectiveness of predictive QoS evaluation together with infrastructure and service reconfiguration.

查看原文本刊更多论文

通过分布式智能和带内遥测实现近实时6G服务操作

高度动态的网络服务需要严格的服务质量（QoS），特别是在端到端（e2e）延迟方面，再加上资本和运营成本的降低，仅使用集中式软件定义网络（SDN）解决方案是无法解决的。特别是，这种预期的动态性需要自治的近实时操作，通过普遍的遥测来做出每个服务的决策，以确保所承诺的QoS，同时尽可能减少过度供应。本文提出了一种基于多智能体系统（MASs）的分布式控制体系结构，以帮助SDN控制器实现对网络服务的近实时控制。从数据包节点收集每个流量遥测数据，通过控制平面中的代理进行分发，并进行分析以确保性能并预测任何降级。流量代理基于深度强化学习（DRL）模型进行测量，以做出旨在确保流量性能的路由决策。在检测到QoS退化的情况下，我们提出了算法来分析其原因，这可能是由于网络中的某些瓶颈造成的。我们将展示如何检测到后者并向SDN控制器请求额外容量，SDN控制器反过来创建光bypass以提供额外容量。该方案在连接UPC和CNIT的联合测试台上进行了实验验证。首先关注控制平面，实验评估了所提出的架构和工作流程的可行性。之后，在数据平面对近实时操作的性能进行评估，验证多个流不超过最大端到端延迟，显示了预测QoS评估与基础设施和服务重构的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Optical Communications and Networking 工程技术-电信学

CiteScore

9.40

自引率

16.00%

发文量

104

审稿时长

4 months

期刊介绍： The scope of the Journal includes advances in the state-of-the-art of optical networking science, technology, and engineering. Both theoretical contributions (including new techniques, concepts, analyses, and economic studies) and practical contributions (including optical networking experiments, prototypes, and new applications) are encouraged. Subareas of interest include the architecture and design of optical networks, optical network survivability and security, software-defined optical networking, elastic optical networks, data and control plane advances, network management related innovation, and optical access networks. Enabling technologies and their applications are suitable topics only if the results are shown to directly impact optical networking beyond simple point-to-point networks.