{"title":"An Adaptive Service Function Chains Mapping With Multi-Task Deep Reinforcement Learning","authors":"Wenting Wei;Qingyi Wang;Huaxi Gu;Danyang Zheng;Ning Zhang;Celimuge Wu","doi":"10.1109/TNSE.2025.3556390","DOIUrl":null,"url":null,"abstract":"Network function virtualization (NFV) facilitates different virtual network functions (VNF) to be dynamically chained in sequence to offer new services in a flexible, scalable, and cost-effective manner. Recent years have witnessed the increasing diverse service demands from the ever-increasing new applications, which has posed significant challenges to the efficient and sequential execution of VNFs to achieve specific objectives, especially under conditions of shared resources. To address these challenges, substantial efforts have been dedicated to enhancing resource utilization and minimizing the costs associated with service function chains (SFCs), while maintaining high quality of service. However, an overemphasis on cost reduction can sometimes result in network congestion, which ultimately degrades both network performance and service quality. Given the time-varying and unpredictable characteristics of SFCs, it is essential to leverage their temporal features, along with those of network states, for adaptive SFC mapping. In this paper, we introduce an adaptive online SFC mapping algorithm to reduce operational costs and alleviate network congestion. This is achieved through the adaptive allocation of VNFs and the control of traffic routing between them. Our approach incorporates multi-task deep reinforcement learning to manage the coexistence of multiple SFC requests with varying resource requirements. Specifically, we integrate a long short-term memory (LSTM) layer into our model to capture the temporal dynamics of network states and resource demands, thereby enabling more effective long-term planning. To address the issue of reward sparsity, we implement a hierarchical reward mechanism and reward shaping techniques. Experimental results demonstrate that our algorithm achieves near-optimal performance in optimizing service delay, bandwidth consumption, and network congestion, while also ensuring a high acceptance rate for user requests.","PeriodicalId":54229,"journal":{"name":"IEEE Transactions on Network Science and Engineering","volume":"12 4","pages":"3093-3107"},"PeriodicalIF":6.7000,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Network Science and Engineering","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10946227/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
Network function virtualization (NFV) facilitates different virtual network functions (VNF) to be dynamically chained in sequence to offer new services in a flexible, scalable, and cost-effective manner. Recent years have witnessed the increasing diverse service demands from the ever-increasing new applications, which has posed significant challenges to the efficient and sequential execution of VNFs to achieve specific objectives, especially under conditions of shared resources. To address these challenges, substantial efforts have been dedicated to enhancing resource utilization and minimizing the costs associated with service function chains (SFCs), while maintaining high quality of service. However, an overemphasis on cost reduction can sometimes result in network congestion, which ultimately degrades both network performance and service quality. Given the time-varying and unpredictable characteristics of SFCs, it is essential to leverage their temporal features, along with those of network states, for adaptive SFC mapping. In this paper, we introduce an adaptive online SFC mapping algorithm to reduce operational costs and alleviate network congestion. This is achieved through the adaptive allocation of VNFs and the control of traffic routing between them. Our approach incorporates multi-task deep reinforcement learning to manage the coexistence of multiple SFC requests with varying resource requirements. Specifically, we integrate a long short-term memory (LSTM) layer into our model to capture the temporal dynamics of network states and resource demands, thereby enabling more effective long-term planning. To address the issue of reward sparsity, we implement a hierarchical reward mechanism and reward shaping techniques. Experimental results demonstrate that our algorithm achieves near-optimal performance in optimizing service delay, bandwidth consumption, and network congestion, while also ensuring a high acceptance rate for user requests.
期刊介绍:
The proposed journal, called the IEEE Transactions on Network Science and Engineering (TNSE), is committed to timely publishing of peer-reviewed technical articles that deal with the theory and applications of network science and the interconnections among the elements in a system that form a network. In particular, the IEEE Transactions on Network Science and Engineering publishes articles on understanding, prediction, and control of structures and behaviors of networks at the fundamental level. The types of networks covered include physical or engineered networks, information networks, biological networks, semantic networks, economic networks, social networks, and ecological networks. Aimed at discovering common principles that govern network structures, network functionalities and behaviors of networks, the journal seeks articles on understanding, prediction, and control of structures and behaviors of networks. Another trans-disciplinary focus of the IEEE Transactions on Network Science and Engineering is the interactions between and co-evolution of different genres of networks.