Multi-Agent DRL-Based Two-Timescale Resource Allocation for Network Slicing in V2X Communications

IF 4.7 2区计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS

IEEE Transactions on Network and Service Management Pub Date : 2024-09-05 DOI:10.1109/TNSM.2024.3454758

Binbin Lu;Yuan Wu;Liping Qian;Sheng Zhou;Haixia Zhang;Rongxing Lu

{"title":"Multi-Agent DRL-Based Two-Timescale Resource Allocation for Network Slicing in V2X Communications","authors":"Binbin Lu;Yuan Wu;Liping Qian;Sheng Zhou;Haixia Zhang;Rongxing Lu","doi":"10.1109/TNSM.2024.3454758","DOIUrl":null,"url":null,"abstract":"Network slicing has been envisioned to play a crucial role in supporting various vehicular applications with diverse performance requirements in dynamic Vehicle-to-Everything (V2X) communications systems. However, time-varying Service Level Agreements (SLAs) of slices and fast-changing network topologies in V2X scenarios may introduce new challenges for enabling efficient inter-slice resource provisioning to guarantee the Quality of Service (QoS) while avoiding both resource over-provisioning and under-provisioning. Moreover, the conventional centralized resource allocation schemes requiring global slice information may degrade the data privacy provided by dedicated resource provisioning. To address these challenges, in this paper, we propose a two-timescale resource management mechanism for providing diverse V2X slices with customized resources. In the long timescale, we propose a Proximal Policy Optimization-based multi-agent deep reinforcement learning algorithm for dynamically allocating bandwidth resources to different slices for guaranteeing their SLAs. Under the coordination of agents, each agent only observes its partial state space rather than the global information to adjust the resource requests, which can enhance the privacy protection. Moreover, an expert demonstration mechanism is proposed to guide the action policy for reducing the invalid action exploration and accelerating the convergence of agents. In the short-term time slot, with our proposed Cross Entropy and Successive Convex Approximation algorithm, each slice allocates its available physical resource blocks and optimizes its transmit power to meet the QoS. Simulation results show our proposed two-timescale resource allocation scheme for network slicing can achieve maximum 8.4% performance gains in terms of spectral efficiency while guaranteeing the QoS requirements of users compared to the baseline approaches.","PeriodicalId":13423,"journal":{"name":"IEEE Transactions on Network and Service Management","volume":"21 6","pages":"6744-6758"},"PeriodicalIF":4.7000,"publicationDate":"2024-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Network and Service Management","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10666879/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

Network slicing has been envisioned to play a crucial role in supporting various vehicular applications with diverse performance requirements in dynamic Vehicle-to-Everything (V2X) communications systems. However, time-varying Service Level Agreements (SLAs) of slices and fast-changing network topologies in V2X scenarios may introduce new challenges for enabling efficient inter-slice resource provisioning to guarantee the Quality of Service (QoS) while avoiding both resource over-provisioning and under-provisioning. Moreover, the conventional centralized resource allocation schemes requiring global slice information may degrade the data privacy provided by dedicated resource provisioning. To address these challenges, in this paper, we propose a two-timescale resource management mechanism for providing diverse V2X slices with customized resources. In the long timescale, we propose a Proximal Policy Optimization-based multi-agent deep reinforcement learning algorithm for dynamically allocating bandwidth resources to different slices for guaranteeing their SLAs. Under the coordination of agents, each agent only observes its partial state space rather than the global information to adjust the resource requests, which can enhance the privacy protection. Moreover, an expert demonstration mechanism is proposed to guide the action policy for reducing the invalid action exploration and accelerating the convergence of agents. In the short-term time slot, with our proposed Cross Entropy and Successive Convex Approximation algorithm, each slice allocates its available physical resource blocks and optimizes its transmit power to meet the QoS. Simulation results show our proposed two-timescale resource allocation scheme for network slicing can achieve maximum 8.4% performance gains in terms of spectral efficiency while guaranteeing the QoS requirements of users compared to the baseline approaches.

查看原文本刊更多论文

基于 DRL 的多代理双时标资源分配用于 V2X 通信中的网络分片

网络切片被设想为在支持动态车联网（V2X）通信系统中具有不同性能要求的各种车辆应用方面发挥关键作用。然而，在V2X场景中，切片的时变服务水平协议（sla）和快速变化的网络拓扑可能会给实现高效的片间资源配置带来新的挑战，从而保证服务质量（QoS），同时避免资源过度配置和不足配置。此外，传统的集中式资源分配方案需要全局片信息，可能会降低专用资源分配所提供的数据保密性。为了应对这些挑战，在本文中，我们提出了一种双时间尺度资源管理机制，用于提供具有定制资源的各种V2X片。在长时间尺度下，我们提出了一种基于近端策略优化的多智能体深度强化学习算法，用于动态分配带宽资源到不同的片，以保证其sla。在agent协调下，每个agent只观察其局部状态空间而不是全局信息来调整资源请求，增强了对隐私的保护。此外，提出了一种专家示范机制来指导行动策略，以减少无效行动探索，加速智能体的收敛。在短时时隙内，利用交叉熵和连续凸逼近算法，每个分片分配其可用的物理资源块，并优化其发射功率以满足QoS要求。仿真结果表明，与基线方法相比，我们提出的网络切片双时间尺度资源分配方案在保证用户QoS要求的同时，在频谱效率方面的性能提升最大可达8.4%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEE Transactions on Network and Service Management Computer Science-Computer Networks and Communications

CiteScore

9.30

自引率

15.10%

发文量

325

期刊介绍： IEEE Transactions on Network and Service Management will publish (online only) peerreviewed archival quality papers that advance the state-of-the-art and practical applications of network and service management. Theoretical research contributions (presenting new concepts and techniques) and applied contributions (reporting on experiences and experiments with actual systems) will be encouraged. These transactions will focus on the key technical issues related to: Management Models, Architectures and Frameworks; Service Provisioning, Reliability and Quality Assurance; Management Functions; Enabling Technologies; Information and Communication Models; Policies; Applications and Case Studies; Emerging Technologies and Standards.