{"title":"基于深度强化学习的LEO卫星物联网端到端跨域网络切片","authors":"Mingjun Liao , Ruyan Wang , Puning Zhang","doi":"10.1016/j.phycom.2025.102821","DOIUrl":null,"url":null,"abstract":"<div><div>Low earth orbit satellite Internet of Things (LEO SIoT) represents a pivotal infrastructure for enabling global, ubiquitous, and low-latency communications. Network slicing has emerged as a promising paradigm to address the core challenges of dynamic resource allocation and service differentiation in LEO SIoT. To ensure end-to-end Quality of Service (QoS) across slices, we propose an end-to-end cross-domain slicing framework. This framework comprises a centralized cross-domain coordinator and multiple domain-specific controllers. An adaptive cross-domain delay-balancing strategy is devised for the coordinator to allocate delay budgets across domains. Based on the allocated delay budgets, a radio access network (RAN) slicing policy is developed for the RAN controller using a Dueling Double Deep Q-Network (D3QN), enabling dynamic wireless resource allocation that guarantees low-latency for ultra-reliable low-latency communication (URLLC) services while optimizing throughput for enhanced mobile broadband (eMBB) users. To cope with the rapidly evolving LEO topology, a rollback mechanism-based authentic boundary Proximal Policy Optimization (RMABPPO) algorithm, enhanced with an integrated Graph Attention Network and Sequence-to-Sequence module (iGATSeq), is introduced for core network (CN) slicing. Simulation results demonstrate that the proposed end-to-end cross-domain slicing solution not only ensures Quality of Experience (QoE) for both eMBB and URLLC users, but also significantly improves resource utilization in LEO SIoT.</div></div>","PeriodicalId":48707,"journal":{"name":"Physical Communication","volume":"73 ","pages":"Article 102821"},"PeriodicalIF":2.2000,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"End-to-end cross-domain network slicing for LEO satellite Internet of Things using deep reinforcement learning\",\"authors\":\"Mingjun Liao , Ruyan Wang , Puning Zhang\",\"doi\":\"10.1016/j.phycom.2025.102821\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Low earth orbit satellite Internet of Things (LEO SIoT) represents a pivotal infrastructure for enabling global, ubiquitous, and low-latency communications. Network slicing has emerged as a promising paradigm to address the core challenges of dynamic resource allocation and service differentiation in LEO SIoT. To ensure end-to-end Quality of Service (QoS) across slices, we propose an end-to-end cross-domain slicing framework. This framework comprises a centralized cross-domain coordinator and multiple domain-specific controllers. An adaptive cross-domain delay-balancing strategy is devised for the coordinator to allocate delay budgets across domains. Based on the allocated delay budgets, a radio access network (RAN) slicing policy is developed for the RAN controller using a Dueling Double Deep Q-Network (D3QN), enabling dynamic wireless resource allocation that guarantees low-latency for ultra-reliable low-latency communication (URLLC) services while optimizing throughput for enhanced mobile broadband (eMBB) users. To cope with the rapidly evolving LEO topology, a rollback mechanism-based authentic boundary Proximal Policy Optimization (RMABPPO) algorithm, enhanced with an integrated Graph Attention Network and Sequence-to-Sequence module (iGATSeq), is introduced for core network (CN) slicing. Simulation results demonstrate that the proposed end-to-end cross-domain slicing solution not only ensures Quality of Experience (QoE) for both eMBB and URLLC users, but also significantly improves resource utilization in LEO SIoT.</div></div>\",\"PeriodicalId\":48707,\"journal\":{\"name\":\"Physical Communication\",\"volume\":\"73 \",\"pages\":\"Article 102821\"},\"PeriodicalIF\":2.2000,\"publicationDate\":\"2025-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Physical Communication\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1874490725002241\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Physical Communication","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1874490725002241","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
摘要
低地球轨道卫星物联网(LEO SIoT)代表了实现全球、无处不在和低延迟通信的关键基础设施。网络切片已经成为解决LEO SIoT中动态资源分配和服务差异化核心挑战的一种有前途的范式。为了保证端到端的服务质量(QoS),我们提出了一个端到端的跨域切片框架。这个框架包括一个集中的跨域协调器和多个特定于域的控制器。设计了一种自适应跨域延迟平衡策略,用于协调器跨域分配延迟预算。基于分配的延迟预算,采用Dueling Double Deep Q-Network (D3QN)为RAN控制器开发了无线接入网(RAN)切片策略,实现了动态无线资源分配,保证了超可靠低延迟通信(URLLC)服务的低延迟,同时优化了增强型移动宽带(eMBB)用户的吞吐量。为了应对快速发展的LEO拓扑结构,引入了一种基于回滚机制的真实边界近端策略优化(RMABPPO)算法,并集成了图注意网络和序列到序列模块(iGATSeq),用于核心网络(CN)切片。仿真结果表明,提出的端到端跨域切片方案不仅保证了eMBB和URLLC用户的体验质量(QoE),而且显著提高了LEO SIoT的资源利用率。
End-to-end cross-domain network slicing for LEO satellite Internet of Things using deep reinforcement learning
Low earth orbit satellite Internet of Things (LEO SIoT) represents a pivotal infrastructure for enabling global, ubiquitous, and low-latency communications. Network slicing has emerged as a promising paradigm to address the core challenges of dynamic resource allocation and service differentiation in LEO SIoT. To ensure end-to-end Quality of Service (QoS) across slices, we propose an end-to-end cross-domain slicing framework. This framework comprises a centralized cross-domain coordinator and multiple domain-specific controllers. An adaptive cross-domain delay-balancing strategy is devised for the coordinator to allocate delay budgets across domains. Based on the allocated delay budgets, a radio access network (RAN) slicing policy is developed for the RAN controller using a Dueling Double Deep Q-Network (D3QN), enabling dynamic wireless resource allocation that guarantees low-latency for ultra-reliable low-latency communication (URLLC) services while optimizing throughput for enhanced mobile broadband (eMBB) users. To cope with the rapidly evolving LEO topology, a rollback mechanism-based authentic boundary Proximal Policy Optimization (RMABPPO) algorithm, enhanced with an integrated Graph Attention Network and Sequence-to-Sequence module (iGATSeq), is introduced for core network (CN) slicing. Simulation results demonstrate that the proposed end-to-end cross-domain slicing solution not only ensures Quality of Experience (QoE) for both eMBB and URLLC users, but also significantly improves resource utilization in LEO SIoT.
期刊介绍:
PHYCOM: Physical Communication is an international and archival journal providing complete coverage of all topics of interest to those involved in all aspects of physical layer communications. Theoretical research contributions presenting new techniques, concepts or analyses, applied contributions reporting on experiences and experiments, and tutorials are published.
Topics of interest include but are not limited to:
Physical layer issues of Wireless Local Area Networks, WiMAX, Wireless Mesh Networks, Sensor and Ad Hoc Networks, PCS Systems; Radio access protocols and algorithms for the physical layer; Spread Spectrum Communications; Channel Modeling; Detection and Estimation; Modulation and Coding; Multiplexing and Carrier Techniques; Broadband Wireless Communications; Wireless Personal Communications; Multi-user Detection; Signal Separation and Interference rejection: Multimedia Communications over Wireless; DSP Applications to Wireless Systems; Experimental and Prototype Results; Multiple Access Techniques; Space-time Processing; Synchronization Techniques; Error Control Techniques; Cryptography; Software Radios; Tracking; Resource Allocation and Inference Management; Multi-rate and Multi-carrier Communications; Cross layer Design and Optimization; Propagation and Channel Characterization; OFDM Systems; MIMO Systems; Ultra-Wideband Communications; Cognitive Radio System Architectures; Platforms and Hardware Implementations for the Support of Cognitive, Radio Systems; Cognitive Radio Resource Management and Dynamic Spectrum Sharing.