{"title":"CCS-MASAC Resource Allocation Method for Collaborative Cluster Satellite Systems in 6G","authors":"Jiayi He;Zhiyong Liu;Jiayi Wang;Juan Dong;Qinyu Zhang","doi":"10.1109/JIOT.2025.3575158","DOIUrl":null,"url":null,"abstract":"The collaborative cluster satellite system (CCS) within the 6G network establishes the foundation for robust services in the future Star-Earth integrated network by coordinating multiple low-earth orbit (LEO) satellites for collaborative observation missions and efficient space mission processing. This article proposes a model-based soft actor-critic (SAC) algorithm, CCS-MASAC, for optimizing throughput in clustered satellite systems within 6G networks. The algorithm integrates the clustering degree of CCS with the entropy regularization term in SAC, proposing an adaptive adjustment method. Unlike existing studies, in this work, we adopt an environment model-based policy optimization approach for the first time. Model-based policy optimization focuses on improving the sample efficiency of reinforcement learning (RL) algorithms. It allows agents to learn iteratively in both real and simulated environments, which improves sample efficiency, convergence, and algorithm robustness. To address the dimensionality explosion in single-agent RL algorithms, we extend this approach to a multiagent RL algorithm by defining observable neighborhoods for each agent, further enhancing performance. Simulation results indicate that the CCS-MASAC algorithm proposed in this article enhances throughput by 15%–20% and accelerates convergence by 30% compared to existing algorithms, including the multiagent deep Q-network (MADQN), multiagent proximal policy optimization (MAPPO), multiagent deep deterministic policy gradient (MADDPG) and multiagent double and dueling deep Q-learning (MAD3QL). The scalability and robustness of the algorithms are verified by scalability experiments and experiments under dynamic channel conditions. This research provides new solutions for throughput optimization and resource management in CCS systems.","PeriodicalId":54347,"journal":{"name":"IEEE Internet of Things Journal","volume":"12 15","pages":"31797-31812"},"PeriodicalIF":8.9000,"publicationDate":"2025-03-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Internet of Things Journal","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/11018608/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
The collaborative cluster satellite system (CCS) within the 6G network establishes the foundation for robust services in the future Star-Earth integrated network by coordinating multiple low-earth orbit (LEO) satellites for collaborative observation missions and efficient space mission processing. This article proposes a model-based soft actor-critic (SAC) algorithm, CCS-MASAC, for optimizing throughput in clustered satellite systems within 6G networks. The algorithm integrates the clustering degree of CCS with the entropy regularization term in SAC, proposing an adaptive adjustment method. Unlike existing studies, in this work, we adopt an environment model-based policy optimization approach for the first time. Model-based policy optimization focuses on improving the sample efficiency of reinforcement learning (RL) algorithms. It allows agents to learn iteratively in both real and simulated environments, which improves sample efficiency, convergence, and algorithm robustness. To address the dimensionality explosion in single-agent RL algorithms, we extend this approach to a multiagent RL algorithm by defining observable neighborhoods for each agent, further enhancing performance. Simulation results indicate that the CCS-MASAC algorithm proposed in this article enhances throughput by 15%–20% and accelerates convergence by 30% compared to existing algorithms, including the multiagent deep Q-network (MADQN), multiagent proximal policy optimization (MAPPO), multiagent deep deterministic policy gradient (MADDPG) and multiagent double and dueling deep Q-learning (MAD3QL). The scalability and robustness of the algorithms are verified by scalability experiments and experiments under dynamic channel conditions. This research provides new solutions for throughput optimization and resource management in CCS systems.
期刊介绍:
The EEE Internet of Things (IoT) Journal publishes articles and review articles covering various aspects of IoT, including IoT system architecture, IoT enabling technologies, IoT communication and networking protocols such as network coding, and IoT services and applications. Topics encompass IoT's impacts on sensor technologies, big data management, and future internet design for applications like smart cities and smart homes. Fields of interest include IoT architecture such as things-centric, data-centric, service-oriented IoT architecture; IoT enabling technologies and systematic integration such as sensor technologies, big sensor data management, and future Internet design for IoT; IoT services, applications, and test-beds such as IoT service middleware, IoT application programming interface (API), IoT application design, and IoT trials/experiments; IoT standardization activities and technology development in different standard development organizations (SDO) such as IEEE, IETF, ITU, 3GPP, ETSI, etc.