{"title":"Joint Physical and Network Layers Design for STARS-Assisted Multi-Cellular Edge Caching","authors":"Zhaoming Hu;Chao Fang;Ruikang Zhong;Yuanwei Liu","doi":"10.1109/TWC.2024.3453672","DOIUrl":null,"url":null,"abstract":"A simultaneously transmitting and reflecting surface (STARS) assisted multi-user downlink multiple-input signal-output (MISO) multi-cellular edge caching system is investigated. The deployment of STARS enhances the coverage of base stations (BSs), particularly at cellular boundaries. However, this advancement introduces a complex user association issue that necessitates the consideration of both caching state and channel state information (CSI). In this paper, we formulate a joint optimization problem involving content caching, user association, active beamforming at BS, and passive beamforming at STARS for minimizing long-term power consumption. We propose two algorithms for the formulated problem: 1) A two time-scale cooperative twin delayed deep deterministic policy gradients (TD3). Considering the distinct time scales of the pushing and delivering phases in edge caching, the Markov decision process (MDP) models of dual time scales are constructed and two deep reinforcement learning (DRL) agents work together to jointly address the optimization problem. 2) A bio-inspired DRL framework, especially, a particle swarm optimization (PSO)-inspired TD3 algorithm is introduced in detail. Inspired by the behavior of the biological population in nature, this algorithm regards agents as individuals and enables the concurrent training of multiple agents while they interact with global information via a biological population information interaction mode, thereby enhancing the performance of power optimization. The numerical results demonstrate that the STARS-assisted multi-cellular edge caching system has advantages over traditional cellular systems, especially in scenarios where the number of mobile users and Zipf skewness factor is large. Moreover, the proposed two time-scale cooperative TD3 and PSO-inspired TD3 algorithms are superior in reducing network power consumption than conventional TD3.","PeriodicalId":13431,"journal":{"name":"IEEE Transactions on Wireless Communications","volume":"23 11","pages":"17446-17460"},"PeriodicalIF":8.9000,"publicationDate":"2024-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Wireless Communications","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10678886/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
A simultaneously transmitting and reflecting surface (STARS) assisted multi-user downlink multiple-input signal-output (MISO) multi-cellular edge caching system is investigated. The deployment of STARS enhances the coverage of base stations (BSs), particularly at cellular boundaries. However, this advancement introduces a complex user association issue that necessitates the consideration of both caching state and channel state information (CSI). In this paper, we formulate a joint optimization problem involving content caching, user association, active beamforming at BS, and passive beamforming at STARS for minimizing long-term power consumption. We propose two algorithms for the formulated problem: 1) A two time-scale cooperative twin delayed deep deterministic policy gradients (TD3). Considering the distinct time scales of the pushing and delivering phases in edge caching, the Markov decision process (MDP) models of dual time scales are constructed and two deep reinforcement learning (DRL) agents work together to jointly address the optimization problem. 2) A bio-inspired DRL framework, especially, a particle swarm optimization (PSO)-inspired TD3 algorithm is introduced in detail. Inspired by the behavior of the biological population in nature, this algorithm regards agents as individuals and enables the concurrent training of multiple agents while they interact with global information via a biological population information interaction mode, thereby enhancing the performance of power optimization. The numerical results demonstrate that the STARS-assisted multi-cellular edge caching system has advantages over traditional cellular systems, especially in scenarios where the number of mobile users and Zipf skewness factor is large. Moreover, the proposed two time-scale cooperative TD3 and PSO-inspired TD3 algorithms are superior in reducing network power consumption than conventional TD3.
期刊介绍:
The IEEE Transactions on Wireless Communications is a prestigious publication that showcases cutting-edge advancements in wireless communications. It welcomes both theoretical and practical contributions in various areas. The scope of the Transactions encompasses a wide range of topics, including modulation and coding, detection and estimation, propagation and channel characterization, and diversity techniques. The journal also emphasizes the physical and link layer communication aspects of network architectures and protocols.
The journal is open to papers on specific topics or non-traditional topics related to specific application areas. This includes simulation tools and methodologies, orthogonal frequency division multiplexing, MIMO systems, and wireless over optical technologies.
Overall, the IEEE Transactions on Wireless Communications serves as a platform for high-quality manuscripts that push the boundaries of wireless communications and contribute to advancements in the field.