Yi Luo , Shaochen Zhang , Qintuya Si , Xin Zhao , Yang Liu , Tianshuang Qiu
{"title":"Energy-efficient resource allocation based on deep reinforcement learning for massive MIMO-NOMA-SWIPT system","authors":"Yi Luo , Shaochen Zhang , Qintuya Si , Xin Zhao , Yang Liu , Tianshuang Qiu","doi":"10.1016/j.aeue.2025.155947","DOIUrl":null,"url":null,"abstract":"<div><div>Massive multiple-input multiple-output (MIMO) and non-orthogonal multiple access (NOMA) can improve the energy efficiency (EE) of simultaneous wireless information and power transfer (SWIPT) systems by jointly optimizing the power allocation coefficient and the time slot handover coefficient. However, improper allocation of resources severely restricts the improvement of EE and substantially elevates communication costs of SWIPT. To address these issues, this study proposes a joint resource allocation algorithm for optimizing user scheduling, power allocation, and power splitting based on distributed multi-agent double deep Q-network and double multi-agent deep deterministic policy gradient (MADDQN-DMADDPG) network. Specifically, a double deep Q-network (DDQN) is used to optimize user scheduling by decoupling the action selection from action evaluation, which resolves the Q-value overestimation and mitigates multi-user interference. Then, to improve the training stability, a deep deterministic policy gradient (DDPG) is utilized to optimize the power allocation and power splitting by leveraging its ability to handle continuous action spaces. Moreover, we introduce the distributed multi-agent learning to bolster the learning capabilities of DDQN and DDPG, ensuring accuracy and efficiency of resource allocation. Simulations demonstrate that the proposed algorithm can significantly improve the overall EE of system with fast convergence speed.</div></div>","PeriodicalId":50844,"journal":{"name":"Aeu-International Journal of Electronics and Communications","volume":"201 ","pages":"Article 155947"},"PeriodicalIF":3.2000,"publicationDate":"2025-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Aeu-International Journal of Electronics and Communications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1434841125002882","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
Massive multiple-input multiple-output (MIMO) and non-orthogonal multiple access (NOMA) can improve the energy efficiency (EE) of simultaneous wireless information and power transfer (SWIPT) systems by jointly optimizing the power allocation coefficient and the time slot handover coefficient. However, improper allocation of resources severely restricts the improvement of EE and substantially elevates communication costs of SWIPT. To address these issues, this study proposes a joint resource allocation algorithm for optimizing user scheduling, power allocation, and power splitting based on distributed multi-agent double deep Q-network and double multi-agent deep deterministic policy gradient (MADDQN-DMADDPG) network. Specifically, a double deep Q-network (DDQN) is used to optimize user scheduling by decoupling the action selection from action evaluation, which resolves the Q-value overestimation and mitigates multi-user interference. Then, to improve the training stability, a deep deterministic policy gradient (DDPG) is utilized to optimize the power allocation and power splitting by leveraging its ability to handle continuous action spaces. Moreover, we introduce the distributed multi-agent learning to bolster the learning capabilities of DDQN and DDPG, ensuring accuracy and efficiency of resource allocation. Simulations demonstrate that the proposed algorithm can significantly improve the overall EE of system with fast convergence speed.
期刊介绍:
AEÜ is an international scientific journal which publishes both original works and invited tutorials. The journal''s scope covers all aspects of theory and design of circuits, systems and devices for electronics, signal processing, and communication, including:
signal and system theory, digital signal processing
network theory and circuit design
information theory, communication theory and techniques, modulation, source and channel coding
switching theory and techniques, communication protocols
optical communications
microwave theory and techniques, radar, sonar
antennas, wave propagation
AEÜ publishes full papers and letters with very short turn around time but a high standard review process. Review cycles are typically finished within twelve weeks by application of modern electronic communication facilities.