Alif Rahmatullah Umar;Hasan Albinsaid;Chia-Po Wei;Chih-Peng Li
{"title":"ris辅助多无人机MU-MISO通信网络的深度强化学习:和速率和能效最大化","authors":"Alif Rahmatullah Umar;Hasan Albinsaid;Chia-Po Wei;Chih-Peng Li","doi":"10.1109/OJVT.2025.3589661","DOIUrl":null,"url":null,"abstract":"Uncrewed aerial vehicles (UAVs) have emerged as a promising solution for enhancing wireless networks, especially in challenging environments. However, recent studies that integrate reconfigurable intelligent surfaces (RIS) with UAVs tend to focus on limited aspects, such as single-UAV deployments or partial optimization of system parameters, thereby neglecting a comprehensive system-level design. To overcome these limitations, we propose a multi-user MISO communication network that leverages RIS-assisted UAVs to maximize both sum-rate and energy efficiency as two distinct objectives. Our approach stands out by considering multiple UAVs and incorporating four critical constraints: UAV flying areas, power limitations, transmit beamforming, and RIS requirements. We formulate separate optimization problems for sum-rate and energy efficiency, and address them using deep reinforcement learning (DRL) algorithms, namely proximal policy optimization (PPO) and deep deterministic policy gradient (DDPG). By jointly optimizing UAV coordinates, the transmit beamforming matrix, and RIS phase shifts, our method achieves significant performance improvements under dynamic environmental conditions. Extensive simulations show that our comprehensive strategy delivers higher sum-rates and enhanced energy efficiency, underscoring its practical potential for next-generation RIS-assisted UAV communication systems.","PeriodicalId":34270,"journal":{"name":"IEEE Open Journal of Vehicular Technology","volume":"6 ","pages":"2033-2047"},"PeriodicalIF":4.8000,"publicationDate":"2025-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11081475","citationCount":"0","resultStr":"{\"title\":\"Deep Reinforcement Learning for RIS-Assisted Multi-UAV MU-MISO Communication Networks: Sum-Rate and Energy Efficiency Maximization\",\"authors\":\"Alif Rahmatullah Umar;Hasan Albinsaid;Chia-Po Wei;Chih-Peng Li\",\"doi\":\"10.1109/OJVT.2025.3589661\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Uncrewed aerial vehicles (UAVs) have emerged as a promising solution for enhancing wireless networks, especially in challenging environments. However, recent studies that integrate reconfigurable intelligent surfaces (RIS) with UAVs tend to focus on limited aspects, such as single-UAV deployments or partial optimization of system parameters, thereby neglecting a comprehensive system-level design. To overcome these limitations, we propose a multi-user MISO communication network that leverages RIS-assisted UAVs to maximize both sum-rate and energy efficiency as two distinct objectives. Our approach stands out by considering multiple UAVs and incorporating four critical constraints: UAV flying areas, power limitations, transmit beamforming, and RIS requirements. We formulate separate optimization problems for sum-rate and energy efficiency, and address them using deep reinforcement learning (DRL) algorithms, namely proximal policy optimization (PPO) and deep deterministic policy gradient (DDPG). By jointly optimizing UAV coordinates, the transmit beamforming matrix, and RIS phase shifts, our method achieves significant performance improvements under dynamic environmental conditions. Extensive simulations show that our comprehensive strategy delivers higher sum-rates and enhanced energy efficiency, underscoring its practical potential for next-generation RIS-assisted UAV communication systems.\",\"PeriodicalId\":34270,\"journal\":{\"name\":\"IEEE Open Journal of Vehicular Technology\",\"volume\":\"6 \",\"pages\":\"2033-2047\"},\"PeriodicalIF\":4.8000,\"publicationDate\":\"2025-07-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11081475\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Open Journal of Vehicular Technology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/11081475/\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Open Journal of Vehicular Technology","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/11081475/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
Deep Reinforcement Learning for RIS-Assisted Multi-UAV MU-MISO Communication Networks: Sum-Rate and Energy Efficiency Maximization
Uncrewed aerial vehicles (UAVs) have emerged as a promising solution for enhancing wireless networks, especially in challenging environments. However, recent studies that integrate reconfigurable intelligent surfaces (RIS) with UAVs tend to focus on limited aspects, such as single-UAV deployments or partial optimization of system parameters, thereby neglecting a comprehensive system-level design. To overcome these limitations, we propose a multi-user MISO communication network that leverages RIS-assisted UAVs to maximize both sum-rate and energy efficiency as two distinct objectives. Our approach stands out by considering multiple UAVs and incorporating four critical constraints: UAV flying areas, power limitations, transmit beamforming, and RIS requirements. We formulate separate optimization problems for sum-rate and energy efficiency, and address them using deep reinforcement learning (DRL) algorithms, namely proximal policy optimization (PPO) and deep deterministic policy gradient (DDPG). By jointly optimizing UAV coordinates, the transmit beamforming matrix, and RIS phase shifts, our method achieves significant performance improvements under dynamic environmental conditions. Extensive simulations show that our comprehensive strategy delivers higher sum-rates and enhanced energy efficiency, underscoring its practical potential for next-generation RIS-assisted UAV communication systems.