Linh T. Hoang;Chuyen T. Nguyen;Hoang D. Le;Anh T. Pham
{"title":"地面/非地面集成网络中无人机-BS 机队合作轨迹设计的多代理强化学习","authors":"Linh T. Hoang;Chuyen T. Nguyen;Hoang D. Le;Anh T. Pham","doi":"10.23919/comex.2024XBL0084","DOIUrl":null,"url":null,"abstract":"Aerial base stations (ABSs) have been envisioned as a promising technology toward ubiquitous coverage and seamless high-rate connectivity in sixth-generation (6G) wireless networks. With the inherent mobility but limited communication range, the placement of ABSs should adapt to the time-varying network conditions, e.g., the user distribution and wireless channel state. This letter investigates the cooperative trajectory of UAVs in an integrated terrestrial and non-terrestrial network (TNTN), where unmanned aerial vehicles (UAVs) are deployed as ABSs to supplement the terrestrial macro base station (mBS). We formulate an optimization problem to maximize the number of users with a certain minimum data rate, which is solved using multi-agent reinforcement learning (MARL). Numerical results show that the proposed design is superior to conventional approaches for the cooperative trajectory of UAVs, including K-means clustering-based and single-agent reinforcement learning (SARL)-based methods.","PeriodicalId":54101,"journal":{"name":"IEICE Communications Express","volume":"13 8","pages":"327-330"},"PeriodicalIF":0.3000,"publicationDate":"2024-06-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10554673","citationCount":"0","resultStr":"{\"title\":\"Multi-Agent Reinforcement Learning for Cooperative Trajectory Design of UAV-BS Fleets in Terrestrial/Non-Terrestrial Integrated Networks\",\"authors\":\"Linh T. Hoang;Chuyen T. Nguyen;Hoang D. Le;Anh T. Pham\",\"doi\":\"10.23919/comex.2024XBL0084\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Aerial base stations (ABSs) have been envisioned as a promising technology toward ubiquitous coverage and seamless high-rate connectivity in sixth-generation (6G) wireless networks. With the inherent mobility but limited communication range, the placement of ABSs should adapt to the time-varying network conditions, e.g., the user distribution and wireless channel state. This letter investigates the cooperative trajectory of UAVs in an integrated terrestrial and non-terrestrial network (TNTN), where unmanned aerial vehicles (UAVs) are deployed as ABSs to supplement the terrestrial macro base station (mBS). We formulate an optimization problem to maximize the number of users with a certain minimum data rate, which is solved using multi-agent reinforcement learning (MARL). Numerical results show that the proposed design is superior to conventional approaches for the cooperative trajectory of UAVs, including K-means clustering-based and single-agent reinforcement learning (SARL)-based methods.\",\"PeriodicalId\":54101,\"journal\":{\"name\":\"IEICE Communications Express\",\"volume\":\"13 8\",\"pages\":\"327-330\"},\"PeriodicalIF\":0.3000,\"publicationDate\":\"2024-06-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10554673\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEICE Communications Express\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10554673/\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEICE Communications Express","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10554673/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
Multi-Agent Reinforcement Learning for Cooperative Trajectory Design of UAV-BS Fleets in Terrestrial/Non-Terrestrial Integrated Networks
Aerial base stations (ABSs) have been envisioned as a promising technology toward ubiquitous coverage and seamless high-rate connectivity in sixth-generation (6G) wireless networks. With the inherent mobility but limited communication range, the placement of ABSs should adapt to the time-varying network conditions, e.g., the user distribution and wireless channel state. This letter investigates the cooperative trajectory of UAVs in an integrated terrestrial and non-terrestrial network (TNTN), where unmanned aerial vehicles (UAVs) are deployed as ABSs to supplement the terrestrial macro base station (mBS). We formulate an optimization problem to maximize the number of users with a certain minimum data rate, which is solved using multi-agent reinforcement learning (MARL). Numerical results show that the proposed design is superior to conventional approaches for the cooperative trajectory of UAVs, including K-means clustering-based and single-agent reinforcement learning (SARL)-based methods.