{"title":"多波束卫星系统动态带宽分配的深度强化学习","authors":"Shijun Ma, Xin Hu, Xianglai Liao, Weidong Wang","doi":"10.1109/ICCCS52626.2021.9449160","DOIUrl":null,"url":null,"abstract":"Future multi-beam satellite (MBS) network is an essential part of the air-space-ground integrated network, which is the future blueprint of 6G. As the MBS network scales up, how to allocation scarce bandwidth spectrum resources efficiently and dynamically while ensuring the Quality of Service (QoS) of the users has become a great challenge. In this paper, we designed a dynamic bandwidth allocation framework using Proximal Policy Optimization (DBA-PPO) to meet the time-varying traffic demand, maximize utilization and guarantee the QoS of the users in the MBS system. The experimental results show that the proposed bandwidth allocation algorithm can be flexible to achieve the desired effectiveness with low complexity and is more cost-effective for the large scale MBS communications scenario.","PeriodicalId":376290,"journal":{"name":"2021 IEEE 6th International Conference on Computer and Communication Systems (ICCCS)","volume":"36 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-04-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Deep Reinforcement Learning for Dynamic Bandwidth Allocation in Multi-Beam Satellite Systems\",\"authors\":\"Shijun Ma, Xin Hu, Xianglai Liao, Weidong Wang\",\"doi\":\"10.1109/ICCCS52626.2021.9449160\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Future multi-beam satellite (MBS) network is an essential part of the air-space-ground integrated network, which is the future blueprint of 6G. As the MBS network scales up, how to allocation scarce bandwidth spectrum resources efficiently and dynamically while ensuring the Quality of Service (QoS) of the users has become a great challenge. In this paper, we designed a dynamic bandwidth allocation framework using Proximal Policy Optimization (DBA-PPO) to meet the time-varying traffic demand, maximize utilization and guarantee the QoS of the users in the MBS system. The experimental results show that the proposed bandwidth allocation algorithm can be flexible to achieve the desired effectiveness with low complexity and is more cost-effective for the large scale MBS communications scenario.\",\"PeriodicalId\":376290,\"journal\":{\"name\":\"2021 IEEE 6th International Conference on Computer and Communication Systems (ICCCS)\",\"volume\":\"36 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-04-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE 6th International Conference on Computer and Communication Systems (ICCCS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCCS52626.2021.9449160\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 6th International Conference on Computer and Communication Systems (ICCCS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCCS52626.2021.9449160","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Deep Reinforcement Learning for Dynamic Bandwidth Allocation in Multi-Beam Satellite Systems
Future multi-beam satellite (MBS) network is an essential part of the air-space-ground integrated network, which is the future blueprint of 6G. As the MBS network scales up, how to allocation scarce bandwidth spectrum resources efficiently and dynamically while ensuring the Quality of Service (QoS) of the users has become a great challenge. In this paper, we designed a dynamic bandwidth allocation framework using Proximal Policy Optimization (DBA-PPO) to meet the time-varying traffic demand, maximize utilization and guarantee the QoS of the users in the MBS system. The experimental results show that the proposed bandwidth allocation algorithm can be flexible to achieve the desired effectiveness with low complexity and is more cost-effective for the large scale MBS communications scenario.