João Borges, Ailton Pinto De Oliveira, Felipe Henrique Bastos E Bastos, Daniel Takashi Ne Do Nascimento Suzuki, Emerson Santos De Oliveira Junior, Lucas Matni Bezerra, C. Nahum, P. Batista, Aldebaro Barreto Da Rocha Klautau Junior
{"title":"基于鱼子酱模拟的调度和Mimo波束选择的强化学习","authors":"João Borges, Ailton Pinto De Oliveira, Felipe Henrique Bastos E Bastos, Daniel Takashi Ne Do Nascimento Suzuki, Emerson Santos De Oliveira Junior, Lucas Matni Bezerra, C. Nahum, P. Batista, Aldebaro Barreto Da Rocha Klautau Junior","doi":"10.23919/ITUK53220.2021.9662100","DOIUrl":null,"url":null,"abstract":"This paper describes a framework for research on Reinforcement Learning (RL) applied to scheduling and MIMO beam selection. This framework consists of asking the RL agent to schedule a user and then choose the index of a beamforming codebook to serve it. A key aspect of this problem is that the simulation of the communication system and the artificial intelligence engine is based on a virtual world created with AirSim and the Unreal Engine. These components enable the so-called CAVIAR methodology, which leads to highly realistic 3D scenarios. This paper describes the communication and RL modeling adopted in the framework and also presents statistics concerning the implemented RL environment, such as data traffic, as well as results for three baseline systems.","PeriodicalId":423554,"journal":{"name":"2021 ITU Kaleidoscope: Connecting Physical and Virtual Worlds (ITU K)","volume":"48 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Reinforcement Learning for Scheduling and Mimo beam Selection using Caviar Simulations\",\"authors\":\"João Borges, Ailton Pinto De Oliveira, Felipe Henrique Bastos E Bastos, Daniel Takashi Ne Do Nascimento Suzuki, Emerson Santos De Oliveira Junior, Lucas Matni Bezerra, C. Nahum, P. Batista, Aldebaro Barreto Da Rocha Klautau Junior\",\"doi\":\"10.23919/ITUK53220.2021.9662100\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper describes a framework for research on Reinforcement Learning (RL) applied to scheduling and MIMO beam selection. This framework consists of asking the RL agent to schedule a user and then choose the index of a beamforming codebook to serve it. A key aspect of this problem is that the simulation of the communication system and the artificial intelligence engine is based on a virtual world created with AirSim and the Unreal Engine. These components enable the so-called CAVIAR methodology, which leads to highly realistic 3D scenarios. This paper describes the communication and RL modeling adopted in the framework and also presents statistics concerning the implemented RL environment, such as data traffic, as well as results for three baseline systems.\",\"PeriodicalId\":423554,\"journal\":{\"name\":\"2021 ITU Kaleidoscope: Connecting Physical and Virtual Worlds (ITU K)\",\"volume\":\"48 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-12-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 ITU Kaleidoscope: Connecting Physical and Virtual Worlds (ITU K)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.23919/ITUK53220.2021.9662100\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 ITU Kaleidoscope: Connecting Physical and Virtual Worlds (ITU K)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.23919/ITUK53220.2021.9662100","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Reinforcement Learning for Scheduling and Mimo beam Selection using Caviar Simulations
This paper describes a framework for research on Reinforcement Learning (RL) applied to scheduling and MIMO beam selection. This framework consists of asking the RL agent to schedule a user and then choose the index of a beamforming codebook to serve it. A key aspect of this problem is that the simulation of the communication system and the artificial intelligence engine is based on a virtual world created with AirSim and the Unreal Engine. These components enable the so-called CAVIAR methodology, which leads to highly realistic 3D scenarios. This paper describes the communication and RL modeling adopted in the framework and also presents statistics concerning the implemented RL environment, such as data traffic, as well as results for three baseline systems.