Reinforcement Learning Applied to Cognitive Space Communications

Carson D. Schubert, Rigoberto Roche', J. Briones
DOI: 10.1109/CCAAW.2019.8904912
Published in: 2019 IEEE Cognitive Communications for Aerospace Applications Workshop (CCAAW), June 2019
Citations: 0

Abstract

The future of space exploration depends on robust, reliable communication systems. As the number of such communication systems increases, automation is fast becoming a requirement for achieving this goal. Reinforcement learning is one possible automation method for such systems. The goal of this study is to build a reinforcement learning algorithm that optimizes the data throughput of a single actor. A training environment was created to simulate a link within the NASA Space Communication and Navigation (SCaN) infrastructure, using state-of-the-art simulation tools developed by the SCaN Center for Engineering, Networks, Integration, and Communications (SCENIC) laboratory at NASA Glenn Research Center, to obtain the closest possible representation of the real operating environment. Reinforcement learning was then used to train an agent inside this environment to maximize data throughput. The simulation environment contained a single actor in low Earth orbit capable of communicating with the twenty-five ground stations that compose the Near-Earth Network (NEN). Initial experiments showed promising training results, so additional complexity was added by augmenting the simulation data with link fading profiles obtained from real communication events with the International Space Station. A grid search was performed to find the optimal hyperparameters and model architecture for the agent, and an agent was then trained on the augmented training data using those results. Testing shows that the agent performs well inside the training environment; it can serve as a foundation for future studies with added complexity and eventually be tested in the real space environment.
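The paper does not publish code, but the core loop it describes, an agent learning which ground station to use to maximize throughput, followed by a grid search over hyperparameters, can be sketched with a toy stand-in environment. Everything below (`ToyLinkEnv`, the base rates, the random fading model, the epsilon/alpha grid) is a hypothetical illustration, not the SCENIC simulation or the authors' actual agent or architecture:

```python
import itertools
import random

N_STATIONS = 25  # the NEN ground stations described in the abstract

class ToyLinkEnv:
    """Hypothetical stand-in for the link simulation: each station has a
    fixed base data rate, perturbed each step by a random fading factor
    (loosely analogous to the ISS-derived link fading profiles)."""
    def __init__(self, seed=0):
        self.rng = random.Random(seed)
        self.base_rates = [self.rng.uniform(1.0, 10.0) for _ in range(N_STATIONS)]

    def step(self, station):
        fading = self.rng.uniform(0.5, 1.0)       # random link degradation
        return self.base_rates[station] * fading  # throughput as reward

def train(env, episodes=2000, epsilon=0.1, alpha=0.1):
    """Epsilon-greedy agent with incremental value estimates per station."""
    q = [0.0] * N_STATIONS
    total = 0.0
    for _ in range(episodes):
        if env.rng.random() < epsilon:
            a = env.rng.randrange(N_STATIONS)                  # explore
        else:
            a = max(range(N_STATIONS), key=q.__getitem__)      # exploit
        r = env.step(a)
        total += r
        q[a] += alpha * (r - q[a])  # incremental average update
    return q, total / episodes

# Tiny grid search over the two hyperparameters, analogous in spirit to
# the paper's search over hyperparameters and model architectures.
grid = {"epsilon": [0.05, 0.1, 0.2], "alpha": [0.05, 0.1]}
best_cfg, best_reward = None, float("-inf")
for eps, al in itertools.product(grid["epsilon"], grid["alpha"]):
    _, avg = train(ToyLinkEnv(seed=7), epsilon=eps, alpha=al)
    if avg > best_reward:
        best_cfg, best_reward = (eps, al), avg
print("best (epsilon, alpha):", best_cfg, "avg throughput:", round(best_reward, 2))
```

The real study trains a neural-network agent inside a high-fidelity SCaN link simulation; this sketch only mirrors the shape of the workflow (simulated environment, reward = throughput, hyperparameter grid search, retrain with the winning configuration).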