Tera-SpaceCom:基于 GNN 的深度强化学习,用于 TeraHertz 频带空间网络中的联合资源分配和任务卸载

Zhifeng Hu, Chong Han, Wolfgang Gerstacker, Ian F. Akyildiz
{"title":"Tera-SpaceCom:基于 GNN 的深度强化学习,用于 TeraHertz 频带空间网络中的联合资源分配和任务卸载","authors":"Zhifeng Hu, Chong Han, Wolfgang Gerstacker, Ian F. Akyildiz","doi":"arxiv-2409.07911","DOIUrl":null,"url":null,"abstract":"Terahertz (THz) space communications (Tera-SpaceCom) is envisioned as a\npromising technology to enable various space science and communication\napplications. Mainly, the realm of Tera-SpaceCom consists of THz sensing for\nspace exploration, data centers in space providing cloud services for space\nexploration tasks, and a low earth orbit (LEO) mega-constellation relaying\nthese tasks to ground stations (GSs) or data centers via THz links. Moreover,\nto reduce the computational burden on data centers as well as resource\nconsumption and latency in the relaying process, the LEO mega-constellation\nprovides satellite edge computing (SEC) services to directly compute space\nexploration tasks without relaying these tasks to data centers. The LEO\nsatellites that receive space exploration tasks offload (i.e., distribute)\npartial tasks to their neighboring LEO satellites, to further reduce their\ncomputational burden. However, efficient joint communication resource\nallocation and computing task offloading for the Tera-SpaceCom SEC network is\nan NP-hard mixed-integer nonlinear programming problem (MINLP), due to the\ndiscrete nature of space exploration tasks and sub-arrays as well as the\ncontinuous nature of transmit power. To tackle this challenge, a graph neural\nnetwork (GNN)-deep reinforcement learning (DRL)-based joint resource allocation\nand task offloading (GRANT) algorithm is proposed with the target of long-term\nresource efficiency (RE). Particularly, GNNs learn relationships among\ndifferent satellites from their connectivity information. Furthermore,\nmulti-agent and multi-task mechanisms cooperatively train task offloading and\nresource allocation. Compared with benchmark solutions, GRANT not only achieves\nthe highest RE with relatively low latency, but realizes the fewest trainable\nparameters and the shortest running time.","PeriodicalId":501301,"journal":{"name":"arXiv - CS - Machine Learning","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Tera-SpaceCom: GNN-based Deep Reinforcement Learning for Joint Resource Allocation and Task Offloading in TeraHertz Band Space Networks\",\"authors\":\"Zhifeng Hu, Chong Han, Wolfgang Gerstacker, Ian F. Akyildiz\",\"doi\":\"arxiv-2409.07911\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Terahertz (THz) space communications (Tera-SpaceCom) is envisioned as a\\npromising technology to enable various space science and communication\\napplications. Mainly, the realm of Tera-SpaceCom consists of THz sensing for\\nspace exploration, data centers in space providing cloud services for space\\nexploration tasks, and a low earth orbit (LEO) mega-constellation relaying\\nthese tasks to ground stations (GSs) or data centers via THz links. Moreover,\\nto reduce the computational burden on data centers as well as resource\\nconsumption and latency in the relaying process, the LEO mega-constellation\\nprovides satellite edge computing (SEC) services to directly compute space\\nexploration tasks without relaying these tasks to data centers. The LEO\\nsatellites that receive space exploration tasks offload (i.e., distribute)\\npartial tasks to their neighboring LEO satellites, to further reduce their\\ncomputational burden. However, efficient joint communication resource\\nallocation and computing task offloading for the Tera-SpaceCom SEC network is\\nan NP-hard mixed-integer nonlinear programming problem (MINLP), due to the\\ndiscrete nature of space exploration tasks and sub-arrays as well as the\\ncontinuous nature of transmit power. To tackle this challenge, a graph neural\\nnetwork (GNN)-deep reinforcement learning (DRL)-based joint resource allocation\\nand task offloading (GRANT) algorithm is proposed with the target of long-term\\nresource efficiency (RE). Particularly, GNNs learn relationships among\\ndifferent satellites from their connectivity information. Furthermore,\\nmulti-agent and multi-task mechanisms cooperatively train task offloading and\\nresource allocation. Compared with benchmark solutions, GRANT not only achieves\\nthe highest RE with relatively low latency, but realizes the fewest trainable\\nparameters and the shortest running time.\",\"PeriodicalId\":501301,\"journal\":{\"name\":\"arXiv - CS - Machine Learning\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-09-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - CS - Machine Learning\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2409.07911\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Machine Learning","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.07911","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

太赫兹(THz)空间通信(Tera-SpaceCom)被认为是实现各种空间科学和通信应用的一项新兴技术。太赫兹空间通信领域主要包括用于空间探索的太赫兹传感、为空间探索任务提供云服务的空间数据中心,以及通过太赫兹链路将这些任务中继到地面站或数据中心的低地球轨道(LEO)超大型星座。此外,为了减少数据中心的计算负担以及中继过程中的资源消耗和延迟,低地轨道超大星座提供卫星边缘计算(SEC)服务,直接计算空间探索任务,而无需将这些任务中继给数据中心。接收空间探索任务的低地轨道卫星将部分任务卸载(即分配)给邻近的低地轨道卫星,以进一步减轻其计算负担。然而,由于太空探索任务和子阵列的离散性以及发射功率的连续性,Tera-SpaceCom SEC 网络的高效联合通信资源分配和计算任务卸载是一个 NP 难的混合整数非线性编程(MINLP)问题。为了应对这一挑战,我们提出了一种基于图神经网络(GNN)和深度强化学习(DRL)的联合资源分配和任务卸载(GRANT)算法,其目标是实现长期资源效率(RE)。特别是,GNN 从不同卫星的连接信息中学习它们之间的关系。此外,多代理和多任务机制合作训练任务卸载和资源分配。与基准解决方案相比,GRANT 不仅以相对较低的延迟实现了最高的资源效率,而且实现了最少的可训练参数和最短的运行时间。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Tera-SpaceCom: GNN-based Deep Reinforcement Learning for Joint Resource Allocation and Task Offloading in TeraHertz Band Space Networks
Terahertz (THz) space communications (Tera-SpaceCom) is envisioned as a promising technology to enable various space science and communication applications. Mainly, the realm of Tera-SpaceCom consists of THz sensing for space exploration, data centers in space providing cloud services for space exploration tasks, and a low earth orbit (LEO) mega-constellation relaying these tasks to ground stations (GSs) or data centers via THz links. Moreover, to reduce the computational burden on data centers as well as resource consumption and latency in the relaying process, the LEO mega-constellation provides satellite edge computing (SEC) services to directly compute space exploration tasks without relaying these tasks to data centers. The LEO satellites that receive space exploration tasks offload (i.e., distribute) partial tasks to their neighboring LEO satellites, to further reduce their computational burden. However, efficient joint communication resource allocation and computing task offloading for the Tera-SpaceCom SEC network is an NP-hard mixed-integer nonlinear programming problem (MINLP), due to the discrete nature of space exploration tasks and sub-arrays as well as the continuous nature of transmit power. To tackle this challenge, a graph neural network (GNN)-deep reinforcement learning (DRL)-based joint resource allocation and task offloading (GRANT) algorithm is proposed with the target of long-term resource efficiency (RE). Particularly, GNNs learn relationships among different satellites from their connectivity information. Furthermore, multi-agent and multi-task mechanisms cooperatively train task offloading and resource allocation. Compared with benchmark solutions, GRANT not only achieves the highest RE with relatively low latency, but realizes the fewest trainable parameters and the shortest running time.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信