Manuel M. H. Roth, Anupama Hegde, Thomas Delamotte, Andreas Knopp
{"title":"Shaping Rewards, Shaping Routes: On Multi-Agent Deep Q-Networks for Routing in Satellite Constellation Networks","authors":"Manuel M. H. Roth, Anupama Hegde, Thomas Delamotte, Andreas Knopp","doi":"arxiv-2408.01979","DOIUrl":null,"url":null,"abstract":"Effective routing in satellite mega-constellations has become crucial to\nfacilitate the handling of increasing traffic loads, more complex network\narchitectures, as well as the integration into 6G networks. To enhance\nadaptability as well as robustness to unpredictable traffic demands, and to\nsolve dynamic routing environments efficiently, machine learning-based\nsolutions are being considered. For network control problems, such as\noptimizing packet forwarding decisions according to Quality of Service\nrequirements and maintaining network stability, deep reinforcement learning\ntechniques have demonstrated promising results. For this reason, we investigate\nthe viability of multi-agent deep Q-networks for routing in satellite\nconstellation networks. We focus specifically on reward shaping and quantifying\ntraining convergence for joint optimization of latency and load balancing in\nstatic and dynamic scenarios. To address identified drawbacks, we propose a\nnovel hybrid solution based on centralized learning and decentralized control.","PeriodicalId":501280,"journal":{"name":"arXiv - CS - Networking and Internet Architecture","volume":"10 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Networking and Internet Architecture","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2408.01979","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Effective routing in satellite mega-constellations has become crucial to
facilitate the handling of increasing traffic loads, more complex network
architectures, as well as the integration into 6G networks. To enhance
adaptability as well as robustness to unpredictable traffic demands, and to
solve dynamic routing environments efficiently, machine learning-based
solutions are being considered. For network control problems, such as
optimizing packet forwarding decisions according to Quality of Service
requirements and maintaining network stability, deep reinforcement learning
techniques have demonstrated promising results. For this reason, we investigate
the viability of multi-agent deep Q-networks for routing in satellite
constellation networks. We focus specifically on reward shaping and quantifying
training convergence for joint optimization of latency and load balancing in
static and dynamic scenarios. To address identified drawbacks, we propose a
novel hybrid solution based on centralized learning and decentralized control.