Improving Scalability of 6G Network Automation with Distributed Deep Q-Networks
Sayantini Majumdar, L. Goratti, R. Trivisonno, G. Carle
GLOBECOM 2022 - 2022 IEEE Global Communications Conference, 2022-12-04
DOI: 10.1109/GLOBECOM48099.2022.10000643
Abstract
In recent years, owing to the architectural evolution of 6G towards decentralization, distributed intelligence has been studied extensively for 6G network automation. Distributed intelligence based on Reinforcement Learning (RL), particularly Q-Learning (QL), has been proposed as a potential direction. The distributed framework consists of independent QL agents, each attempting to reach its own objective. The agents need a sufficient number of training steps before they converge to optimal performance; only after convergence can they take reliable management actions. However, the scalability of QL can be severely hindered, particularly in terms of convergence time, as the number of QL agents increases. To overcome this scalability issue, in this paper we explore the potential of the Deep Q-Network (DQN) algorithm, a function-approximation-based method. Results show that DQN outperforms QL by at least 37% in terms of convergence time. In addition, we highlight that DQN is prone to divergence, which, if solved, could rapidly advance distributed intelligence for 6G.
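To make the QL-versus-DQN contrast concrete, the sketch below shows a standard tabular Q-Learning update next to a DQN-style gradient update. This is a minimal illustration of the generic algorithms, not the paper's implementation: all names, network sizes, state dimensions, and hyperparameters here are assumptions chosen for readability.

```python
# Minimal sketch contrasting tabular Q-Learning with a DQN-style
# function approximator. All dimensions and hyperparameters are
# illustrative assumptions, not the paper's actual setup.
import numpy as np
import torch
import torch.nn as nn

# --- Tabular Q-Learning: one table entry per (state, action) pair ---
n_states, n_actions = 100, 4      # assumed small discrete space
alpha, gamma = 0.1, 0.99          # learning rate, discount factor
Q = np.zeros((n_states, n_actions))

def ql_update(s, a, r, s_next):
    """Q-Learning update: Q(s,a) += alpha * (TD target - Q(s,a))."""
    td_target = r + gamma * Q[s_next].max()
    Q[s, a] += alpha * (td_target - Q[s, a])

# --- DQN: a neural network replaces the table, so memory and lookup
# --- cost no longer grow with the size of the state space.
state_dim = 8                     # assumed continuous state features
q_net = nn.Sequential(
    nn.Linear(state_dim, 64), nn.ReLU(),
    nn.Linear(64, n_actions),
)
target_net = nn.Sequential(       # frozen copy for stable targets
    nn.Linear(state_dim, 64), nn.ReLU(),
    nn.Linear(64, n_actions),
)
target_net.load_state_dict(q_net.state_dict())
optimizer = torch.optim.Adam(q_net.parameters(), lr=1e-3)

def dqn_update(s, a, r, s_next):
    """One gradient step on a replay-buffer batch of (s, a, r, s')."""
    q_sa = q_net(s).gather(1, a.unsqueeze(1)).squeeze(1)
    with torch.no_grad():
        # Same bootstrap target as tabular QL, but estimated by the
        # target network instead of read from a table.
        td_target = r + gamma * target_net(s_next).max(dim=1).values
    loss = nn.functional.mse_loss(q_sa, td_target)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
```

As a side note, the target network and experience replay shown here are the standard mitigations for the divergence risk of value-based deep RL that the abstract highlights: bootstrapping from a slowly updated copy of the network stabilizes the regression target that the Q-network is trained against.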