Distributed Age-of-Information Scheduling With NOMA via Deep Reinforcement Learning

IF 7.7 · CAS Tier 2 (Computer Science) · JCR Q1 (Computer Science, Information Systems)
Congwei Zhang;Yifei Zou;Zuyuan Zhang;Dongxiao Yu;Jorge Torres Gómez;Tian Lan;Falko Dressler;Xiuzhen Cheng
DOI: 10.1109/TMC.2024.3459101
IEEE Transactions on Mobile Computing, vol. 24, no. 1, pp. 30-44, published 2024-09-12
https://ieeexplore.ieee.org/document/10678915/
Citations: 0

Abstract

Many emerging applications in edge computing require processing of huge volumes of data generated by end devices, using the freshest available information. In this paper, we address the distributed optimization of multi-user long-term average Age-of-Information (AoI) objectives in edge networks that use NOMA transmission. This poses a challenge of non-convex online optimization, which in existing work often requires either decision making in a combinatorial space or a global view of entire network states. To overcome this challenge, we propose a reinforcement learning-based framework that adopts a novel hierarchical decomposition of decision making. Specifically, we propose three different types of distributed agents to learn with respect to efficiency of AoI scheduling, fairness of AoI scheduling, as well as a high-level policy balancing these potentially conflicting design objectives. Not only does the proposed decomposition improve learning performance due to disentanglement of different design objectives/rewards, but it also enables the algorithm to learn the best policy while also learning the explanations, as actions can be directly compared in terms of the design objectives. Our evaluations show that the proposed algorithm improves the long-term average AoI by 200%-300% and 400% compared to prior works with NOMA and the optimal solution without NOMA, respectively.
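To make the abstract's decomposition concrete, the following is a minimal illustrative sketch, not the paper's actual formulation: it assumes the standard slotted AoI dynamics (age resets on a delivered update, increments otherwise) and invents simple efficiency and fairness reward signals of the kind the three agent types could learn against, with a weight standing in for the high-level balancing policy. All function names and reward forms here are hypothetical.

```python
def step_aoi(aoi, delivered):
    """Advance per-user AoI one slot: reset to 1 for users whose update
    was delivered this slot, increment by 1 for everyone else."""
    return [1 if d else a + 1 for a, d in zip(aoi, delivered)]

def efficiency_reward(aoi):
    """Efficiency signal: lower mean AoI across users -> higher reward."""
    return -sum(aoi) / len(aoi)

def fairness_reward(aoi):
    """Fairness signal: penalize the worst-off user's AoI."""
    return -max(aoi)

def combined_reward(aoi, weight):
    """Stand-in for the high-level policy: a convex combination of the
    two (potentially conflicting) objectives."""
    return weight * efficiency_reward(aoi) + (1 - weight) * fairness_reward(aoi)

# Example: 3 users; NOMA allows several concurrent transmissions to be
# decoded in one slot, so `delivered` may contain multiple successes.
aoi = step_aoi([4, 2, 7], [True, False, True])
print(aoi)                           # -> [1, 3, 1]
print(combined_reward(aoi, 0.5))     # -> -2.333...
```

Because both reward functions score the same post-slot AoI vector, any two candidate scheduling actions can be compared per objective, which is the property the abstract credits for making the learned policy explainable.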
Source Journal
IEEE Transactions on Mobile Computing (Engineering & Technology: Telecommunications)
CiteScore: 12.90
Self-citation rate: 2.50%
Annual articles: 403
Review time: 6.6 months
Aims and scope: IEEE Transactions on Mobile Computing addresses key technical issues related to various aspects of mobile computing. This includes (a) architectures, (b) support services, (c) algorithm/protocol design and analysis, (d) mobile environments, (e) mobile communication systems, (f) applications, and (g) emerging technologies. Topics of interest span a wide range, covering aspects like mobile networks and hosts, mobility management, multimedia, operating system support, power management, online and mobile environments, security, scalability, reliability, and emerging technologies such as wearable computers, body area networks, and wireless sensor networks. The journal serves as a comprehensive platform for advancements in mobile computing research.