数字孪生网络的双时标同步和迁移:多代理深度强化学习方法

Wenshuai Liu, Yaru Fu, Yongna Guo, Fu Lee Wang, Wen Sun, Yan Zhang
{"title":"数字孪生网络的双时标同步和迁移:多代理深度强化学习方法","authors":"Wenshuai Liu, Yaru Fu, Yongna Guo, Fu Lee Wang, Wen Sun, Yan Zhang","doi":"arxiv-2409.01092","DOIUrl":null,"url":null,"abstract":"Digital twins (DTs) have emerged as a promising enabler for representing the\nreal-time states of physical worlds and realizing self-sustaining systems. In\npractice, DTs of physical devices, such as mobile users (MUs), are commonly\ndeployed in multi-access edge computing (MEC) networks for the sake of reducing\nlatency. To ensure the accuracy and fidelity of DTs, it is essential for MUs to\nregularly synchronize their status with their DTs. However, MU mobility\nintroduces significant challenges to DT synchronization. Firstly, MU mobility\ntriggers DT migration which could cause synchronization failures. Secondly, MUs\nrequire frequent synchronization with their DTs to ensure DT fidelity.\nNonetheless, DT migration among MEC servers, caused by MU mobility, may occur\ninfrequently. Accordingly, we propose a two-timescale DT synchronization and\nmigration framework with reliability consideration by establishing a non-convex\nstochastic problem to minimize the long-term average energy consumption of MUs.\nWe use Lyapunov theory to convert the reliability constraints and reformulate\nthe new problem as a partially observable Markov decision-making process\n(POMDP). Furthermore, we develop a heterogeneous agent proximal policy\noptimization with Beta distribution (Beta-HAPPO) method to solve it. Numerical\nresults show that our proposed Beta-HAPPO method achieves significant\nimprovements in energy savings when compared with other benchmarks.","PeriodicalId":501280,"journal":{"name":"arXiv - CS - Networking and Internet Architecture","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Two-Timescale Synchronization and Migration for Digital Twin Networks: A Multi-Agent Deep Reinforcement Learning Approach\",\"authors\":\"Wenshuai Liu, Yaru Fu, Yongna Guo, Fu Lee Wang, Wen Sun, Yan Zhang\",\"doi\":\"arxiv-2409.01092\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Digital twins (DTs) have emerged as a promising enabler for representing the\\nreal-time states of physical worlds and realizing self-sustaining systems. In\\npractice, DTs of physical devices, such as mobile users (MUs), are commonly\\ndeployed in multi-access edge computing (MEC) networks for the sake of reducing\\nlatency. To ensure the accuracy and fidelity of DTs, it is essential for MUs to\\nregularly synchronize their status with their DTs. However, MU mobility\\nintroduces significant challenges to DT synchronization. Firstly, MU mobility\\ntriggers DT migration which could cause synchronization failures. Secondly, MUs\\nrequire frequent synchronization with their DTs to ensure DT fidelity.\\nNonetheless, DT migration among MEC servers, caused by MU mobility, may occur\\ninfrequently. Accordingly, we propose a two-timescale DT synchronization and\\nmigration framework with reliability consideration by establishing a non-convex\\nstochastic problem to minimize the long-term average energy consumption of MUs.\\nWe use Lyapunov theory to convert the reliability constraints and reformulate\\nthe new problem as a partially observable Markov decision-making process\\n(POMDP). Furthermore, we develop a heterogeneous agent proximal policy\\noptimization with Beta distribution (Beta-HAPPO) method to solve it. Numerical\\nresults show that our proposed Beta-HAPPO method achieves significant\\nimprovements in energy savings when compared with other benchmarks.\",\"PeriodicalId\":501280,\"journal\":{\"name\":\"arXiv - CS - Networking and Internet Architecture\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-09-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - CS - Networking and Internet Architecture\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2409.01092\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Networking and Internet Architecture","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.01092","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

数字孪生(DTs)已成为表示物理世界实时状态和实现自持系统的一种前景广阔的工具。在实践中,移动用户(MU)等物理设备的数字孪生通常部署在多接入边缘计算(MEC)网络中,以降低延迟。为了确保 DT 的准确性和保真度,MU 必须定期将其状态与 DT 同步。然而,MU 的移动性给 DT 同步带来了巨大挑战。首先,MU 移动会引发 DT 迁移,从而导致同步失败。其次,MU 需要与其 DT 频繁同步,以确保 DT 的保真度。然而,由 MU 移动性引起的 MEC 服务器之间的 DT 迁移可能会频繁发生。因此,我们通过建立一个非凸随机问题来最小化 MU 的长期平均能耗,从而提出了一个考虑可靠性的双时标 DT 同步和迁移框架。我们使用 Lyapunov 理论来转换可靠性约束,并将新问题重新表述为部分可观测马尔可夫决策过程(POMDP)。此外,我们还开发了一种采用 Beta 分布的异构代理近端策略优化(Beta-HAPPO)方法来解决该问题。数值结果表明,与其他基准相比,我们提出的 Beta-HAPPO 方法显著提高了节能效果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Two-Timescale Synchronization and Migration for Digital Twin Networks: A Multi-Agent Deep Reinforcement Learning Approach
Digital twins (DTs) have emerged as a promising enabler for representing the real-time states of physical worlds and realizing self-sustaining systems. In practice, DTs of physical devices, such as mobile users (MUs), are commonly deployed in multi-access edge computing (MEC) networks for the sake of reducing latency. To ensure the accuracy and fidelity of DTs, it is essential for MUs to regularly synchronize their status with their DTs. However, MU mobility introduces significant challenges to DT synchronization. Firstly, MU mobility triggers DT migration which could cause synchronization failures. Secondly, MUs require frequent synchronization with their DTs to ensure DT fidelity. Nonetheless, DT migration among MEC servers, caused by MU mobility, may occur infrequently. Accordingly, we propose a two-timescale DT synchronization and migration framework with reliability consideration by establishing a non-convex stochastic problem to minimize the long-term average energy consumption of MUs. We use Lyapunov theory to convert the reliability constraints and reformulate the new problem as a partially observable Markov decision-making process (POMDP). Furthermore, we develop a heterogeneous agent proximal policy optimization with Beta distribution (Beta-HAPPO) method to solve it. Numerical results show that our proposed Beta-HAPPO method achieves significant improvements in energy savings when compared with other benchmarks.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信