Wenshuai Liu, Yaru Fu, Yongna Guo, Fu Lee Wang, Wen Sun, Yan Zhang
{"title":"数字孪生网络的双时标同步和迁移:多代理深度强化学习方法","authors":"Wenshuai Liu, Yaru Fu, Yongna Guo, Fu Lee Wang, Wen Sun, Yan Zhang","doi":"arxiv-2409.01092","DOIUrl":null,"url":null,"abstract":"Digital twins (DTs) have emerged as a promising enabler for representing the\nreal-time states of physical worlds and realizing self-sustaining systems. In\npractice, DTs of physical devices, such as mobile users (MUs), are commonly\ndeployed in multi-access edge computing (MEC) networks for the sake of reducing\nlatency. To ensure the accuracy and fidelity of DTs, it is essential for MUs to\nregularly synchronize their status with their DTs. However, MU mobility\nintroduces significant challenges to DT synchronization. Firstly, MU mobility\ntriggers DT migration which could cause synchronization failures. Secondly, MUs\nrequire frequent synchronization with their DTs to ensure DT fidelity.\nNonetheless, DT migration among MEC servers, caused by MU mobility, may occur\ninfrequently. Accordingly, we propose a two-timescale DT synchronization and\nmigration framework with reliability consideration by establishing a non-convex\nstochastic problem to minimize the long-term average energy consumption of MUs.\nWe use Lyapunov theory to convert the reliability constraints and reformulate\nthe new problem as a partially observable Markov decision-making process\n(POMDP). Furthermore, we develop a heterogeneous agent proximal policy\noptimization with Beta distribution (Beta-HAPPO) method to solve it. Numerical\nresults show that our proposed Beta-HAPPO method achieves significant\nimprovements in energy savings when compared with other benchmarks.","PeriodicalId":501280,"journal":{"name":"arXiv - CS - Networking and Internet Architecture","volume":"70 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Two-Timescale Synchronization and Migration for Digital Twin Networks: A Multi-Agent Deep Reinforcement Learning Approach\",\"authors\":\"Wenshuai Liu, Yaru Fu, Yongna Guo, Fu Lee Wang, Wen Sun, Yan Zhang\",\"doi\":\"arxiv-2409.01092\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Digital twins (DTs) have emerged as a promising enabler for representing the\\nreal-time states of physical worlds and realizing self-sustaining systems. In\\npractice, DTs of physical devices, such as mobile users (MUs), are commonly\\ndeployed in multi-access edge computing (MEC) networks for the sake of reducing\\nlatency. To ensure the accuracy and fidelity of DTs, it is essential for MUs to\\nregularly synchronize their status with their DTs. However, MU mobility\\nintroduces significant challenges to DT synchronization. Firstly, MU mobility\\ntriggers DT migration which could cause synchronization failures. Secondly, MUs\\nrequire frequent synchronization with their DTs to ensure DT fidelity.\\nNonetheless, DT migration among MEC servers, caused by MU mobility, may occur\\ninfrequently. Accordingly, we propose a two-timescale DT synchronization and\\nmigration framework with reliability consideration by establishing a non-convex\\nstochastic problem to minimize the long-term average energy consumption of MUs.\\nWe use Lyapunov theory to convert the reliability constraints and reformulate\\nthe new problem as a partially observable Markov decision-making process\\n(POMDP). Furthermore, we develop a heterogeneous agent proximal policy\\noptimization with Beta distribution (Beta-HAPPO) method to solve it. Numerical\\nresults show that our proposed Beta-HAPPO method achieves significant\\nimprovements in energy savings when compared with other benchmarks.\",\"PeriodicalId\":501280,\"journal\":{\"name\":\"arXiv - CS - Networking and Internet Architecture\",\"volume\":\"70 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-09-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - CS - Networking and Internet Architecture\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2409.01092\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Networking and Internet Architecture","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.01092","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Two-Timescale Synchronization and Migration for Digital Twin Networks: A Multi-Agent Deep Reinforcement Learning Approach
Digital twins (DTs) have emerged as a promising enabler for representing the
real-time states of physical worlds and realizing self-sustaining systems. In
practice, DTs of physical devices, such as mobile users (MUs), are commonly
deployed in multi-access edge computing (MEC) networks for the sake of reducing
latency. To ensure the accuracy and fidelity of DTs, it is essential for MUs to
regularly synchronize their status with their DTs. However, MU mobility
introduces significant challenges to DT synchronization. Firstly, MU mobility
triggers DT migration which could cause synchronization failures. Secondly, MUs
require frequent synchronization with their DTs to ensure DT fidelity.
Nonetheless, DT migration among MEC servers, caused by MU mobility, may occur
infrequently. Accordingly, we propose a two-timescale DT synchronization and
migration framework with reliability consideration by establishing a non-convex
stochastic problem to minimize the long-term average energy consumption of MUs.
We use Lyapunov theory to convert the reliability constraints and reformulate
the new problem as a partially observable Markov decision-making process
(POMDP). Furthermore, we develop a heterogeneous agent proximal policy
optimization with Beta distribution (Beta-HAPPO) method to solve it. Numerical
results show that our proposed Beta-HAPPO method achieves significant
improvements in energy savings when compared with other benchmarks.