Hybrid deep reinforcement learning-based workload migrating and resource allocation policies for weighted cost minimization in edge collaboration networks

IF 6.2 2区 计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS
Hongchang Ke , Jia Zhao , Yan Ding , Lin Pan
{"title":"Hybrid deep reinforcement learning-based workload migrating and resource allocation policies for weighted cost minimization in edge collaboration networks","authors":"Hongchang Ke ,&nbsp;Jia Zhao ,&nbsp;Yan Ding ,&nbsp;Lin Pan","doi":"10.1016/j.future.2025.108002","DOIUrl":null,"url":null,"abstract":"<div><div>In the context of efficient collaboration on 5G heterogeneous networks, computation and communication, mobile edge computing (MEC) and cloud–edge collaboration encounter several issues. These include multi-agent cooperation in deep reinforcement learning (DRL) for multi-task processing, the efficiency of action decisions by mobile edge computing server (MECS), and the ineffectiveness of traditional DRL algorithms in resource allocation. To address these challenges, a framework consisting of multiple wireless mobile terminals (WMTs) enabled by unmanned aerial vehicle (UAV) and multiple MEC servers is constructed, considering diversity and priorities of the workload generated by WMTs. Furthermore, to optimize edge collaborative workload offloading, migrating, and resource allocation decisions for minimizing the weighted cost of workload processing, we propose a hybrid DRL approach combining the dueling double deep Q-network (D3QN) and deep deterministic policy gradient (DDPG) algorithms, named OMRA-DRL. Regarding the proposed OMRA-DRL, a K-Means-based clustering algorithm groups similar workloads to simplify optimization. Additionally, a mixture-of-expert (MoE) system enables efficient action selection. Along with D3QN for better MECS selection, the maximum advantage selection strategy of the advantage function (MMSS-D3QN) is formulated to migrate workload clusters to multiple MECSs, achieving multi-edge cooperation. Comprehensive simulation experiments prove the convergence of OMRA-DRL under various parameters. Moreover, it outperforms the five benchmark algorithms in terms of average cumulative reward and unfinished ratio of workloads, with an increase of over 15% in average cumulative reward and a decrease of about 5% in unfinished rate of workloads, demonstrating its effectiveness in achieving weighted cost minimization in edge collaborative networks.</div></div>","PeriodicalId":55132,"journal":{"name":"Future Generation Computer Systems-The International Journal of Escience","volume":"174 ","pages":"Article 108002"},"PeriodicalIF":6.2000,"publicationDate":"2025-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Future Generation Computer Systems-The International Journal of Escience","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0167739X25002973","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, THEORY & METHODS","Score":null,"Total":0}
引用次数: 0

Abstract

In the context of efficient collaboration on 5G heterogeneous networks, computation and communication, mobile edge computing (MEC) and cloud–edge collaboration encounter several issues. These include multi-agent cooperation in deep reinforcement learning (DRL) for multi-task processing, the efficiency of action decisions by mobile edge computing server (MECS), and the ineffectiveness of traditional DRL algorithms in resource allocation. To address these challenges, a framework consisting of multiple wireless mobile terminals (WMTs) enabled by unmanned aerial vehicle (UAV) and multiple MEC servers is constructed, considering diversity and priorities of the workload generated by WMTs. Furthermore, to optimize edge collaborative workload offloading, migrating, and resource allocation decisions for minimizing the weighted cost of workload processing, we propose a hybrid DRL approach combining the dueling double deep Q-network (D3QN) and deep deterministic policy gradient (DDPG) algorithms, named OMRA-DRL. Regarding the proposed OMRA-DRL, a K-Means-based clustering algorithm groups similar workloads to simplify optimization. Additionally, a mixture-of-expert (MoE) system enables efficient action selection. Along with D3QN for better MECS selection, the maximum advantage selection strategy of the advantage function (MMSS-D3QN) is formulated to migrate workload clusters to multiple MECSs, achieving multi-edge cooperation. Comprehensive simulation experiments prove the convergence of OMRA-DRL under various parameters. Moreover, it outperforms the five benchmark algorithms in terms of average cumulative reward and unfinished ratio of workloads, with an increase of over 15% in average cumulative reward and a decrease of about 5% in unfinished rate of workloads, demonstrating its effectiveness in achieving weighted cost minimization in edge collaborative networks.

Abstract Image

边缘协作网络中基于混合深度强化学习的加权成本最小化工作负载迁移和资源分配策略
在5G异构网络高效协作的背景下,计算与通信、移动边缘计算(MEC)和云边缘协作遇到了几个问题。其中包括用于多任务处理的深度强化学习(DRL)中的多智能体合作,移动边缘计算服务器(MECS)的行动决策效率,以及传统DRL算法在资源分配方面的有效性。为了应对这些挑战,考虑到wmt产生的工作负载的多样性和优先级,构建了一个由无人机(UAV)支持的多个无线移动终端(wmt)和多个MEC服务器组成的框架。此外,为了优化边缘协作工作负载卸载、迁移和资源分配决策,以最小化工作负载处理的加权成本,我们提出了一种混合DRL方法,将双重深度q -网络(D3QN)和深度确定性策略梯度(DDPG)算法相结合,称为OMRA-DRL。对于所提出的OMRA-DRL,基于k - means的聚类算法将相似的工作负载分组以简化优化。此外,混合专家(MoE)系统可实现高效的动作选择。在D3QN优化MECS选择的同时,制定了优势函数的最大优势选择策略(MMSS-D3QN),将工作负载集群迁移到多个MECS,实现多边缘协作。综合仿真实验证明了OMRA-DRL在各种参数下的收敛性。在平均累计奖励和工作负载未完成率方面优于5种基准算法,平均累计奖励提高15%以上,工作负载未完成率降低约5%,证明了其在边缘协作网络中实现加权成本最小化的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
19.90
自引率
2.70%
发文量
376
审稿时长
10.6 months
期刊介绍: Computing infrastructures and systems are constantly evolving, resulting in increasingly complex and collaborative scientific applications. To cope with these advancements, there is a growing need for collaborative tools that can effectively map, control, and execute these applications. Furthermore, with the explosion of Big Data, there is a requirement for innovative methods and infrastructures to collect, analyze, and derive meaningful insights from the vast amount of data generated. This necessitates the integration of computational and storage capabilities, databases, sensors, and human collaboration. Future Generation Computer Systems aims to pioneer advancements in distributed systems, collaborative environments, high-performance computing, and Big Data analytics. It strives to stay at the forefront of developments in grids, clouds, and the Internet of Things (IoT) to effectively address the challenges posed by these wide-area, fully distributed sensing and computing systems.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信