Hybrid deep reinforcement learning-based workload migrating and resource allocation policies for weighted cost minimization in edge collaboration networks

IF 6.2 2区计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS

Future Generation Computer Systems-The International Journal of Escience Pub Date : 2025-07-23 DOI:10.1016/j.future.2025.108002

Hongchang Ke , Jia Zhao , Yan Ding , Lin Pan

{"title":"Hybrid deep reinforcement learning-based workload migrating and resource allocation policies for weighted cost minimization in edge collaboration networks","authors":"Hongchang Ke , Jia Zhao , Yan Ding , Lin Pan","doi":"10.1016/j.future.2025.108002","DOIUrl":null,"url":null,"abstract":"<div><div>In the context of efficient collaboration on 5G heterogeneous networks, computation and communication, mobile edge computing (MEC) and cloud–edge collaboration encounter several issues. These include multi-agent cooperation in deep reinforcement learning (DRL) for multi-task processing, the efficiency of action decisions by mobile edge computing server (MECS), and the ineffectiveness of traditional DRL algorithms in resource allocation. To address these challenges, a framework consisting of multiple wireless mobile terminals (WMTs) enabled by unmanned aerial vehicle (UAV) and multiple MEC servers is constructed, considering diversity and priorities of the workload generated by WMTs. Furthermore, to optimize edge collaborative workload offloading, migrating, and resource allocation decisions for minimizing the weighted cost of workload processing, we propose a hybrid DRL approach combining the dueling double deep Q-network (D3QN) and deep deterministic policy gradient (DDPG) algorithms, named OMRA-DRL. Regarding the proposed OMRA-DRL, a K-Means-based clustering algorithm groups similar workloads to simplify optimization. Additionally, a mixture-of-expert (MoE) system enables efficient action selection. Along with D3QN for better MECS selection, the maximum advantage selection strategy of the advantage function (MMSS-D3QN) is formulated to migrate workload clusters to multiple MECSs, achieving multi-edge cooperation. Comprehensive simulation experiments prove the convergence of OMRA-DRL under various parameters. Moreover, it outperforms the five benchmark algorithms in terms of average cumulative reward and unfinished ratio of workloads, with an increase of over 15% in average cumulative reward and a decrease of about 5% in unfinished rate of workloads, demonstrating its effectiveness in achieving weighted cost minimization in edge collaborative networks.</div></div>","PeriodicalId":55132,"journal":{"name":"Future Generation Computer Systems-The International Journal of Escience","volume":"174 ","pages":"Article 108002"},"PeriodicalIF":6.2000,"publicationDate":"2025-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Future Generation Computer Systems-The International Journal of Escience","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0167739X25002973","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, THEORY & METHODS","Score":null,"Total":0}

引用次数: 0

Abstract

In the context of efficient collaboration on 5G heterogeneous networks, computation and communication, mobile edge computing (MEC) and cloud–edge collaboration encounter several issues. These include multi-agent cooperation in deep reinforcement learning (DRL) for multi-task processing, the efficiency of action decisions by mobile edge computing server (MECS), and the ineffectiveness of traditional DRL algorithms in resource allocation. To address these challenges, a framework consisting of multiple wireless mobile terminals (WMTs) enabled by unmanned aerial vehicle (UAV) and multiple MEC servers is constructed, considering diversity and priorities of the workload generated by WMTs. Furthermore, to optimize edge collaborative workload offloading, migrating, and resource allocation decisions for minimizing the weighted cost of workload processing, we propose a hybrid DRL approach combining the dueling double deep Q-network (D3QN) and deep deterministic policy gradient (DDPG) algorithms, named OMRA-DRL. Regarding the proposed OMRA-DRL, a K-Means-based clustering algorithm groups similar workloads to simplify optimization. Additionally, a mixture-of-expert (MoE) system enables efficient action selection. Along with D3QN for better MECS selection, the maximum advantage selection strategy of the advantage function (MMSS-D3QN) is formulated to migrate workload clusters to multiple MECSs, achieving multi-edge cooperation. Comprehensive simulation experiments prove the convergence of OMRA-DRL under various parameters. Moreover, it outperforms the five benchmark algorithms in terms of average cumulative reward and unfinished ratio of workloads, with an increase of over 15% in average cumulative reward and a decrease of about 5% in unfinished rate of workloads, demonstrating its effectiveness in achieving weighted cost minimization in edge collaborative networks.

Abstract Image

查看原文本刊更多论文

边缘协作网络中基于混合深度强化学习的加权成本最小化工作负载迁移和资源分配策略

在5G异构网络高效协作的背景下，计算与通信、移动边缘计算（MEC）和云边缘协作遇到了几个问题。其中包括用于多任务处理的深度强化学习（DRL）中的多智能体合作，移动边缘计算服务器（MECS）的行动决策效率，以及传统DRL算法在资源分配方面的有效性。为了应对这些挑战，考虑到wmt产生的工作负载的多样性和优先级，构建了一个由无人机（UAV）支持的多个无线移动终端（wmt）和多个MEC服务器组成的框架。此外，为了优化边缘协作工作负载卸载、迁移和资源分配决策，以最小化工作负载处理的加权成本，我们提出了一种混合DRL方法，将双重深度q -网络（D3QN）和深度确定性策略梯度（DDPG）算法相结合，称为OMRA-DRL。对于所提出的OMRA-DRL，基于k - means的聚类算法将相似的工作负载分组以简化优化。此外，混合专家（MoE）系统可实现高效的动作选择。在D3QN优化MECS选择的同时，制定了优势函数的最大优势选择策略（MMSS-D3QN），将工作负载集群迁移到多个MECS，实现多边缘协作。综合仿真实验证明了OMRA-DRL在各种参数下的收敛性。在平均累计奖励和工作负载未完成率方面优于5种基准算法，平均累计奖励提高15%以上，工作负载未完成率降低约5%，证明了其在边缘协作网络中实现加权成本最小化的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Future Generation Computer Systems-The International Journal of Escience 工程技术-计算机：理论方法

CiteScore

19.90

自引率

2.70%

发文量

376

审稿时长

10.6 months

期刊介绍： Computing infrastructures and systems are constantly evolving, resulting in increasingly complex and collaborative scientific applications. To cope with these advancements, there is a growing need for collaborative tools that can effectively map, control, and execute these applications. Furthermore, with the explosion of Big Data, there is a requirement for innovative methods and infrastructures to collect, analyze, and derive meaningful insights from the vast amount of data generated. This necessitates the integration of computational and storage capabilities, databases, sensors, and human collaboration. Future Generation Computer Systems aims to pioneer advancements in distributed systems, collaborative environments, high-performance computing, and Big Data analytics. It strives to stay at the forefront of developments in grids, clouds, and the Internet of Things (IoT) to effectively address the challenges posed by these wide-area, fully distributed sensing and computing systems.