Lei Jin, Junyan Chen, Rui Yao, Jiahao Chen, Xinmei Li
{"title":"SEROS: Shared exploration and reward optimization task scheduling strategy for multi-agent collaboration in edge computing networks","authors":"Lei Jin, Junyan Chen, Rui Yao, Jiahao Chen, Xinmei Li","doi":"10.1016/j.adhoc.2025.103992","DOIUrl":null,"url":null,"abstract":"<div><div>The integration of mobile edge computing (MEC) into dynamic wireless ad hoc networks has intensified the challenges of computational resource scheduling, particularly under queue load constraints. Current research primarily focuses on global average metrics while neglecting fairness in resource allocation among devices across cross-time slot task scenarios. This oversight leads to significant disparities in the resources allocated to different devices, with some devices consistently lacking computational resources due to imbalanced scheduling. To address these limitations, we propose SEROS (Shared Exploration and Reward Optimization Strategy), a multi-agent reinforcement learning framework designed for cross-timeslot task scheduling in MEC environments. The method dynamically balances local optimization objectives with global collaboration through a weighted shared reward mechanism while enhancing training efficiency via hybrid sample trajectory utilization, enabling adaptive task offloading decisions. First, we construct a mobile edge computing model incorporating queue load constraints to address cross-timeslot task scheduling challenges, improving resource utilization for time-sensitive workloads through delayed optimization objectives. Second, we design a collaborative incentive mechanism based on global–local reward balancing and develop a sample trajectory-sharing scheme to accelerate policy convergence while preserving agent specialization. Simulation experiments validate the effectiveness of SEROS, demonstrating that compared with baseline methods, the proposed approach exhibits superior comprehensive performance in task completion rate improved by 7% and inter-device completion rate concentration enhanced by 40%, along with stability and task completion time.</div></div>","PeriodicalId":55555,"journal":{"name":"Ad Hoc Networks","volume":"179 ","pages":"Article 103992"},"PeriodicalIF":4.8000,"publicationDate":"2025-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ad Hoc Networks","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1570870525002409","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
The integration of mobile edge computing (MEC) into dynamic wireless ad hoc networks has intensified the challenges of computational resource scheduling, particularly under queue load constraints. Current research primarily focuses on global average metrics while neglecting fairness in resource allocation among devices across cross-time slot task scenarios. This oversight leads to significant disparities in the resources allocated to different devices, with some devices consistently lacking computational resources due to imbalanced scheduling. To address these limitations, we propose SEROS (Shared Exploration and Reward Optimization Strategy), a multi-agent reinforcement learning framework designed for cross-timeslot task scheduling in MEC environments. The method dynamically balances local optimization objectives with global collaboration through a weighted shared reward mechanism while enhancing training efficiency via hybrid sample trajectory utilization, enabling adaptive task offloading decisions. First, we construct a mobile edge computing model incorporating queue load constraints to address cross-timeslot task scheduling challenges, improving resource utilization for time-sensitive workloads through delayed optimization objectives. Second, we design a collaborative incentive mechanism based on global–local reward balancing and develop a sample trajectory-sharing scheme to accelerate policy convergence while preserving agent specialization. Simulation experiments validate the effectiveness of SEROS, demonstrating that compared with baseline methods, the proposed approach exhibits superior comprehensive performance in task completion rate improved by 7% and inter-device completion rate concentration enhanced by 40%, along with stability and task completion time.
期刊介绍:
The Ad Hoc Networks is an international and archival journal providing a publication vehicle for complete coverage of all topics of interest to those involved in ad hoc and sensor networking areas. The Ad Hoc Networks considers original, high quality and unpublished contributions addressing all aspects of ad hoc and sensor networks. Specific areas of interest include, but are not limited to:
Mobile and Wireless Ad Hoc Networks
Sensor Networks
Wireless Local and Personal Area Networks
Home Networks
Ad Hoc Networks of Autonomous Intelligent Systems
Novel Architectures for Ad Hoc and Sensor Networks
Self-organizing Network Architectures and Protocols
Transport Layer Protocols
Routing protocols (unicast, multicast, geocast, etc.)
Media Access Control Techniques
Error Control Schemes
Power-Aware, Low-Power and Energy-Efficient Designs
Synchronization and Scheduling Issues
Mobility Management
Mobility-Tolerant Communication Protocols
Location Tracking and Location-based Services
Resource and Information Management
Security and Fault-Tolerance Issues
Hardware and Software Platforms, Systems, and Testbeds
Experimental and Prototype Results
Quality-of-Service Issues
Cross-Layer Interactions
Scalability Issues
Performance Analysis and Simulation of Protocols.