{"title":"Deep Reinforcement Learning-Based Contract Incentive and Computation Offloading for Mobile Edge Computing-Enabled Blockchain","authors":"Wenjie Zhang;Yi Liu;Hong Zhao;Chai Kiat Yeo","doi":"10.1109/TNSM.2025.3592979","DOIUrl":null,"url":null,"abstract":"The resolution of proof-of-work problem in blockchain requires significant amount of resources, while the lack of computing power on mobile devices limits the development of blockchain in mobile applications. To mitigate this issue, the combination of blockchain and mobile edge computing (MEC) has attracted much attention. In this paper, we consider an edge-enabled blockchain system that includes one single edge service provider (ESP), multiple types of miners and edge nodes. Each miner submits offloading request to ESP. In response, ESP designs contract to incentivize various types of edge nodes to contribute resources and offer computational services to the miners. This problem is a joint optimization problem of offloading decisions and contract design. Due to the time-variability of network environment, the randomness of miners’ task demands, and the asymmetric information between the ESP and edge nodes, solving this problem is challenging. We propose a deep reinforcement learning contract mechanism (DRLCM) for incentive-based computation offloading strategies, which divides the original problem into two sub-problems: computation offloading and contract design. Initially, the deep Q-network (DQN) algorithm is used to update the offloading decisions based on the evolving task demands and network conditions. Secondly, contract is designed to motivate edge nodes to participate in resource sharing. The problem is simplified by analyzing the necessary and sufficient conditions of feasible contract, and the Lagrange multiplier method is used to approximate the optimal contract. Simulation experiments demonstrate the effectiveness of the DRLCM algorithm, which shows better convergence and performance compared to traditional DQN, Double-DQN algorithms and Dueling-DQN.","PeriodicalId":13423,"journal":{"name":"IEEE Transactions on Network and Service Management","volume":"22 5","pages":"5137-5151"},"PeriodicalIF":5.4000,"publicationDate":"2025-07-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Network and Service Management","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/11098502/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
The resolution of proof-of-work problem in blockchain requires significant amount of resources, while the lack of computing power on mobile devices limits the development of blockchain in mobile applications. To mitigate this issue, the combination of blockchain and mobile edge computing (MEC) has attracted much attention. In this paper, we consider an edge-enabled blockchain system that includes one single edge service provider (ESP), multiple types of miners and edge nodes. Each miner submits offloading request to ESP. In response, ESP designs contract to incentivize various types of edge nodes to contribute resources and offer computational services to the miners. This problem is a joint optimization problem of offloading decisions and contract design. Due to the time-variability of network environment, the randomness of miners’ task demands, and the asymmetric information between the ESP and edge nodes, solving this problem is challenging. We propose a deep reinforcement learning contract mechanism (DRLCM) for incentive-based computation offloading strategies, which divides the original problem into two sub-problems: computation offloading and contract design. Initially, the deep Q-network (DQN) algorithm is used to update the offloading decisions based on the evolving task demands and network conditions. Secondly, contract is designed to motivate edge nodes to participate in resource sharing. The problem is simplified by analyzing the necessary and sufficient conditions of feasible contract, and the Lagrange multiplier method is used to approximate the optimal contract. Simulation experiments demonstrate the effectiveness of the DRLCM algorithm, which shows better convergence and performance compared to traditional DQN, Double-DQN algorithms and Dueling-DQN.
期刊介绍:
IEEE Transactions on Network and Service Management will publish (online only) peerreviewed archival quality papers that advance the state-of-the-art and practical applications of network and service management. Theoretical research contributions (presenting new concepts and techniques) and applied contributions (reporting on experiences and experiments with actual systems) will be encouraged. These transactions will focus on the key technical issues related to: Management Models, Architectures and Frameworks; Service Provisioning, Reliability and Quality Assurance; Management Functions; Enabling Technologies; Information and Communication Models; Policies; Applications and Case Studies; Emerging Technologies and Standards.