{"title":"Joint Request Offloading and Resource Allocation for Long-Term Utility Optimization in Collaborative Edge Inference With Time-Coupled Resources","authors":"Jiale Huang;Jigang Wu;Yalan Wu;Jiaxin Wu","doi":"10.1109/TNSE.2025.3551148","DOIUrl":null,"url":null,"abstract":"Extensive research on edge inference has devoted in optimizing service performance for users. However, recent studies have overlooked the desired utility of application service provider (ASP), which is crucial for achieving long-term service provisioning. Besides, efficient request offloading and resource allocation are essential for optimizing long-term utility of ASP in dynamic networks with time-coupled resources. To address these issues, this paper formulates a long-term utility optimization problem in collaborative edge inference system. The objective is to maximize the long-term average utility of ASP, by jointly optimizing request offloading and resource allocation, under the time-coupled resource constraints. To solve the problem, a Lyapunov based online algorithm is proposed to decompose it into a series of one-slot deterministic problems by decoupling the time-coupled resource constraints. Only the current network states are required for one-slot problem. Then, the one-slot problem is converted into a master request offloading problem with an inner resource allocation problem. A distributed algorithm is proposed to derive the optimal decision to inner problem, while a coalition based algorithm is proposed to seek the stable solution to master problem. Experimental results show that, the proposed algorithm outperforms baseline algorithms for most cases, in terms of long-term average utility of ASP.","PeriodicalId":54229,"journal":{"name":"IEEE Transactions on Network Science and Engineering","volume":"12 4","pages":"2622-2639"},"PeriodicalIF":7.9000,"publicationDate":"2025-03-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Network Science and Engineering","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10924670/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
Extensive research on edge inference has devoted in optimizing service performance for users. However, recent studies have overlooked the desired utility of application service provider (ASP), which is crucial for achieving long-term service provisioning. Besides, efficient request offloading and resource allocation are essential for optimizing long-term utility of ASP in dynamic networks with time-coupled resources. To address these issues, this paper formulates a long-term utility optimization problem in collaborative edge inference system. The objective is to maximize the long-term average utility of ASP, by jointly optimizing request offloading and resource allocation, under the time-coupled resource constraints. To solve the problem, a Lyapunov based online algorithm is proposed to decompose it into a series of one-slot deterministic problems by decoupling the time-coupled resource constraints. Only the current network states are required for one-slot problem. Then, the one-slot problem is converted into a master request offloading problem with an inner resource allocation problem. A distributed algorithm is proposed to derive the optimal decision to inner problem, while a coalition based algorithm is proposed to seek the stable solution to master problem. Experimental results show that, the proposed algorithm outperforms baseline algorithms for most cases, in terms of long-term average utility of ASP.
期刊介绍:
The proposed journal, called the IEEE Transactions on Network Science and Engineering (TNSE), is committed to timely publishing of peer-reviewed technical articles that deal with the theory and applications of network science and the interconnections among the elements in a system that form a network. In particular, the IEEE Transactions on Network Science and Engineering publishes articles on understanding, prediction, and control of structures and behaviors of networks at the fundamental level. The types of networks covered include physical or engineered networks, information networks, biological networks, semantic networks, economic networks, social networks, and ecological networks. Aimed at discovering common principles that govern network structures, network functionalities and behaviors of networks, the journal seeks articles on understanding, prediction, and control of structures and behaviors of networks. Another trans-disciplinary focus of the IEEE Transactions on Network Science and Engineering is the interactions between and co-evolution of different genres of networks.