{"title":"Delay-Prioritized and Reliable Task Scheduling With Long-Term Load Balancing in Computing Power Networks","authors":"Renchao Xie;Li Feng;Qinqin Tang;Tao Huang;Zehui Xiong;Tianjiao Chen;Ran Zhang","doi":"10.1109/TSC.2024.3495500","DOIUrl":null,"url":null,"abstract":"In the era driven by big data and algorithms, the efficient collaboration of pervasive computing power is crucial for rapidly meeting computing demands and enhancing resource utilization. However, current mainstream end-edge-cloud collaboration faces challenges of computing isolation, adversely affecting resource efficiency and user experience. The Computing Power Network (CPN) is a novel architecture designed to sense and collaborate ubiquitous computing resources through networks. Nevertheless, the expansion of its scope and the integration of networks complicate task scheduling. To address this, we design a collaborative scheduling system that considers the joint selection of computing nodes and network links, aiming to reduce delay, enhance reliability, and ensure long-term load balance. First, we propose a delay-prioritized reliable scheduling policy based on a dual-priority mechanism for forwarding and computing. Second, we define the scheduling problem as a Constrained Markov Decision Process (CMDP) and introduce Lyapunov optimization to transform constraints into instantaneous optimizations, achieving a long-term balanced load of computing and network resources. Lastly, we employ an enhanced Deep Reinforcement Learning (DRL) approach to solve the problem. Performance evaluation demonstrates that compared to standard DRL, the proposed algorithm effectively reduces delay and improves reliability while maintaining long-term load balance, resulting in an overall performance improvement of 54.7%.","PeriodicalId":13255,"journal":{"name":"IEEE Transactions on Services Computing","volume":"17 6","pages":"3359-3372"},"PeriodicalIF":5.5000,"publicationDate":"2024-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Services Computing","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10748399/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
In the era driven by big data and algorithms, the efficient collaboration of pervasive computing power is crucial for rapidly meeting computing demands and enhancing resource utilization. However, current mainstream end-edge-cloud collaboration faces challenges of computing isolation, adversely affecting resource efficiency and user experience. The Computing Power Network (CPN) is a novel architecture designed to sense and collaborate ubiquitous computing resources through networks. Nevertheless, the expansion of its scope and the integration of networks complicate task scheduling. To address this, we design a collaborative scheduling system that considers the joint selection of computing nodes and network links, aiming to reduce delay, enhance reliability, and ensure long-term load balance. First, we propose a delay-prioritized reliable scheduling policy based on a dual-priority mechanism for forwarding and computing. Second, we define the scheduling problem as a Constrained Markov Decision Process (CMDP) and introduce Lyapunov optimization to transform constraints into instantaneous optimizations, achieving a long-term balanced load of computing and network resources. Lastly, we employ an enhanced Deep Reinforcement Learning (DRL) approach to solve the problem. Performance evaluation demonstrates that compared to standard DRL, the proposed algorithm effectively reduces delay and improves reliability while maintaining long-term load balance, resulting in an overall performance improvement of 54.7%.
期刊介绍:
IEEE Transactions on Services Computing encompasses the computing and software aspects of the science and technology of services innovation research and development. It places emphasis on algorithmic, mathematical, statistical, and computational methods central to services computing. Topics covered include Service Oriented Architecture, Web Services, Business Process Integration, Solution Performance Management, and Services Operations and Management. The transactions address mathematical foundations, security, privacy, agreement, contract, discovery, negotiation, collaboration, and quality of service for web services. It also covers areas like composite web service creation, business and scientific applications, standards, utility models, business process modeling, integration, collaboration, and more in the realm of Services Computing.