{"title":"SLO-Aware Scheduling Deep Learning Inference for Digital Twin-Enabled Serverless Edge","authors":"Mingyuan Ding;Guangping Xu;Kang Liu","doi":"10.1109/JIOT.2025.3565576","DOIUrl":null,"url":null,"abstract":"The digital twin (DT) technique employs virtual models to accurately represent physical entities and their dynamic behaviors, enabling applications in prediction, optimization, and real-time monitoring. When integrated with deep learning (DL) and offered as a service, DTs significantly enhance system intelligence and operational efficiency. However, resource constraints in edge environments pose significant challenges for GPU resource scheduling, particularly under concurrent execution of DLI tasks for DT services. To address these challenges, this article introduces an service level objective (SLO)-aware scheduling framework designed to optimize DL inference for DT-enabled intelligent edge systems, which combines accurate task processing latency prediction with a dynamic synchronization strategy. The proposed strategy optimizes GPU resource allocation to meet task-specific performance objectives, thereby improving system productivity and efficiency. Comprehensive evaluations show that our method significantly reduces SLO violations by 31.2%–90.1% and JCT by 16.2%–80.3% compared to baseline methods, demonstrating its effectiveness in resource-limited edge computing environments under high workload scenarios.","PeriodicalId":54347,"journal":{"name":"IEEE Internet of Things Journal","volume":"12 15","pages":"29252-29264"},"PeriodicalIF":8.9000,"publicationDate":"2025-04-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Internet of Things Journal","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10979987/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
The digital twin (DT) technique employs virtual models to accurately represent physical entities and their dynamic behaviors, enabling applications in prediction, optimization, and real-time monitoring. When integrated with deep learning (DL) and offered as a service, DTs significantly enhance system intelligence and operational efficiency. However, resource constraints in edge environments pose significant challenges for GPU resource scheduling, particularly under concurrent execution of DLI tasks for DT services. To address these challenges, this article introduces an service level objective (SLO)-aware scheduling framework designed to optimize DL inference for DT-enabled intelligent edge systems, which combines accurate task processing latency prediction with a dynamic synchronization strategy. The proposed strategy optimizes GPU resource allocation to meet task-specific performance objectives, thereby improving system productivity and efficiency. Comprehensive evaluations show that our method significantly reduces SLO violations by 31.2%–90.1% and JCT by 16.2%–80.3% compared to baseline methods, demonstrating its effectiveness in resource-limited edge computing environments under high workload scenarios.
期刊介绍:
The EEE Internet of Things (IoT) Journal publishes articles and review articles covering various aspects of IoT, including IoT system architecture, IoT enabling technologies, IoT communication and networking protocols such as network coding, and IoT services and applications. Topics encompass IoT's impacts on sensor technologies, big data management, and future internet design for applications like smart cities and smart homes. Fields of interest include IoT architecture such as things-centric, data-centric, service-oriented IoT architecture; IoT enabling technologies and systematic integration such as sensor technologies, big sensor data management, and future Internet design for IoT; IoT services, applications, and test-beds such as IoT service middleware, IoT application programming interface (API), IoT application design, and IoT trials/experiments; IoT standardization activities and technology development in different standard development organizations (SDO) such as IEEE, IETF, ITU, 3GPP, ETSI, etc.