{"title":"Multigranularity Interleaved Reconfigurable Edge Data Center Network Architecture for Accelerated GAI Jobs","authors":"Yun Teng;Hui Yang;Qiuyan Yao;Wenlong Cheng;Miao Hao;Jie Zhang","doi":"10.1109/JIOT.2025.3531385","DOIUrl":null,"url":null,"abstract":"The network has become a bottleneck for generative artificial intelligence (GAI) jobs. Accelerating GAI jobs in edge data centers using hybrid electrical/optical switch is considered a promising solution. This architecture optimizes bandwidth utilization by enabling demand-aware topology reconfiguration through flexible configuration of optical circuit switche optical circuit switches (OCS). However, frequent topology reconfiguration may increase latency. Therefore, there is a balanced relationship between latency and bandwidth utilization. In this article, we propose a multigranularity adaptive interleaved algorithm for service scheduling in edge data centers. First, different degrees of time slot shifts are introduced based on the latency sensitivity of jobs, where large bandwidth GAI jobs are transmitted in a single hop by configuring a demand-aware topology. Additionally, when the reconfiguration threshold is met, low-priority ports are prioritized for reconfiguration to ensure latency requirements are met. This approach effectively resolves the tradeoff between bandwidth utilization and latency by decoupling them from each other. Simulation results show that this approach can effectively reduce the latency and improve the network throughput.","PeriodicalId":54347,"journal":{"name":"IEEE Internet of Things Journal","volume":"12 10","pages":"13222-13232"},"PeriodicalIF":8.9000,"publicationDate":"2025-02-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Internet of Things Journal","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10879283/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
The network has become a bottleneck for generative artificial intelligence (GAI) jobs. Accelerating GAI jobs in edge data centers using hybrid electrical/optical switch is considered a promising solution. This architecture optimizes bandwidth utilization by enabling demand-aware topology reconfiguration through flexible configuration of optical circuit switche optical circuit switches (OCS). However, frequent topology reconfiguration may increase latency. Therefore, there is a balanced relationship between latency and bandwidth utilization. In this article, we propose a multigranularity adaptive interleaved algorithm for service scheduling in edge data centers. First, different degrees of time slot shifts are introduced based on the latency sensitivity of jobs, where large bandwidth GAI jobs are transmitted in a single hop by configuring a demand-aware topology. Additionally, when the reconfiguration threshold is met, low-priority ports are prioritized for reconfiguration to ensure latency requirements are met. This approach effectively resolves the tradeoff between bandwidth utilization and latency by decoupling them from each other. Simulation results show that this approach can effectively reduce the latency and improve the network throughput.
期刊介绍:
The EEE Internet of Things (IoT) Journal publishes articles and review articles covering various aspects of IoT, including IoT system architecture, IoT enabling technologies, IoT communication and networking protocols such as network coding, and IoT services and applications. Topics encompass IoT's impacts on sensor technologies, big data management, and future internet design for applications like smart cities and smart homes. Fields of interest include IoT architecture such as things-centric, data-centric, service-oriented IoT architecture; IoT enabling technologies and systematic integration such as sensor technologies, big sensor data management, and future Internet design for IoT; IoT services, applications, and test-beds such as IoT service middleware, IoT application programming interface (API), IoT application design, and IoT trials/experiments; IoT standardization activities and technology development in different standard development organizations (SDO) such as IEEE, IETF, ITU, 3GPP, ETSI, etc.