Jan De Neve;Ziyue Zhang;Wouter Tavernier;Didier Colle;Mario Pickavet
{"title":"Edge coloring bipartite multigraphs for dynamically configuring optical switches","authors":"Jan De Neve;Ziyue Zhang;Wouter Tavernier;Didier Colle;Mario Pickavet","doi":"10.1364/JOCN.559454","DOIUrl":null,"url":null,"abstract":"Multi-chip graphics processing units (GPUs) interconnected by a photonic network-on-wafer are a promising technology to further increase the performance of GPUs. The network control algorithm managing dynamic bandwidth allocation (DBA) in this network needs to execute very frequently so that resources can be optimally used. This algorithm relies on edge coloring bipartite multigraphs to translate inter-chip bandwidth demands into updated routing tables for the GPU chips and optical switches in the network. In this work, we design fast edge coloring algorithms, both approximate and exact, for bipartite multigraphs. These algorithms are tailored to the high edge multiplicities of the multigraphs in this research. The runtimes are optimized by using efficient data structures and introducing pre- and post-processing. These new algorithms are up to <tex>${20} \\times$</tex> faster than the state-of-the-art baseline algorithm. New simulations show that, with such low reconfiguration periods, DBA has the potential to double the performance of a high-traffic GPU workload compared to a static network with the same bandwidth.","PeriodicalId":50103,"journal":{"name":"Journal of Optical Communications and Networking","volume":"17 8","pages":"720-731"},"PeriodicalIF":4.3000,"publicationDate":"2025-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Optical Communications and Networking","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/11095305/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}
引用次数: 0
Abstract
Multi-chip graphics processing units (GPUs) interconnected by a photonic network-on-wafer are a promising technology to further increase the performance of GPUs. The network control algorithm managing dynamic bandwidth allocation (DBA) in this network needs to execute very frequently so that resources can be optimally used. This algorithm relies on edge coloring bipartite multigraphs to translate inter-chip bandwidth demands into updated routing tables for the GPU chips and optical switches in the network. In this work, we design fast edge coloring algorithms, both approximate and exact, for bipartite multigraphs. These algorithms are tailored to the high edge multiplicities of the multigraphs in this research. The runtimes are optimized by using efficient data structures and introducing pre- and post-processing. These new algorithms are up to ${20} \times$ faster than the state-of-the-art baseline algorithm. New simulations show that, with such low reconfiguration periods, DBA has the potential to double the performance of a high-traffic GPU workload compared to a static network with the same bandwidth.
期刊介绍:
The scope of the Journal includes advances in the state-of-the-art of optical networking science, technology, and engineering. Both theoretical contributions (including new techniques, concepts, analyses, and economic studies) and practical contributions (including optical networking experiments, prototypes, and new applications) are encouraged. Subareas of interest include the architecture and design of optical networks, optical network survivability and security, software-defined optical networking, elastic optical networks, data and control plane advances, network management related innovation, and optical access networks. Enabling technologies and their applications are suitable topics only if the results are shown to directly impact optical networking beyond simple point-to-point networks.