{"title":"DIDA: Distributed In-Network Intelligent Data Plane for Machine Learning Applications","authors":"Giulio Sidoretti;Lorenzo Bracciale;Stefano Salsano;Hesham ElBakoury;Pierpaolo Loreti","doi":"10.1109/TNSM.2025.3548477","DOIUrl":null,"url":null,"abstract":"Recent advances in network switch designs have enabled machine learning inference directly within the switch at line speed. However, hardware constraints limit switches capabilities of tracking stateful features essential for accurate inference, as the demand for these features grows rapidly with line rates. To address this, we propose DIDA, a distributed in-network machine learning approach. In DIDA, feature extraction occurs at the host, features are transmitted via in-band telemetry, and inference is performed on the switches. In this paper, we evaluate the effectiveness and efficiency of this architecture. We examine its impact on network bandwidth, CPU and memory usage at the host, and its robustness across different feature sets and deep neural network classifications.","PeriodicalId":13423,"journal":{"name":"IEEE Transactions on Network and Service Management","volume":"22 3","pages":"2564-2579"},"PeriodicalIF":4.7000,"publicationDate":"2025-03-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Network and Service Management","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10936993/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Recent advances in network switch designs have enabled machine learning inference directly within the switch at line speed. However, hardware constraints limit switches capabilities of tracking stateful features essential for accurate inference, as the demand for these features grows rapidly with line rates. To address this, we propose DIDA, a distributed in-network machine learning approach. In DIDA, feature extraction occurs at the host, features are transmitted via in-band telemetry, and inference is performed on the switches. In this paper, we evaluate the effectiveness and efficiency of this architecture. We examine its impact on network bandwidth, CPU and memory usage at the host, and its robustness across different feature sets and deep neural network classifications.
期刊介绍:
IEEE Transactions on Network and Service Management will publish (online only) peerreviewed archival quality papers that advance the state-of-the-art and practical applications of network and service management. Theoretical research contributions (presenting new concepts and techniques) and applied contributions (reporting on experiences and experiments with actual systems) will be encouraged. These transactions will focus on the key technical issues related to: Management Models, Architectures and Frameworks; Service Provisioning, Reliability and Quality Assurance; Management Functions; Enabling Technologies; Information and Communication Models; Policies; Applications and Case Studies; Emerging Technologies and Standards.