{"title":"Characterizing Network Requirements for GPU API Remoting in AI Applications","authors":"Tianxia Wang, Zhuofu Chen, Xingda Wei, Jinyu Gu, Rong Chen, Haibo Chen","doi":"arxiv-2401.13354","DOIUrl":null,"url":null,"abstract":"GPU remoting is a promising technique for supporting AI applications.\nNetworking plays a key role in enabling remoting. However, for efficient\nremoting, the network requirements in terms of latency and bandwidth are\nunknown. In this paper, we take a GPU-centric approach to derive the minimum\nlatency and bandwidth requirements for GPU remoting, while ensuring no (or\nlittle) performance degradation for AI applications. Our study including\ntheoretical model demonstrates that, with careful remoting design, unmodified\nAI applications can run on the remoting setup using commodity networking\nhardware without any overhead or even with better performance, with low network\ndemands.","PeriodicalId":501333,"journal":{"name":"arXiv - CS - Operating Systems","volume":"16 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Operating Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2401.13354","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
GPU remoting is a promising technique for supporting AI applications.
Networking plays a key role in enabling remoting. However, for efficient
remoting, the network requirements in terms of latency and bandwidth are
unknown. In this paper, we take a GPU-centric approach to derive the minimum
latency and bandwidth requirements for GPU remoting, while ensuring no (or
little) performance degradation for AI applications. Our study including
theoretical model demonstrates that, with careful remoting design, unmodified
AI applications can run on the remoting setup using commodity networking
hardware without any overhead or even with better performance, with low network
demands.