{"title":"UCL: Unit Competition of Layers for Streaming Tasks in Heterogeneous Networks","authors":"Jing Yu, Liantao Wu, Guoliang Gao, Chenyu Gong","doi":"10.1109/GLOBECOM48099.2022.10000741","DOIUrl":null,"url":null,"abstract":"Partitioning and offloading the deep neural network (DNN) model over multi-tier computing units have been recently proposed to shorten the inference time. However, the state-of-the-art cannot adapt to large-scale offloading problems for streaming tasks because of its exponential complexity. Besides, as an essential kind of DNNs, the offloading of grouped con-volutional neural networks (GCNNs) has not been explored yet. Motivated by the above facts, in this paper, we concentrate on the offloading of chained DNNs (CDNNs) and GCNNs for streaming tasks. Consider a typical heterogeneous network consisting of various computing units, the user equipment (UE) publishes computation-intensive and delay-sensitive streaming DNN tasks while computing units accomplish them collaboratively. To mini-mize the delay of processing the task stream, DNN layers should be offloaded to appropriate units, which is the streaming-task multi-unit (STMU) problem. To tackle this problem, we formulate a non-cooperative potential game called unit competition of layers (UCL). The theoretical analysis proves the existence of the Nash equilibrium (NE), and the corresponding algorithm with linear complexity is developed to achieve the NE. Finally, extensive experiments demonstrate that UCL outperforms the state-of-the-art significantly in large-scale scenarios while maintaining similar performance on small-scale tasks.","PeriodicalId":313199,"journal":{"name":"GLOBECOM 2022 - 2022 IEEE Global Communications Conference","volume":"16 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"GLOBECOM 2022 - 2022 IEEE Global Communications Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/GLOBECOM48099.2022.10000741","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Partitioning and offloading the deep neural network (DNN) model over multi-tier computing units have been recently proposed to shorten the inference time. However, the state-of-the-art cannot adapt to large-scale offloading problems for streaming tasks because of its exponential complexity. Besides, as an essential kind of DNNs, the offloading of grouped con-volutional neural networks (GCNNs) has not been explored yet. Motivated by the above facts, in this paper, we concentrate on the offloading of chained DNNs (CDNNs) and GCNNs for streaming tasks. Consider a typical heterogeneous network consisting of various computing units, the user equipment (UE) publishes computation-intensive and delay-sensitive streaming DNN tasks while computing units accomplish them collaboratively. To mini-mize the delay of processing the task stream, DNN layers should be offloaded to appropriate units, which is the streaming-task multi-unit (STMU) problem. To tackle this problem, we formulate a non-cooperative potential game called unit competition of layers (UCL). The theoretical analysis proves the existence of the Nash equilibrium (NE), and the corresponding algorithm with linear complexity is developed to achieve the NE. Finally, extensive experiments demonstrate that UCL outperforms the state-of-the-art significantly in large-scale scenarios while maintaining similar performance on small-scale tasks.