Adaptive DNN Partition in Edge Computing Environments
Weiwei Miao, Zeng Zeng, Lei Wei, Shihao Li, Chengling Jiang, Zhen Zhang
2020 IEEE 26th International Conference on Parallel and Distributed Systems (ICPADS), December 2020
DOI: 10.1109/ICPADS51040.2020.00097
Citations: 8
Abstract
Deep Neural Networks (DNNs) are now widely applied and have achieved remarkable results across a wide variety of research fields. As accuracy requirements for inference results rise, DNN topologies have grown increasingly complex, evolving from chain topologies to directed acyclic graphs (DAGs), which greatly increases the amount of computation. For end devices with limited computing resources, the delay of running DNN models independently may be intolerable. Edge computing offers a solution: by comprehensively using all available devices in the edge computing environment to run DNN inference tasks, inference can be accelerated. The central issue is then how to split a DNN inference task into several smaller tasks and assign them to different edge devices. This paper proposes a load-balancing algorithm that adaptively partitions a DNN with DAG topology according to the environment. Extensive experimental results show that the proposed adaptive algorithm can effectively accelerate inference.
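The abstract does not detail the paper's partitioning algorithm, but the general idea it describes can be sketched with a generic greedy heuristic: traverse the DAG of DNN layers in topological order and assign each layer to the edge device that minimizes its accumulated load. The function, node/device names, and cost model below are illustrative assumptions, not the authors' method.

```python
from collections import deque

def partition_dag(nodes, edges, costs, device_speeds):
    """Greedily assign DAG nodes (in topological order) to the device with
    the smallest resulting load. Illustrative load-balancing heuristic only;
    it ignores communication cost between devices."""
    # Build successor lists and in-degrees for Kahn's topological sort.
    succ = {n: [] for n in nodes}
    indeg = {n: 0 for n in nodes}
    for u, v in edges:
        succ[u].append(v)
        indeg[v] += 1
    ready = deque(n for n in nodes if indeg[n] == 0)

    load = {d: 0.0 for d in device_speeds}  # accumulated compute time per device
    assignment = {}
    while ready:
        n = ready.popleft()
        # Pick the device whose load after running node n would be smallest.
        best = min(device_speeds, key=lambda d: load[d] + costs[n] / device_speeds[d])
        assignment[n] = best
        load[best] += costs[n] / device_speeds[best]
        for v in succ[n]:
            indeg[v] -= 1
            if indeg[v] == 0:
                ready.append(v)
    return assignment, load
```

For a diamond-shaped DAG with uniform layer costs and two equal-speed devices, this heuristic spreads the four layers evenly across both devices. A real partitioner, like the one the paper targets, would additionally account for inter-device data-transfer delay and adapt the split as device availability changes.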