Dynamic DNN model selection and inference off loading for video analytics with edge-cloud collaboration

Proceedings of the 32nd Workshop on Network and Operating Systems Support for Digital Audio and Video Pub Date : 2022-06-17 DOI:10.1145/3534088.3534352

Xuezhi Wang, Guanyu Gao, Xiaohu Wu, Yan Lyu, Weiwei Wu

{"title":"Dynamic DNN model selection and inference off loading for video analytics with edge-cloud collaboration","authors":"Xuezhi Wang, Guanyu Gao, Xiaohu Wu, Yan Lyu, Weiwei Wu","doi":"10.1145/3534088.3534352","DOIUrl":null,"url":null,"abstract":"The edge-cloud collaboration architecture can support Deep Neural Network-based (DNN) video analytics with low inference delays and high accuracy. However, the video analytics pipelines with edge-cloud collaboration are complex, involving the decision-making for many coupled control knobs. We propose a deep reinforcement learning-based approach, named ModelIO, for dynamic DNN Model selection and Inference Offloading for video analytics with edge-cloud collaboration. We jointly consider the decision-making for video pre-processing, DNN model selection, local inference, and offloading in a video analytics system to maximize performances. Our method can learn the optimal control policy for video analytics with the edge-cloud collaboration without complex system modeling. We implement a real-world testbed to conduct the experiments to evaluate the performances of our method. The results show that our method can significantly improve the system processing capacity, reduce average inference delays, and maximize overall rewards.","PeriodicalId":150454,"journal":{"name":"Proceedings of the 32nd Workshop on Network and Operating Systems Support for Digital Audio and Video","volume":"271 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 32nd Workshop on Network and Operating Systems Support for Digital Audio and Video","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3534088.3534352","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 5

Abstract

The edge-cloud collaboration architecture can support Deep Neural Network-based (DNN) video analytics with low inference delays and high accuracy. However, the video analytics pipelines with edge-cloud collaboration are complex, involving the decision-making for many coupled control knobs. We propose a deep reinforcement learning-based approach, named ModelIO, for dynamic DNN Model selection and Inference Offloading for video analytics with edge-cloud collaboration. We jointly consider the decision-making for video pre-processing, DNN model selection, local inference, and offloading in a video analytics system to maximize performances. Our method can learn the optimal control policy for video analytics with the edge-cloud collaboration without complex system modeling. We implement a real-world testbed to conduct the experiments to evaluate the performances of our method. The results show that our method can significantly improve the system processing capacity, reduce average inference delays, and maximize overall rewards.

查看原文本刊更多论文

边缘云协作视频分析的动态DNN模型选择和推理卸载

边缘云协作架构可以支持基于深度神经网络(DNN)的视频分析，具有低推理延迟和高精度。然而，与边缘云协作的视频分析管道是复杂的，涉及到许多耦合控制旋钮的决策。我们提出了一种基于深度强化学习的方法，名为ModelIO，用于动态DNN模型选择和推断卸载，用于边缘云协作的视频分析。我们共同考虑了视频分析系统中视频预处理、深度神经网络模型选择、局部推理和卸载的决策，以实现性能最大化。该方法可以在不需要复杂系统建模的情况下，通过边缘云协作学习视频分析的最优控制策略。我们实现了一个真实的测试平台来进行实验，以评估我们的方法的性能。结果表明，该方法可以显著提高系统的处理能力，降低平均推理延迟，并使整体奖励最大化。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 32nd Workshop on Network and Operating Systems Support for Digital Audio and Video

自引率

0.00%

发文量