Flexible high-resolution object detection on edge devices with tunable latency

Proceedings of the 27th Annual International Conference on Mobile Computing and Networking Pub Date : 2021-10-19 DOI:10.1145/3447993.3483274

Shiqi Jiang, Zhiqi Lin, Yuanchun Li, Yuanchao Shu, Yunxin Liu

{"title":"Flexible high-resolution object detection on edge devices with tunable latency","authors":"Shiqi Jiang, Zhiqi Lin, Yuanchun Li, Yuanchao Shu, Yunxin Liu","doi":"10.1145/3447993.3483274","DOIUrl":null,"url":null,"abstract":"Object detection is a fundamental building block of video analytics applications. While Neural Networks (NNs)-based object detection models have shown excellent accuracy on benchmark datasets, they are not well positioned for high-resolution images inference on resource-constrained edge devices. Common approaches, including down-sampling inputs and scaling up neural networks, fall short of adapting to video content changes and various latency requirements. This paper presents Remix, a flexible framework for high-resolution object detection on edge devices. Remix takes as input a latency budget, and come up with an image partition and model execution plan which runs off-the-shelf neural networks on non-uniformly partitioned image blocks. As a result, it maximizes the overall detection accuracy by allocating various amount of compute power onto different areas of an image. We evaluate Remix on public dataset as well as real-world videos collected by ourselves. Experimental results show that Remix can either improve the detection accuracy by 18%-120% for a given latency budget, or achieve up to 8.1× inference speedup with accuracy on par with the state-of-the-art NNs.","PeriodicalId":177431,"journal":{"name":"Proceedings of the 27th Annual International Conference on Mobile Computing and Networking","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"43","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 27th Annual International Conference on Mobile Computing and Networking","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3447993.3483274","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 43

Abstract

Object detection is a fundamental building block of video analytics applications. While Neural Networks (NNs)-based object detection models have shown excellent accuracy on benchmark datasets, they are not well positioned for high-resolution images inference on resource-constrained edge devices. Common approaches, including down-sampling inputs and scaling up neural networks, fall short of adapting to video content changes and various latency requirements. This paper presents Remix, a flexible framework for high-resolution object detection on edge devices. Remix takes as input a latency budget, and come up with an image partition and model execution plan which runs off-the-shelf neural networks on non-uniformly partitioned image blocks. As a result, it maximizes the overall detection accuracy by allocating various amount of compute power onto different areas of an image. We evaluate Remix on public dataset as well as real-world videos collected by ourselves. Experimental results show that Remix can either improve the detection accuracy by 18%-120% for a given latency budget, or achieve up to 8.1× inference speedup with accuracy on par with the state-of-the-art NNs.

查看原文本刊更多论文

具有可调延迟的边缘设备上灵活的高分辨率对象检测

对象检测是视频分析应用程序的基本组成部分。虽然基于神经网络(nn)的目标检测模型在基准数据集上显示出出色的准确性，但它们并不适合在资源受限的边缘设备上进行高分辨率图像推断。常见的方法，包括降低采样输入和放大神经网络，都不能适应视频内容的变化和各种延迟要求。本文提出了一种用于边缘设备高分辨率目标检测的灵活框架Remix。Remix将延迟预算作为输入，并提出了一个图像分区和模型执行计划，该计划在非均匀分区的图像块上运行现成的神经网络。因此，它通过将不同数量的计算能力分配到图像的不同区域来最大化整体检测精度。我们在公共数据集和我们自己收集的真实视频上评估Remix。实验结果表明，在给定的延迟预算下，Remix可以将检测精度提高18%-120%，或者实现高达8.1倍的推理加速，并且精度与最先进的神经网络相当。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 27th Annual International Conference on Mobile Computing and Networking

自引率

0.00%

发文量