Rim: Offloading Inference to the Edge

Proceedings of the International Conference on Internet-of-Things Design and Implementation Pub Date : 2021-05-18 DOI:10.1145/3450268.3453521

Yitao Hu, Weiwu Pang, Xiaochen Liu, Rajrup Ghosh, Bongjun Ko, Wei-Han Lee, R. Govindan

引用次数: 11

Abstract

Video cameras are among the most ubiquitous sensors in the Internet-of-Things. Video and audio applications, such as cross-camera activity detection, avatar extraction or language translation will, in the future, offload processing to an edge cluster of GPUs. Rim is a management system for such clusters that satisfies throughput and latency requirements of these applications, while enabling high cluster utilization. It uses coarse-grained knowledge of application structure to profile throughput of applications on resources, then uses these profiles to place applications on cluster nodes to achieve these goals. It dynamically adapts placement to load and failures. Experiments show that on maximal workloads on a testbed, Rim can satisfy requirements of all applications, but competing approaches designed for low-latency GPU execution cannot.

查看原文本刊更多论文

边缘:卸载边缘推理

摄像机是物联网中最普遍的传感器之一。视频和音频应用程序，如跨摄像头活动检测、角色提取或语言翻译，未来将把处理工作转移到gpu的边缘集群上。Rim是一个用于此类集群的管理系统，它可以满足这些应用程序的吞吐量和延迟需求，同时实现高集群利用率。它使用应用程序结构的粗粒度知识来分析应用程序在资源上的吞吐量，然后使用这些配置文件将应用程序放置在集群节点上以实现这些目标。它根据负载和故障动态地调整位置。实验表明，在测试平台上的最大工作负载下，Rim可以满足所有应用程序的要求，但竞争对手为低延迟GPU执行而设计的方法却不能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the International Conference on Internet-of-Things Design and Implementation

自引率

0.00%

发文量