RAVIP:异构多通道视频流实时AI视觉平台

J. Inf. Process. Syst. Pub Date : 2021-04-01 DOI:10.3745/JIPS.02.0154

Jeong-Hun Lee, Kwang-il Hwang

{"title":"RAVIP:异构多通道视频流实时AI视觉平台","authors":"Jeong-Hun Lee, Kwang-il Hwang","doi":"10.3745/JIPS.02.0154","DOIUrl":null,"url":null,"abstract":"Object detection techniques based on deep learning such as YOLO have high detection performance and precision in a single channel video stream. In order to expand to multiple channel object detection in real-time, however, high-performance hardware is required. In this paper, we propose a novel back-end server framework, a real-time AI vision platform (RAVIP), which can extend the object detection function from single channel to simultaneous multi-channels, which can work well even in low-end server hardware. RAVIP assembles appropriate component modules from the RODEM (real-time object detection module) Base to create perchannel instances for each channel, enabling efficient parallelization of object detection instances on limited hardware resources through continuous monitoring with respect to resource utilization. Through practical experiments, RAVIP shows that it is possible to optimize CPU, GPU, and memory utilization while performing object detection service in a multi-channel situation. In addition, it has been proven that RAVIP can provide object detection services with 25 FPS for all 16 channels at the same time.","PeriodicalId":415161,"journal":{"name":"J. Inf. Process. Syst.","volume":"17 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"RAVIP: Real-Time AI Vision Platform for Heterogeneous Multi-Channel Video Stream\",\"authors\":\"Jeong-Hun Lee, Kwang-il Hwang\",\"doi\":\"10.3745/JIPS.02.0154\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Object detection techniques based on deep learning such as YOLO have high detection performance and precision in a single channel video stream. In order to expand to multiple channel object detection in real-time, however, high-performance hardware is required. In this paper, we propose a novel back-end server framework, a real-time AI vision platform (RAVIP), which can extend the object detection function from single channel to simultaneous multi-channels, which can work well even in low-end server hardware. RAVIP assembles appropriate component modules from the RODEM (real-time object detection module) Base to create perchannel instances for each channel, enabling efficient parallelization of object detection instances on limited hardware resources through continuous monitoring with respect to resource utilization. Through practical experiments, RAVIP shows that it is possible to optimize CPU, GPU, and memory utilization while performing object detection service in a multi-channel situation. In addition, it has been proven that RAVIP can provide object detection services with 25 FPS for all 16 channels at the same time.\",\"PeriodicalId\":415161,\"journal\":{\"name\":\"J. Inf. Process. Syst.\",\"volume\":\"17 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-04-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"J. Inf. Process. Syst.\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.3745/JIPS.02.0154\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"J. Inf. Process. Syst.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3745/JIPS.02.0154","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

摘要

基于深度学习的目标检测技术如YOLO在单通道视频流中具有较高的检测性能和精度。然而，为了扩展到实时的多通道目标检测，需要高性能的硬件。本文提出了一种新颖的后端服务器框架——实时人工智能视觉平台(RAVIP)，它可以将目标检测功能从单通道扩展到同时多通道，即使在低端服务器硬件上也能很好地工作。RAVIP从RODEM(实时目标检测模块)库中组装适当的组件模块，为每个通道创建跨通道实例，通过对资源利用率的持续监控，在有限的硬件资源上实现目标检测实例的高效并行化。通过实际实验，RAVIP表明，在多通道情况下，在执行目标检测服务时，可以优化CPU、GPU和内存利用率。此外，已经证明RAVIP可以同时为所有16个通道提供25 FPS的目标检测服务。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

RAVIP: Real-Time AI Vision Platform for Heterogeneous Multi-Channel Video Stream

Object detection techniques based on deep learning such as YOLO have high detection performance and precision in a single channel video stream. In order to expand to multiple channel object detection in real-time, however, high-performance hardware is required. In this paper, we propose a novel back-end server framework, a real-time AI vision platform (RAVIP), which can extend the object detection function from single channel to simultaneous multi-channels, which can work well even in low-end server hardware. RAVIP assembles appropriate component modules from the RODEM (real-time object detection module) Base to create perchannel instances for each channel, enabling efficient parallelization of object detection instances on limited hardware resources through continuous monitoring with respect to resource utilization. Through practical experiments, RAVIP shows that it is possible to optimize CPU, GPU, and memory utilization while performing object detection service in a multi-channel situation. In addition, it has been proven that RAVIP can provide object detection services with 25 FPS for all 16 channels at the same time.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

J. Inf. Process. Syst.

自引率

0.00%

发文量