面向YOLOv3并发实时目标检测的多线程帧平铺模型

Proceedings of the 2020 6th International Conference on Computer and Technology Applications Pub Date : 2020-04-14 DOI:10.1145/3397125.3397139

Sara Abri, Rayan Abri, Anıl Yarıcı, S. Cetin

{"title":"面向YOLOv3并发实时目标检测的多线程帧平铺模型","authors":"Sara Abri, Rayan Abri, Anıl Yarıcı, S. Cetin","doi":"10.1145/3397125.3397139","DOIUrl":null,"url":null,"abstract":"In the process of object detection, You Look Only Once, version 3 (YOLOv3) approach has been applied as a more efficient solution than Convolutional Neural Network (CNN) methods in accuracy and speed. In some environments, there is a need to process multiple real-time object detection algorithms concurrently where each object detection algorithm receives a live stream from a camera. In this research, we aim to optimize the system resources including the Graphics Processing Unit(GPU) and the Central Processing Unit(CPU) where accuracy indicated using frames per second (FPS) has no change in the process of detection. We propose improvements in terms of architectural models for real-time object detection. In this way, a Multi-thread Frame Tiling model is proposed to optimize GPU and CPU usage per YOLO object to handle concurrent surveillance video streams. Thus, it can be covered a large scale of surveillance cameras on multiple live streams concurrently. To evaluate the proposed model, we investigate the efficiency of GPU and CPU system resources. The experiment results show that the detection algorithm with the Multi-thread Frame Tiling model is by an average of 116.6 in FPS for all video qualities (480p, 720p, and 1080p resolution) compared to the Original YOLOv3 by an average of 27.1 while there is no change in accuracy in the detection process by YOLOv3.","PeriodicalId":303304,"journal":{"name":"Proceedings of the 2020 6th International Conference on Computer and Technology Applications","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Multi-Thread Frame Tiling Model in Concurrent Real-Time Object Detection for Resources Optimization in YOLOv3\",\"authors\":\"Sara Abri, Rayan Abri, Anıl Yarıcı, S. Cetin\",\"doi\":\"10.1145/3397125.3397139\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In the process of object detection, You Look Only Once, version 3 (YOLOv3) approach has been applied as a more efficient solution than Convolutional Neural Network (CNN) methods in accuracy and speed. In some environments, there is a need to process multiple real-time object detection algorithms concurrently where each object detection algorithm receives a live stream from a camera. In this research, we aim to optimize the system resources including the Graphics Processing Unit(GPU) and the Central Processing Unit(CPU) where accuracy indicated using frames per second (FPS) has no change in the process of detection. We propose improvements in terms of architectural models for real-time object detection. In this way, a Multi-thread Frame Tiling model is proposed to optimize GPU and CPU usage per YOLO object to handle concurrent surveillance video streams. Thus, it can be covered a large scale of surveillance cameras on multiple live streams concurrently. To evaluate the proposed model, we investigate the efficiency of GPU and CPU system resources. The experiment results show that the detection algorithm with the Multi-thread Frame Tiling model is by an average of 116.6 in FPS for all video qualities (480p, 720p, and 1080p resolution) compared to the Original YOLOv3 by an average of 27.1 while there is no change in accuracy in the detection process by YOLOv3.\",\"PeriodicalId\":303304,\"journal\":{\"name\":\"Proceedings of the 2020 6th International Conference on Computer and Technology Applications\",\"volume\":\"21 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-04-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2020 6th International Conference on Computer and Technology Applications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3397125.3397139\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2020 6th International Conference on Computer and Technology Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3397125.3397139","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

摘要

在目标检测过程中，You Look Only Once, version 3 (YOLOv3)方法在精度和速度上都比卷积神经网络(CNN)方法更有效。在某些环境中，需要同时处理多个实时目标检测算法，其中每个目标检测算法接收来自摄像机的实时流。在本研究中，我们的目标是优化系统资源，包括图形处理单元(GPU)和中央处理单元(CPU)，其中使用每秒帧数(FPS)表示的精度在检测过程中没有变化。我们在实时目标检测的架构模型方面提出了改进。在此基础上，提出了一种多线程帧平铺模型来优化每个YOLO对象的GPU和CPU使用率，以处理并发监控视频流。因此，它可以同时覆盖多个实时流上的大规模监控摄像机。为了评估所提出的模型，我们研究了GPU和CPU系统资源的效率。实验结果表明，在所有视频质量(480p、720p和1080p分辨率)下，采用多线程帧平铺模型的检测算法比原始的YOLOv3平均提高了27.1帧/秒，而YOLOv3在检测过程中的精度没有变化。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Multi-Thread Frame Tiling Model in Concurrent Real-Time Object Detection for Resources Optimization in YOLOv3

In the process of object detection, You Look Only Once, version 3 (YOLOv3) approach has been applied as a more efficient solution than Convolutional Neural Network (CNN) methods in accuracy and speed. In some environments, there is a need to process multiple real-time object detection algorithms concurrently where each object detection algorithm receives a live stream from a camera. In this research, we aim to optimize the system resources including the Graphics Processing Unit(GPU) and the Central Processing Unit(CPU) where accuracy indicated using frames per second (FPS) has no change in the process of detection. We propose improvements in terms of architectural models for real-time object detection. In this way, a Multi-thread Frame Tiling model is proposed to optimize GPU and CPU usage per YOLO object to handle concurrent surveillance video streams. Thus, it can be covered a large scale of surveillance cameras on multiple live streams concurrently. To evaluate the proposed model, we investigate the efficiency of GPU and CPU system resources. The experiment results show that the detection algorithm with the Multi-thread Frame Tiling model is by an average of 116.6 in FPS for all video qualities (480p, 720p, and 1080p resolution) compared to the Original YOLOv3 by an average of 27.1 while there is no change in accuracy in the detection process by YOLOv3.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings of the 2020 6th International Conference on Computer and Technology Applications

自引率

0.00%

发文量