Impact of Video Compression on the Performance of Object Detection Systems for Surveillance Applications

2022 18th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS) Pub Date : 2022-11-10 DOI:10.1109/AVSS56176.2022.9959476

Michael O'Byrne, M. Sugrue, Kinesense Ltd, Anil Vibhoothi, Kokaram

{"title":"Impact of Video Compression on the Performance of Object Detection Systems for Surveillance Applications","authors":"Michael O'Byrne, M. Sugrue, Kinesense Ltd, Anil Vibhoothi, Kokaram","doi":"10.1109/AVSS56176.2022.9959476","DOIUrl":null,"url":null,"abstract":"This study examines the relationship between H.264 video compression and the performance of an object detection network (YOLOv5). We curated a set of 50 surveillance videos and annotated targets of interest (people, bikes, and vehicles). Videos were encoded at 5 quality levels using Constant Rate Factor (CRF) values in the set {22,32,37,42,47}. YOLOv5 was applied to compressed videos and detection performance was analyzed at each CRF level. Test results indicate that the detection performance is generally robust to moderate levels of compression; using a CRF value of 37 instead of 22 leads to significantly reduced bitrates/file sizes without adversely affecting detection performance. However, detection performance degrades appreciably at higher compression levels, especially in complex scenes with poor lighting and fast-moving targets. Finally, retraining YOLOv5 on compressed imagery gives up to a 1% improvement in F1 score when applied to highly compressed footage.","PeriodicalId":408581,"journal":{"name":"2022 18th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS)","volume":"8 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 18th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AVSS56176.2022.9959476","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

This study examines the relationship between H.264 video compression and the performance of an object detection network (YOLOv5). We curated a set of 50 surveillance videos and annotated targets of interest (people, bikes, and vehicles). Videos were encoded at 5 quality levels using Constant Rate Factor (CRF) values in the set {22,32,37,42,47}. YOLOv5 was applied to compressed videos and detection performance was analyzed at each CRF level. Test results indicate that the detection performance is generally robust to moderate levels of compression; using a CRF value of 37 instead of 22 leads to significantly reduced bitrates/file sizes without adversely affecting detection performance. However, detection performance degrades appreciably at higher compression levels, especially in complex scenes with poor lighting and fast-moving targets. Finally, retraining YOLOv5 on compressed imagery gives up to a 1% improvement in F1 score when applied to highly compressed footage.

查看原文本刊更多论文

视频压缩对目标检测系统性能的影响

本研究探讨了H.264视频压缩与目标检测网络(YOLOv5)性能之间的关系。我们策划了一组50个监控视频，并标注了感兴趣的目标(人、自行车和车辆)。使用集{22,32,37,42,47}中的恒定速率因子(CRF)值以5个质量级别对视频进行编码。将YOLOv5应用于压缩视频，并在每个CRF水平上分析检测性能。测试结果表明，该算法在中等压缩水平下具有较好的鲁棒性;使用CRF值37而不是22可以显著降低比特率/文件大小，而不会对检测性能产生不利影响。然而，在较高的压缩水平下，检测性能明显下降，特别是在光线不足和快速移动目标的复杂场景中。最后，在压缩图像上重新训练YOLOv5，当应用于高度压缩的镜头时，F1分数提高了1%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2022 18th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS)

自引率

0.00%

发文量