Efficient Object Tracking in Compressed Video Streams with Graph Cuts

2018 IEEE 20th International Workshop on Multimedia Signal Processing (MMSP) Pub Date : 2018-08-01 DOI:10.1109/MMSP.2018.8547120

Fernando Bombardelli da Silva, Serhan Gul, Daniel Becker, Matthias Schmidt, C. Hellge

{"title":"Efficient Object Tracking in Compressed Video Streams with Graph Cuts","authors":"Fernando Bombardelli da Silva, Serhan Gul, Daniel Becker, Matthias Schmidt, C. Hellge","doi":"10.1109/MMSP.2018.8547120","DOIUrl":null,"url":null,"abstract":"In this paper we present a compressed-domain object tracking algorithm for H.264/AVC compressed videos and integrate the proposed algorithm into an indoor vehicle tracking scenario at a car park. Our algorithm works by taking an initial segmentation map or bounding box of the target object in the first frame of the video sequence as input and applying Graph Cuts optimization based on a Markov Random Field model. Our algorithm does not rely on pixels (except for the first frame) and works by only using the codec motion vectors and block coding modes extracted from the H.264/AVC bitstream via inexpensive partial decoding. In this way, we manage to reduce the compute and storage requirements of our system significantly compared to “pixel-domain” tracking algorithms that first fully decode the video stream and work on reconstructed pixels. We demonstrate the quantitative performance of our algorithm over VOT2016 dataset and also integrate our algorithm into a camera-based parking management system and show qualitative results in a real application scenario. Results show that our compressed-domain algorithm provides a good compromise between high accuracy tracking and low-complexity processing showing that it is feasible for scenarios requiring large-scale object tracking in bandwidth-limited conditions.","PeriodicalId":137522,"journal":{"name":"2018 IEEE 20th International Workshop on Multimedia Signal Processing (MMSP)","volume":"48 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE 20th International Workshop on Multimedia Signal Processing (MMSP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MMSP.2018.8547120","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 4

Abstract

In this paper we present a compressed-domain object tracking algorithm for H.264/AVC compressed videos and integrate the proposed algorithm into an indoor vehicle tracking scenario at a car park. Our algorithm works by taking an initial segmentation map or bounding box of the target object in the first frame of the video sequence as input and applying Graph Cuts optimization based on a Markov Random Field model. Our algorithm does not rely on pixels (except for the first frame) and works by only using the codec motion vectors and block coding modes extracted from the H.264/AVC bitstream via inexpensive partial decoding. In this way, we manage to reduce the compute and storage requirements of our system significantly compared to “pixel-domain” tracking algorithms that first fully decode the video stream and work on reconstructed pixels. We demonstrate the quantitative performance of our algorithm over VOT2016 dataset and also integrate our algorithm into a camera-based parking management system and show qualitative results in a real application scenario. Results show that our compressed-domain algorithm provides a good compromise between high accuracy tracking and low-complexity processing showing that it is feasible for scenarios requiring large-scale object tracking in bandwidth-limited conditions.

查看原文本刊更多论文

使用图形切割的压缩视频流中的有效对象跟踪

本文提出了一种针对H.264/AVC压缩视频的压缩域目标跟踪算法，并将该算法集成到停车场的室内车辆跟踪场景中。我们的算法将视频序列第一帧中目标对象的初始分割图或边界框作为输入，并应用基于马尔可夫随机场模型的Graph Cuts优化。我们的算法不依赖于像素(除了第一帧)，只使用编解码器运动矢量和块编码模式，通过廉价的部分解码从H.264/AVC比特流中提取。通过这种方式，与首先完全解码视频流并处理重建像素的“像素域”跟踪算法相比，我们设法大大减少了系统的计算和存储需求。我们在VOT2016数据集上展示了我们的算法的定量性能，并将我们的算法集成到基于摄像头的停车管理系统中，并在实际应用场景中展示了定性结果。结果表明，我们的压缩域算法在高精度跟踪和低复杂度处理之间提供了很好的折衷，表明在带宽有限的条件下需要大规模目标跟踪的场景是可行的。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2018 IEEE 20th International Workshop on Multimedia Signal Processing (MMSP)

自引率

0.00%

发文量