A DCT-Domain Video Alignment Technique for MPEG Sequences

2005 IEEE 7th Workshop on Multimedia Signal Processing Pub Date : 2005-10-01 DOI:10.1109/MMSP.2005.248599

Ming-Sui Lee, Mei-Yin Shen, C.-C. Jay Kuo

{"title":"A DCT-Domain Video Alignment Technique for MPEG Sequences","authors":"Ming-Sui Lee, Mei-Yin Shen, C.-C. Jay Kuo","doi":"10.1109/MMSP.2005.248599","DOIUrl":null,"url":null,"abstract":"An image/video registration technique for multiple compressed video inputs such as MPEG sequences is investigated. The proposed technique is based on the matching of discrete cosine transform (DCT) coefficients and motion vectors. First, the I frame of each input sequence is separated into the background and moving objects. For the background, coarse edge features are extracted by applying edge detectors of different characteristics to the luminance DC coefficients. Each detector generates a difference map for a single background. A threshold is determined for each difference map to produce a binary map. Then, alignment parameters are determined using the binary maps of input images generated by the same detector. For the moving object, alignment parameters can be finetuned by the motion information of all frames in the same group of pictures (GOP). Finally, the actual displacement in the pixel domain is estimated by the weighted average of alignment parameters from all background detectors and refinement parameters from motion information. It is shown by experimental results that the proposed method reduces the computational cost of image/video registration significantly in comparison with the traditional pixel domain registration techniques while achieving certain quality of composition","PeriodicalId":191719,"journal":{"name":"2005 IEEE 7th Workshop on Multimedia Signal Processing","volume":"118 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2005-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2005 IEEE 7th Workshop on Multimedia Signal Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MMSP.2005.248599","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

Abstract

An image/video registration technique for multiple compressed video inputs such as MPEG sequences is investigated. The proposed technique is based on the matching of discrete cosine transform (DCT) coefficients and motion vectors. First, the I frame of each input sequence is separated into the background and moving objects. For the background, coarse edge features are extracted by applying edge detectors of different characteristics to the luminance DC coefficients. Each detector generates a difference map for a single background. A threshold is determined for each difference map to produce a binary map. Then, alignment parameters are determined using the binary maps of input images generated by the same detector. For the moving object, alignment parameters can be finetuned by the motion information of all frames in the same group of pictures (GOP). Finally, the actual displacement in the pixel domain is estimated by the weighted average of alignment parameters from all background detectors and refinement parameters from motion information. It is shown by experimental results that the proposed method reduces the computational cost of image/video registration significantly in comparison with the traditional pixel domain registration techniques while achieving certain quality of composition

查看原文本刊更多论文

一种用于MPEG序列的dct域视频对齐技术

研究了多压缩视频输入(如MPEG序列)的图像/视频配准技术。该方法基于离散余弦变换(DCT)系数与运动向量的匹配。首先，将每个输入序列的I帧分离为背景和运动物体。对于背景，采用不同特征的边缘检测器对亮度DC系数提取粗边缘特征。每个检测器为单个背景生成一个差值图。为每个差值映射确定一个阈值以生成二值映射。然后，利用同一检测器生成的输入图像的二值映射确定对准参数。对于运动目标，可以利用同一组图像(GOP)中所有帧的运动信息来微调对齐参数。最后，通过加权平均所有背景检测器的对准参数和运动信息的细化参数来估计像素域的实际位移。实验结果表明，与传统的像素域配准技术相比，该方法在达到一定的合成质量的同时，显著降低了图像/视频配准的计算成本

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2005 IEEE 7th Workshop on Multimedia Signal Processing

自引率

0.00%

发文量