A perceptual preprocessor to segment video for motion estimation

Proceedings DCC '98 Data Compression Conference (Cat. No.98TB100225) Pub Date : 1998-03-30 DOI:10.1109/DCC.1998.672255

Yi-jen Chiu

{"title":"A perceptual preprocessor to segment video for motion estimation","authors":"Yi-jen Chiu","doi":"10.1109/DCC.1998.672255","DOIUrl":null,"url":null,"abstract":"Summary form only given. The objective of motion estimation and motion compensation is to reduce the temporal redundancy between adjacent pictures in a video sequence. Motion estimation is usually performed by calculating an error metric, such as mean absolute error (MAE), for each block in the current frame over a displaced region in the previously reconstructed frame. The motion vector is attained as the displacement having the minimum error metric. Although this achieves minimum-MAE in the residual block, it does not necessarily result in the best perceptual quality since the MAE is not always a good indicator of video quality. In low bit rate video coding, the overhead in sending the motion vectors becomes a significant proportion of the total data rate. The minimum-MAE motion vector may not achieve the minimum joint entropy for coding the residual block and motion vector, and thus may not achieve the best compression efficiency. In this paper, we attack these problems by introducing a perceptual preprocessor which takes advantage of the insensitivity of the human visual system (HVS) to mild changes in pixel intensity in order to segment the video into regions according to the perceptibility of the picture changes. Our preprocessor can exploit the local psycho-perceptual properties of the HVS because it is designed to segment video in the spatio-temporal pixel domain. The associated computational complexity for the segmentation in the spatio-temporal pixel domain is very small. With the information of segmentation, we then determine which macroblocks require motion estimation.","PeriodicalId":191890,"journal":{"name":"Proceedings DCC '98 Data Compression Conference (Cat. No.98TB100225)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1998-03-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings DCC '98 Data Compression Conference (Cat. No.98TB100225)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DCC.1998.672255","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

Summary form only given. The objective of motion estimation and motion compensation is to reduce the temporal redundancy between adjacent pictures in a video sequence. Motion estimation is usually performed by calculating an error metric, such as mean absolute error (MAE), for each block in the current frame over a displaced region in the previously reconstructed frame. The motion vector is attained as the displacement having the minimum error metric. Although this achieves minimum-MAE in the residual block, it does not necessarily result in the best perceptual quality since the MAE is not always a good indicator of video quality. In low bit rate video coding, the overhead in sending the motion vectors becomes a significant proportion of the total data rate. The minimum-MAE motion vector may not achieve the minimum joint entropy for coding the residual block and motion vector, and thus may not achieve the best compression efficiency. In this paper, we attack these problems by introducing a perceptual preprocessor which takes advantage of the insensitivity of the human visual system (HVS) to mild changes in pixel intensity in order to segment the video into regions according to the perceptibility of the picture changes. Our preprocessor can exploit the local psycho-perceptual properties of the HVS because it is designed to segment video in the spatio-temporal pixel domain. The associated computational complexity for the segmentation in the spatio-temporal pixel domain is very small. With the information of segmentation, we then determine which macroblocks require motion estimation.

查看原文本刊更多论文

一种用于视频分割运动估计的感知预处理器

只提供摘要形式。运动估计和运动补偿的目标是减少视频序列中相邻图像之间的时间冗余。运动估计通常通过计算误差度量来执行，例如平均绝对误差(MAE)，对于当前帧中的每个块在先前重构帧中的移位区域。运动矢量作为具有最小误差度量的位移得到。虽然这在残差块中实现了最小的MAE，但它并不一定产生最佳的感知质量，因为MAE并不总是视频质量的良好指标。在低比特率视频编码中，发送运动矢量的开销占总数据速率的很大一部分。对于残差块和运动向量的编码，最小mae运动向量可能无法达到最小联合熵，因此可能无法达到最佳压缩效率。在本文中，我们通过引入感知预处理器来解决这些问题，该预处理器利用人类视觉系统(HVS)对像素强度的轻微变化不敏感的特点，根据图像变化的可感知性将视频分割成多个区域。我们的预处理器可以利用HVS的局部心理感知特性，因为它被设计成在时空像素域分割视频。在时空像素域进行分割的计算复杂度非常小。根据分割信息，确定需要运动估计的宏块。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings DCC '98 Data Compression Conference (Cat. No.98TB100225)

自引率

0.00%

发文量