{"title":"Video watermarking in the 3D-DWT domain using quantization-based methods","authors":"P. Campisi","doi":"10.1109/MMSP.2005.248600","DOIUrl":"https://doi.org/10.1109/MMSP.2005.248600","url":null,"abstract":"Video watermarking methods operating in the three-dimensional discrete wavelet transform (3D-DWT) domain using quantization-based embedding techniques are here proposed. Specifically, the video sequence is partitioned into spatio-temporal units of fixed length. Then the video shots undergo a three-level 3D-DWT. Two embedding methods are here considered: quantization index modulation and rational dither modulation. Finally, the inverse 3D-DWT is performed, thus obtaining the marked video shot. The effectiveness of the proposed methods has been verified experimentally. They guarantee a high mark imperceptibility as well as robustness to attacks such as MPEG2 compression, MPEG4 compression, collusion, transcoding, and frame dropping. Gain attack is also considered","PeriodicalId":191719,"journal":{"name":"2005 IEEE 7th Workshop on Multimedia Signal Processing","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124972858","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Motion Compensated Wyner-Ziv Video Coding","authors":"Jiong Sun, Haibo Li","doi":"10.1109/MMSP.2005.248547","DOIUrl":"https://doi.org/10.1109/MMSP.2005.248547","url":null,"abstract":"In Wyner-Ziv video coding, efficient compression is achieved by exploiting source statistics at the decoder only, which is radically different from conventional video coding. The performance of a Wyner-Ziv video codec is greatly dependent on the quality of reconstructed side information. In this paper we give an explicit motion compensation scheme. Unlike existing schemes, motion compensation is carried out between side information and the quantized version of the transmitted frame. This needs neither the encoder to transmit additional, implicit motion information to the receiver nor two I frames for interpolating the middle frames. This enables a sequential decoding architecture, which is a necessity for real-time video coding. Our experimental results are promising. 2dB gain in the quality of reconstructed frames has been achieved","PeriodicalId":191719,"journal":{"name":"2005 IEEE 7th Workshop on Multimedia Signal Processing","volume":"77 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129878262","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Novel Constant Quality Rate Control Scheme for Object-based Encoding","authors":"Y. H. Ang, R. Ma, C. Xiang","doi":"10.1109/MMSP.2005.248652","DOIUrl":"https://doi.org/10.1109/MMSP.2005.248652","url":null,"abstract":"In this paper, a novel constant quality rate control (CQRC) algorithm is proposed for object-based (MPEG-4) video codecs. Instead of minimizing distortion of every single frame or minimizing average frame distortion, this controller seeks to minimize the variation of the frame distortion to achieve consistently good quality for whole video sequences. The CQRC algorithm uses a linear rate control model to estimate frame-level bit allocation based on a target distortion measure, and a quadratic rate-quantization model to calculate the quantization parameter for the current frame. The scheme is then further extended to encompass multiple (arbitrarily shaped) video objects by means of a bitrate distribution algorithm. Experimental results demonstrate the ability of the CQRC algorithm to achieve a video sequence with fewer flickering effects and less motion jerkiness compared to MPEG-4's scalable rate control scheme, even in the scenario of using imperfect segmentation masks","PeriodicalId":191719,"journal":{"name":"2005 IEEE 7th Workshop on Multimedia Signal Processing","volume":"310 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122784598","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"SNR-Based Frame-Level Video Bit Rate Allocation","authors":"X. Zhuang, Xiangui Kang, Li Liu, Junqiang Lan, Guang Zhou, Guangdong Wu","doi":"10.1109/MMSP.2005.248611","DOIUrl":"https://doi.org/10.1109/MMSP.2005.248611","url":null,"abstract":"Quality fluctuation has a major negative effect on perceived video quality. In J. Lan et al. (2004), we derived accurate closed-form approximations for the highly nonlinear rate-distortion (R-D) and distortion-quantization (D-Q) relationships, all at the frame level. Based on the two closed forms, we can allocate the bit rate at the frame level rather easily, as long as a target distortion for each frame can be established. In J. Lan et al. (2004), a target distortion was set up for each frame based on the hypothesis that maintaining constant distortion over frames would boost video quality smoothness, and extensive experiments showed that the constant-distortion bit allocation (CDBA) scheme significantly outperformed the popular constant bit allocation (CBA) scheme in terms of delivered video quality. Maintaining constant distortion is no different from maintaining constant Peak-Signal-to-Noise-Ratio (PSNR). In scene changes, however, the picture energy often changes dramatically, producing significantly different Signal-to-Noise-Ratio (SNR) if constant distortion or constant PSNR is maintained. Although computationally more complex, SNR represents a more objective measure than PSNR in assessing picture/video quality. In the paper, an SNR-based bit allocation scheme is developed for video quality smoothing. The algorithm uses a single pass and attempts to maintain constant SNR at the frame level throughout the video sequence. Experimental results on all testing video sequences show that the proposed CSNRBA scheme provides smooth video quality in terms of natural color and sharp objects and silhouettes, significantly better than both the CBA and CDBA schemes","PeriodicalId":191719,"journal":{"name":"2005 IEEE 7th Workshop on Multimedia Signal Processing","volume":"159 3","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"113987938","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Study of Bias Correction Methods for Enhancing Median Edge Detector Prediction","authors":"T. Hai-jiang, K. Sei-ichiro, T. Kazuyuki","doi":"10.1109/MMSP.2005.248614","DOIUrl":"https://doi.org/10.1109/MMSP.2005.248614","url":null,"abstract":"In this paper, we present three novel lossless compression approaches for gray-scale continuous-tone natural images. Our methods enhance the median edge detector (MED), which is the core part of the JPEG-LS algorithm, by reducing the entropy of the prediction error via adaptive regression. These modified predictors improve the prediction accuracy by reducing the negative effect of MED's oversimplified edge orientation detection. The experimental results show that our approaches achieve evidently better performance than MED with only a negligible increase in computational complexity and without introducing extra pixels into the causal template","PeriodicalId":191719,"journal":{"name":"2005 IEEE 7th Workshop on Multimedia Signal Processing","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117292078","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Entropy-Based 2D Image Dissimilarity Measure","authors":"P. Tsai, Meng-Hung Wu","doi":"10.1109/MMSP.2005.248634","DOIUrl":"https://doi.org/10.1109/MMSP.2005.248634","url":null,"abstract":"Traditional histogram- or statistics-based 2D image similarity/dissimilarity metrics fail to handle conjugate pairs of black and white images, due to the lack of spatial information in the measurement. The recently proposed compression-based dissimilarity measure (CDM), based on the concept of Kolmogorov complexity, has provided a different paradigm for similarity measurement. However, without a clear definition of how to \"concatenate\" two 2D images, CDM is difficult to apply to 2D images directly. In this paper, we propose an entropy-based 2D image dissimilarity measure within the same Kolmogorov complexity paradigm. The spatial relationship between images is embedded in our metric, and the actual compression of images is not needed once the entropy values are obtained. The proposed metric has been tested for a scene change detection application, and encouraging results are presented here","PeriodicalId":191719,"journal":{"name":"2005 IEEE 7th Workshop on Multimedia Signal Processing","volume":"67 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123023603","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Distortion Measures in MPEG-Compressed Domain for Multidimensional Transcoding","authors":"Yong Ju Jung, T. Thang, Yong Man Ro","doi":"10.1109/MMSP.2005.248590","DOIUrl":"https://doi.org/10.1109/MMSP.2005.248590","url":null,"abstract":"In order to find the optimal combination of spatio-SNR-temporal transcoding operations, we need to measure the distortion due to the transcoding operations. In this paper, we develop computational methods to calculate the distortion by using only the information extracted directly from the input bitstream through a minimum decoding process in the DCT domain. The objective of our distortion modeling is to estimate the multidimensional distortion before the entire transcoding process","PeriodicalId":191719,"journal":{"name":"2005 IEEE 7th Workshop on Multimedia Signal Processing","volume":"43 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123080166","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Speech-and Network-Adaptive Layered G. 729 Coder for Loss Concealments of Real-Time Voice Over IP","authors":"B. Sat, B. Wah","doi":"10.1109/MMSP.2005.248569","DOIUrl":"https://doi.org/10.1109/MMSP.2005.248569","url":null,"abstract":"In this paper, we propose a layered CELP speech coding (LC) scheme that adapts dynamically to the characteristics of the speech encoded and the network loss conditions in real-time transmissions of voice over IP. Based on the ITU G.729 CS-ACELP codec operating at 8 Kbps, we design a variable bit-rate codec that is robust to losses and delays in IP networks. To cope with bursty losses while maintaining an acceptable end-to-end delay, our scheme employs LC with redundant piggybacking of perceptually important parameters in the base layer, with a degree of redundancy adjusted according to feedback from receivers. Under various delay constraints, we study trade-offs between the additional bit rate required for redundant piggybacking and the protection of perceptually important parameters. Experimental results show that our scheme works well and has quality comparable to full replication","PeriodicalId":191719,"journal":{"name":"2005 IEEE 7th Workshop on Multimedia Signal Processing","volume":"06 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115474355","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Nonlinearity compensated smooth frame insertion for motion-blur reduction in LCD","authors":"Hanfeng Chen, Sung-Soo Kim, Sung-Hee Lee, Ohjae Kwon, Jun-Ho Sung","doi":"10.1109/MMSP.2005.248646","DOIUrl":"https://doi.org/10.1109/MMSP.2005.248646","url":null,"abstract":"A nonlinearity compensated smooth frame insertion (NCSFI) method is proposed to reduce motion-blur caused by hold-type display in LCD. Firstly, each original frame is duplicated into two frames. Then in the first frame, spatial high frequency is removed and in the second frame, the same amount of high frequency is enhanced. Finally, look-up table based nonlinearity compensation is applied to the two new frames to compensate the nonlinearity between gray level and panel luminance. Experiments on real-time display system show that the proposed NCSFI method can reduce motion-blur significantly without luminance distortion","PeriodicalId":191719,"journal":{"name":"2005 IEEE 7th Workshop on Multimedia Signal Processing","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123620406","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Flashlight Scene Detection for MPEG Videos","authors":"Jian Wang, Yanling Xu, Songyu Yu, Yuanhua Zhou","doi":"10.1109/MMSP.2005.248676","DOIUrl":"https://doi.org/10.1109/MMSP.2005.248676","url":null,"abstract":"A new flashlight scene detection approach is presented. It focuses on two techniques: a new flashlight model based on the spatial-temporal characteristics of the intensity values of the DC sequence, and a local threshold selection scheme using a sliding window. The advantages of this approach are its good performance for multi-frame and gradual flashlight scene cases, and local threshold selection. Experimental results show that the proposed algorithm is fast, robust, and highly accurate","PeriodicalId":191719,"journal":{"name":"2005 IEEE 7th Workshop on Multimedia Signal Processing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129451478","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}