2007 IEEE 9th Workshop on Multimedia Signal Processing: Latest Papers, Page 10

Spatial and Temporal Adaptation of Interpolation Filter For Low Complexity Encoding/Decoding

2007 IEEE 9th Workshop on Multimedia Signal Processing Pub Date : 2007-10-01 DOI: 10.1109/MMSP.2007.4412843

D. Rusanovskyy, M. Gabbouj, K. Ugur

Abstract: Compared to video coding with non-adaptive interpolation filtering, adaptive filters achieve higher compression ratios at the cost of increased encoding and decoding complexity. In our earlier work, we significantly reduced the decoding complexity of adaptive filtering schemes, with minimal impact on coding efficiency, by using different filters and adapting them spatially and temporally. However, our previous scheme required high encoder complexity, as several encoding passes per frame were needed to analyze the input image and optimize the selection of interpolation filters. In this paper, we propose a novel algorithm that does not require multiple encoding passes but still gives similar or better performance. This is achieved by using a modified decision-making function that does not require full reconstruction of the coded frame and uses motion and prediction information more efficiently. In addition, we generalize our previous scheme by introducing additional filters, so that better rate-distortion-complexity tradeoffs are possible. Experimental results show a reduction of up to 50-70% in interpolation complexity, with a penalty of less than 0.13 dB in coding efficiency.

Citations: 3

Perceptual Enhancement for Fully Scalable Audio

2007 IEEE 9th Workshop on Multimedia Signal Processing Pub Date : 2007-10-01 DOI: 10.1109/MMSP.2007.4412811

Te Li, S. Rahardja, S. Koh

Citations: 1

Impact of Additional Noise on Subjective and Objective Quality Assessment in VoIP

2007 IEEE 9th Workshop on Multimedia Signal Processing Pub Date : 2007-10-01 DOI: 10.1109/MMSP.2007.4412813

Zdenek Becvar, L. Novák, J. Zelenka, M. Brada, P. Slepička

Citations: 8

Flexible Video Decoding: A Distributed Source Coding Approach

2007 IEEE 9th Workshop on Multimedia Signal Processing Pub Date : 2007-10-01 DOI: 10.1109/MMSP.2007.4412828

Ngai-Man Cheung, Antonio Ortega

Abstract: We investigate video compression techniques to address problems that require flexible video decoding. In these problems, the encoder has access to a number of candidate predictors that allow it to exploit source signal correlation, but only a subset of these predictors will be available at the decoder. Crucially, the encoder does not know which predictors will be available. Flexible decoding is important in a number of applications, including frame-by-frame forward and backward video playback, multiview video, bitstream switching, and robust video transmission. The main challenge in supporting flexible decoding is that the encoder needs to compress the current frame under uncertainty about which predictor will be available at the decoder. An approach based on conventional "closed loop" prediction, e.g., motion-compensated predictive (MCP) coding in the case of video, could be developed by including multiple possible prediction residues in the bitstream. However, this would lead to a considerable coding performance penalty if all possible predictor combinations are supported, or to drifting if only some combinations are. Moreover, it is not possible in general to guarantee that decoded versions under different prediction scenarios will be identical. In this paper, we propose a distributed source coding (DSC) based algorithm to tackle the problem. The main novelties of the proposed algorithm are that it incorporates different macroblock modes and significance coding within the DSC framework. This, combined with a judicious exploitation of correlation statistics, allows us to achieve competitive coding performance. Using forward/backward video playback as an example, we demonstrate that the proposed algorithm can outperform a solution based on MCP coding.

Citations: 23

Image alignment with rotation manifolds built on sparse geometric expansions

2007 IEEE 9th Workshop on Multimedia Signal Processing Pub Date : 2007-10-01 DOI: 10.1109/MMSP.2007.4412850

E. Kokiopoulou, P. Frossard

Abstract: In this paper, we discuss the problem of aligning patterns under arbitrary rotation. When a generic image pattern is geometrically transformed, it typically spans a (possibly nonlinear) manifold in a high-dimensional space. When the pattern of interest is given by a sparse approximation over a structured dictionary of geometric atoms, we show that the rotation manifold can be expressed analytically as a function of the transformation parameters. At the same time, its high-order derivatives are also given in closed form when the pattern is represented as a sparse linear combination of a few differentiable basis functions. In this framework, the alignment problem is formulated as the minimization of the distance between the reference pattern and the manifold, which boils down to a nonlinear least squares optimization problem. We propose to solve this problem with a Newton-type method, whose solution is facilitated by the analytical expressions of the manifold derivatives. We further derive a global optimization heuristic based on Newton's method, and provide sufficient conditions for computing the global minimizer. Experimental results demonstrate the effectiveness of the proposed methodology for image alignment and rotation-invariant pattern recognition.

Citations: 1

Dynamic FEC-Distortion Optimization for H.264 Scalable Video Streaming

2007 IEEE 9th Workshop on Multimedia Signal Processing Pub Date : 2007-10-01 DOI: 10.1109/MMSP.2007.4412839

Wei-Chung Wen, Hsu-Feng Hsiao, Jen-Yu Yu

Citations: 12

Analyzing the Multimodal Behaviors of Users of a Speech-to-Speech Translation Device by using Concept Matching Scores

2007 IEEE 9th Workshop on Multimedia Signal Processing Pub Date : 2007-10-01 DOI: 10.1109/MMSP.2007.4412867

Jongho Shin, P. Georgiou, Shrikanth S. Narayanan

Citations: 0

Multimodal Sensor Analysis of Sitar Performance: Where is the Beat?

2007 IEEE 9th Workshop on Multimedia Signal Processing Pub Date : 2007-10-01 DOI: 10.1109/MMSP.2007.4412821

M. S. Benning, A. Kapur, B. Till, G. Tzanetakis

Citations: 7

Multiple description image coding with redundant expansions and optimal quantization

2007 IEEE 9th Workshop on Multimedia Signal Processing Pub Date : 2007-10-01 DOI: 10.1109/MMSP.2007.4412844

Ivana Radulovic, P. Frossard

Citations: 4

Rate-Distortion Optimized I-Slice Selection for Low Delay Video Transmission

2007 IEEE 9th Workshop on Multimedia Signal Processing Pub Date : 2007-10-01 DOI: 10.1109/MMSP.2007.4412831

Yuan Lin, A. N. Kim, Eren Gürses, A. Perkis

Abstract: Rate smoothing is essential for achieving lower delay when transmitting real-time video over a network. "Explicit slice-based mode selection" (ESM) was recently proposed as a new way of achieving this goal, together with its inherent quality smoothness and error resilience features. However, previous studies focus on the practical aspects and do not address an optimized solution. In this paper, we propose a rate-distortion (RD) optimized solution for finding the best location and size of the intra-coded slices. The experimental results show that, for a target bit rate, the optimized scheme offers performance close to that of mode selection at the macroblock level over wireless channels with different packet loss rates. Moreover, the optimized ESM algorithm provides the significant advantage of granular bitstream prioritization for network transmission. However, RD-based optimization is in general computationally expensive. We therefore propose a heuristic approach that incorporates both channel statistics and sequence characteristics. Results show that it yields close to optimal performance at lower complexity.

Citations: 4