2007 IEEE 9th Workshop on Multimedia Signal Processing最新文献

筛选
英文 中文
Spatial and Temporal Adaptation of Interpolation Filter For Low Complexity Encoding/Decoding 低复杂度编码/解码中插值滤波器的时空自适应
2007 IEEE 9th Workshop on Multimedia Signal Processing Pub Date : 2007-10-01 DOI: 10.1109/MMSP.2007.4412843
D. Rusanovskyy, M. Gabbouj, K. Ugur
{"title":"Spatial and Temporal Adaptation of Interpolation Filter For Low Complexity Encoding/Decoding","authors":"D. Rusanovskyy, M. Gabbouj, K. Ugur","doi":"10.1109/MMSP.2007.4412843","DOIUrl":"https://doi.org/10.1109/MMSP.2007.4412843","url":null,"abstract":"Compared to video coding with non-adaptive interpolation filtering, adaptive filters achieve higher compression ratios, with an increase in encoding and decoding complexity. In our earlier work, we significantly reduced the decoding complexities of adaptive filtering schemes with a minimal impact on the coding efficiency by making use of different filters and adapting them spatially and temporally. However, our previous scheme required high encoder complexity, as several encoding passes per frame were needed to analyze the input image and optimize the selection of interpolation filters. In this paper, a novel algorithm that does not require multiple encoding passes, but still give similar or better performance is proposed. This is achieved by using a modified decision making function that does not require full reconstruction of coded frame and use motion and prediction information more efficiently. In addition, we generalized our previous scheme by introducing additional filters, so that better Rate-Distortion-Complexity tradeoffs are possible. Experimental results show that up-to 50-70% reduction in interpolation complexity is achieved, with less than 0.13 dB penalty on coding efficiency.","PeriodicalId":225295,"journal":{"name":"2007 IEEE 9th Workshop on Multimedia Signal Processing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125437421","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Impact of Additional Noise on Subjective and Objective Quality Assessement in VoIP 附加噪声对VoIP主客观质量评价的影响
2007 IEEE 9th Workshop on Multimedia Signal Processing Pub Date : 2007-10-01 DOI: 10.1109/MMSP.2007.4412813
Zdenek Becvar, L. Novák, J. Zelenka, M. Brada, P. Slepička
{"title":"Impact of Additional Noise on Subjective and Objective Quality Assessement in VoIP","authors":"Zdenek Becvar, L. Novák, J. Zelenka, M. Brada, P. Slepička","doi":"10.1109/MMSP.2007.4412813","DOIUrl":"https://doi.org/10.1109/MMSP.2007.4412813","url":null,"abstract":"The main requirement in the Voice over IP technology is a good quality of received voice signal during communication between subscribers. The signal quality can be influenced by many factors such as packet loss, jitter, packet delay, noise etc. and it can be measured by number of methods. The main purpose of this paper is the investigation of an impact of different noise types and different noise levels on the quality assessment in VoIP. The artificial generated noises and real noises obtained from real telecommunications networks were used for testing. The next goal is a comparison of the results obtained by subjective listening tests and objective measuring methods. PESQ and 3SQM were used for objective testing in this paper.","PeriodicalId":225295,"journal":{"name":"2007 IEEE 9th Workshop on Multimedia Signal Processing","volume":"49 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121435180","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Flexible Video Decoding: A Distributed Source Coding Approach 灵活的视频解码:一种分布式源编码方法
2007 IEEE 9th Workshop on Multimedia Signal Processing Pub Date : 2007-10-01 DOI: 10.1109/MMSP.2007.4412828
Ngai-Man Cheung, Antonio Ortega
{"title":"Flexible Video Decoding: A Distributed Source Coding Approach","authors":"Ngai-Man Cheung, Antonio Ortega","doi":"10.1109/MMSP.2007.4412828","DOIUrl":"https://doi.org/10.1109/MMSP.2007.4412828","url":null,"abstract":"We investigate video compression techniques to address problems that require flexible video decoding. In these, the encoder has access to a number of candidate predictors that allow it to exploit source signal correlation, but only a subset of these predictors will be available at the decoder. Crucially, the encoder does not know which predictors will be available. Flexible decoding is important in a number of applications including frame-by-frame forward and backward video playback, multiview video, bitstreams switching, robust video transmission, etc. The main challenge to support flexible decoding is that the encoder needs to compress a current frame under the uncertainty on the predictor at decoder. An approach based on conventional \"closed loop\" prediction, e.g., motion-compensated predictive (MCP) coding in the case of video, could be developed by including multiple possible prediction residues in the bitstream, but this would lead to a considerable coding performance penalty, if all possible predictor combinations are supported, or to drifting, if only some combinations are. Moreover, it is not possible in general to guarantee that decoded versions under different prediction scenarios will be identical. In this paper, we propose a distributed source coding (DSC) based algorithm to tackle the problem. The main novelties of the proposed algorithm are that it incorporates different macroblock modes and significance coding within the DSC framework. This, combined with a judicious exploitation of correlation statistics, allows us to achieve competitive coding performance. Using forward/backward video playback as an example, we demonstrate the proposed algorithm can outperform a solution based on MCP coding.","PeriodicalId":225295,"journal":{"name":"2007 IEEE 9th Workshop on Multimedia Signal Processing","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132518502","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 23
Efficient Dependency Tracking in Packetised Media Streams 在打包媒体流中有效的依赖跟踪
2007 IEEE 9th Workshop on Multimedia Signal Processing Pub Date : 2007-10-01 DOI: 10.1109/MMSP.2007.4412837
Alexander Eichhorn
{"title":"Efficient Dependency Tracking in Packetised Media Streams","authors":"Alexander Eichhorn","doi":"10.1109/MMSP.2007.4412837","DOIUrl":"https://doi.org/10.1109/MMSP.2007.4412837","url":null,"abstract":"Scheduling and error control mechanisms for robust delivery of media streams over packet networks rely on distortion metrics to optimally allocate resources and protect streams front uncontrolled quality degradation. Current distortion metrics are accurate, but the actual distortion values are expensive to obtain. Therefore, distortion models often assume fixed dependency patterns and neglect fragmentation issues. While this decreases runtime complexity, it also limits the application of such models to special stream classes and network environments. In response, we present a practical, efficient and format-independent framework to reason about dependencies in media streams. Based on correlation analysis we show that the estimations made by our framework match traditional distortion metrics for a number of H.264 encoded streams. Performance benchmarks indicate, that our framework is applicable at very-low computational overheads.","PeriodicalId":225295,"journal":{"name":"2007 IEEE 9th Workshop on Multimedia Signal Processing","volume":"54 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130697775","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Dynamic FEC-Distortion Optimization for H.264 Scalable Video Streaming H.264可扩展视频流的动态fec失真优化
2007 IEEE 9th Workshop on Multimedia Signal Processing Pub Date : 2007-10-01 DOI: 10.1109/MMSP.2007.4412839
Wei-Chung Wen, Hsu-Feng Hsiao, Jen-Yu Yu
{"title":"Dynamic FEC-Distortion Optimization for H.264 Scalable Video Streaming","authors":"Wei-Chung Wen, Hsu-Feng Hsiao, Jen-Yu Yu","doi":"10.1109/MMSP.2007.4412839","DOIUrl":"https://doi.org/10.1109/MMSP.2007.4412839","url":null,"abstract":"Forward error correction codes have been shown to be a feasible solution either in application layer or in link layer to fulfill the need of quality of service for multimedia streaming over the fluctuant channels. In this paper, we propose FEC-distortion optimization algorithms to efficiently utilize the bandwidth for better video quality. The optimization criterions are based on the unequal error protection by taking account of the error drifting problems from both temporal motion compensation and inter-layer prediction of H.264/MPEG-4 AVC scalable video coding. Also, it can adapt to the content-dependent quality contribution of each video frame in a video layer. Lightweight error-concealment is also incorporated with the proposed algorithms for better H.264 SVC streaming. For some applications where either computation might be the bottleneck or the upper bound of non-decodable probability of each video layer is specified, alternative bandwidth allocation algorithm is provided with the trade-off of slight quality degradation.","PeriodicalId":225295,"journal":{"name":"2007 IEEE 9th Workshop on Multimedia Signal Processing","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133228726","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Image alignment with rotation manifolds built on sparse geometric expansions 基于稀疏几何展开的旋转流形图像对齐
2007 IEEE 9th Workshop on Multimedia Signal Processing Pub Date : 2007-10-01 DOI: 10.1109/MMSP.2007.4412850
E. Kokiopoulou, P. Frossard
{"title":"Image alignment with rotation manifolds built on sparse geometric expansions","authors":"E. Kokiopoulou, P. Frossard","doi":"10.1109/MMSP.2007.4412850","DOIUrl":"https://doi.org/10.1109/MMSP.2007.4412850","url":null,"abstract":"In this paper we discuss the problem of alignment of patterns under arbitrary rotation. When a generic image pattern is geometrically transformed, it typically spans a (possibly nonlinear) manifold in a high dimensional space. When the pattern of interest is given by a sparse approximation over a structured dictionary of geometric atoms, we show that the rotation manifold can be expressed analytically as a function of the transformation parameters. At the same time, its high order derivatives are also given in a closed form when the pattern is represented as a sparse linear combination of a few differentiable basis functions. In this framework, the alignment problem is formulated as the minimization of the distance between the reference pattern and the manifold, which boils down to a nonlinear least squares optimization problem. We propose to solve this problem by a Newton-type method, whose solution is facilitated by the analytical expressions of the manifold derivatives. We further derive a global optimization heuristic algorithm based on Newton, and provide sufficient conditions for computing the global minimizer. Experimental results demonstrate the effectiveness of the proposed methodology for image alignment and rotation invariant pattern recognition.","PeriodicalId":225295,"journal":{"name":"2007 IEEE 9th Workshop on Multimedia Signal Processing","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134457731","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
New Detectors for Watermarks with Unknown Power Based on Student-t Image Priors 基于Student-t图像先验的未知功率水印检测器
2007 IEEE 9th Workshop on Multimedia Signal Processing Pub Date : 2007-10-01 DOI: 10.1109/MMSP.2007.4412889
Antonis Mairgiotis, G. Chantas, N. Galatsanos, K. Blekas, Yongyi Yang
{"title":"New Detectors for Watermarks with Unknown Power Based on Student-t Image Priors","authors":"Antonis Mairgiotis, G. Chantas, N. Galatsanos, K. Blekas, Yongyi Yang","doi":"10.1109/MMSP.2007.4412889","DOIUrl":"https://doi.org/10.1109/MMSP.2007.4412889","url":null,"abstract":"In this paper we present new detectors for additive watermarks when the power of the watermark is unknown. These detectors are based on modeling the image using student-t statistics. As a result, due to the generative properties of the student-t density function, such models are spatially adaptive and the Expectation-Maximization algorithm can be used to obtain maximum likelihood estimates of their parameters. Using these image models detectors based on the generalized likelihood ratio and Rao tests are derived for this problem. Numerical experiments are presented that demonstrate the properties of these detectors and compared them with previously proposed detectors.","PeriodicalId":225295,"journal":{"name":"2007 IEEE 9th Workshop on Multimedia Signal Processing","volume":"40 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120963589","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Multiple description image coding with redundant expansions and optimal quantization 具有冗余展开和最优量化的多描述图像编码
2007 IEEE 9th Workshop on Multimedia Signal Processing Pub Date : 2007-10-01 DOI: 10.1109/MMSP.2007.4412844
Ivana Radulovic, P. Frossard
{"title":"Multiple description image coding with redundant expansions and optimal quantization","authors":"Ivana Radulovic, P. Frossard","doi":"10.1109/MMSP.2007.4412844","DOIUrl":"https://doi.org/10.1109/MMSP.2007.4412844","url":null,"abstract":"This paper addresses the problem of optimal rate allocation for multiple description coding with redundant signal expansions. In case of redundant descriptions, the quantization of the transform coefficients has clearly to be adapted to the importance of the basis functions, to the redundancy in the representation, and to the expected loss probability on the transmission channel. We derive a rate-distortion optimal solution for the scalar quantization of coefficients in redundant signal representations. The application of the optimal rate allocation to a typical image communication problem demonstrates performance gains with respect to scheme based on uniform quantization with fixed step size, and to solutions based on unequal error protection.","PeriodicalId":225295,"journal":{"name":"2007 IEEE 9th Workshop on Multimedia Signal Processing","volume":"52 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125299282","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Usability Evaluation of Finger Pointer for Home-Use Display 家用显示器手指指针的可用性评价
2007 IEEE 9th Workshop on Multimedia Signal Processing Pub Date : 2007-10-01 DOI: 10.1109/MMSP.2007.4412823
I. Yuyama, Shigeki Takiura, Yasumasa Numata, H. Hasegawa, Yuki Watanabe
{"title":"Usability Evaluation of Finger Pointer for Home-Use Display","authors":"I. Yuyama, Shigeki Takiura, Yasumasa Numata, H. Hasegawa, Yuki Watanabe","doi":"10.1109/MMSP.2007.4412823","DOIUrl":"https://doi.org/10.1109/MMSP.2007.4412823","url":null,"abstract":"With the onset of interactive digital data broadcasting, a user-friendly pointing device for controlling home use displays such as digital HDTVs is desirable. A linger pointer that does not use any artificial devices in the hand is a suitable candidate. This paper describes the basic design and usability of the finger pointer. To obtain a degree of freedom in the operation, we propose a finger pointer used in the crooked elbow position. In the experiments, we detect the position of the fingertip with a stereo camera and examine whether there is a datum point on the body that has sufficient accuracy for our purpose. Then, the psychological plane where the display is mapped by the finger is measured. The performance of the finger pointer is evaluated in accordance with IS09241-9. Moreover, we examine the subjective evaluation test and a comparison test using the mouse. The experimental results show the finger pointer to be a promising device for domestic use.","PeriodicalId":225295,"journal":{"name":"2007 IEEE 9th Workshop on Multimedia Signal Processing","volume":"49 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129483398","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
A System for Technology Based Assessment of Language and Literacy in Young Children: the Role of Multiple Information Sources 基于技术的幼儿语言和读写能力评估系统:多种信息源的作用
2007 IEEE 9th Workshop on Multimedia Signal Processing Pub Date : 2007-10-01 DOI: 10.1109/MMSP.2007.4412810
A. Alwan, Yijian Bai, M. Black, Larry Casey, M. Gerosa, M. Heritage, Markus R Iseli, Barbara Jones, Ebrahim (Abe) Kazemzadeh, Sungbok Lee, Shrikanth S. Narayanan, P. Price, J. Tepperman, Shizhen Wang
{"title":"A System for Technology Based Assessment of Language and Literacy in Young Children: the Role of Multiple Information Sources","authors":"A. Alwan, Yijian Bai, M. Black, Larry Casey, M. Gerosa, M. Heritage, Markus R Iseli, Barbara Jones, Ebrahim (Abe) Kazemzadeh, Sungbok Lee, Shrikanth S. Narayanan, P. Price, J. Tepperman, Shizhen Wang","doi":"10.1109/MMSP.2007.4412810","DOIUrl":"https://doi.org/10.1109/MMSP.2007.4412810","url":null,"abstract":"This paper describes the design and realization of an automatic system for assessing and evaluating the language and literacy skills of young children. This system was developed in the context of the TBALL (technology based assessment of language and literacy) project and aims at automatically assessing the English literacy skills of both native talkers of American English and Mexican-American children in grades K-2. The automatic assessments were carried out employing appropriate speech recognition and understanding techniques. In this paper, we describe the system focusing on the role of the multiple sources of information at our disposal. We present the content of the assessment system, discuss some issues in creating a child-friendly interface, and how to provide a suitable feedback to the teachers. In addition, we will discuss the different assessment modules and the different algorithms used for speech analysis.","PeriodicalId":225295,"journal":{"name":"2007 IEEE 9th Workshop on Multimedia Signal Processing","volume":"64 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127731492","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 50
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信
小红书