2010 IEEE International Workshop on Multimedia Signal Processing最新文献_第2页

A weighted approach of missing data technique in cepstra domain based on S-function 一种基于s函数的倒频谱域缺失数据加权方法

2010 IEEE International Workshop on Multimedia Signal Processing Pub Date : 2010-12-10 DOI: 10.1109/MMSP.2010.5661987

Pei Yi, Yubo Ge

引用次数: 3

Improving multiple-F0 estimation by onset detection for polyphonic music transcription 利用起音检测改进复调音乐转录的多重f0估计

2010 IEEE International Workshop on Multimedia Signal Processing Pub Date : 2010-12-10 DOI: 10.1109/MMSP.2010.5661985

F. Canadas-Quesada, F. J. Rodríguez-Serrano, P. Vera-Candeas, N. Ruiz-Reyes, J. Carabias-Orti

引用次数: 3

Object tracking under illumination variations using 2D-cepstrum characteristics of the target 利用目标的二维倒谱特征对光照变化下的目标进行跟踪

2010 IEEE International Workshop on Multimedia Signal Processing Pub Date : 2010-12-10 DOI: 10.1109/MMSP.2010.5662076

Fuat Çogun, A. Cetin

引用次数: 8

H.264-based multiple description coding using motion compensated temporal interpolation 基于h .264的多描述编码，采用运动补偿时间插值

2010 IEEE International Workshop on Multimedia Signal Processing Pub Date : 2010-12-10 DOI: 10.1109/MMSP.2010.5662026

C. Greco, Marco Cagnazzo, B. Pesquet-Popescu

引用次数: 11

Human emotion recognition using real 3D visual features from Gabor library 使用Gabor库中的真实3D视觉特征进行人类情感识别

2010 IEEE International Workshop on Multimedia Signal Processing Pub Date : 2010-12-10 DOI: 10.1109/MMSP.2010.5662073

Tie Yun, L. Guan

{"title":"Human emotion recognition using real 3D visual features from Gabor library","authors":"Tie Yun, L. Guan","doi":"10.1109/MMSP.2010.5662073","DOIUrl":"https://doi.org/10.1109/MMSP.2010.5662073","url":null,"abstract":"Emotional state recognition is an important component for efficient human-computer interaction. Most existing works address this problem using 2D features, but they are sensitive to head pose, clutter, and variations in lighting conditions. The general 3D based methods only consider geometric information for feature extraction. In this paper, we present a real 3D visual features based method for human emotion recognition. 3D geometric information plus colour/density information of the facial expressions are extracted by 3D Gabor library to construct visual feature vectors. The filter's scale, orientation, and shape of the library are specified according to the appearance patterns of the 3D facial expressions. An improved kernel canonical correlation analysis (IKCCA) algorithm is proposed for final decision. From training samples, the semantic ratings that describe the different facial expressions are computed by IKCCA to generate a seven dimensional semantic expression vector. It is applied for learning the correlation with different testing samples. According to this correlation, we estimate the associated expression vector and perform expression classification. From experiment results, our proposed method demonstrates impressive performance.","PeriodicalId":105774,"journal":{"name":"2010 IEEE International Workshop on Multimedia Signal Processing","volume":"42 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122068802","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 18

Spatial intra-prediction based on mixtures of sparse representations 基于稀疏表示混合的空间内预测

2010 IEEE International Workshop on Multimedia Signal Processing Pub Date : 2010-12-10 DOI: 10.1109/MMSP.2010.5662044

Angélique Dremeau, Mehmet Türkan, C. Herzet, C. Guillemot, J. Fuchs

引用次数: 4

Side information refinement for long duration GOPs in DVC DVC中长时间GOPs的边信息细化

2010 IEEE International Workshop on Multimedia Signal Processing Pub Date : 2010-12-10 DOI: 10.1109/MMSP.2010.5662038

G. Petrazzuoli, Thomas Maugey, Marco Cagnazzo, B. Pesquet-Popescu

{"title":"Side information refinement for long duration GOPs in DVC","authors":"G. Petrazzuoli, Thomas Maugey, Marco Cagnazzo, B. Pesquet-Popescu","doi":"10.1109/MMSP.2010.5662038","DOIUrl":"https://doi.org/10.1109/MMSP.2010.5662038","url":null,"abstract":"Side information generation is a critical step in distributed video coding systems. This is performed by using motion compensated temporal interpolation between two or more key frames (KFs). However, when the temporal distance between key frames increases (i.e. when the GOP size becomes large), the linear interpolation becomes less effective. In a previous work we showed that this problem can be mitigated by using high order interpolation. Now, in the case of long duration GOP, state-of-the-art algorithms propose a hierarchical algorithm for side information generation. By using this procedure, the quality of the central interpolated image in a GOP is consistently worse than images closer to the KFs. In this paper we propose a refinement of the central WZFs by higher order interpolation of the already decoded WZFs, that are closer to the WZF to be estimated. So we reduce the fluctuation of side information quality, with a beneficial impact on final rate-distortion characteristics of the system. The experimental results show an improvement on the SI up to 2.71 dB with respect the state-of-the-art and a global improvement of the PSNR on the decoded frames up to 0.71 dB and a bit rate reduction up to 15 %.","PeriodicalId":105774,"journal":{"name":"2010 IEEE International Workshop on Multimedia Signal Processing","volume":"99 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123137701","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 7

Fitting pinna-related transfer functions to anthropometry for binaural sound rendering 拟合与峰峰相关的传递函数到人体测量中用于双耳声音渲染

2010 IEEE International Workshop on Multimedia Signal Processing Pub Date : 2010-12-10 DOI: 10.1109/MMSP.2010.5662018

Simone Spagnol, M. Geronazzo, F. Avanzini

引用次数: 30

Joint source-channel coding/decoding of 3D-ESCOT bitstreams 3D-ESCOT位流的联合源信道编码/解码

2010 IEEE International Workshop on Multimedia Signal Processing Pub Date : 2010-12-10 DOI: 10.1109/MMSP.2010.5662034

M. Abid, M. Kieffer, B. Pesquet-Popescu

引用次数: 2

Parametric stereo extension of ITU-T G.722 based on a new downmixing scheme 基于新下混方案的ITU-T G.722参数立体声扩展

2010 IEEE International Workshop on Multimedia Signal Processing Pub Date : 2010-12-10 DOI: 10.1109/MMSP.2010.5662017

Thi Minh Nguyet Hoang, S. Ragot, Balázs Kövesi, P. Scalart

引用次数: 4