2007 IEEE 9th Workshop on Multimedia Signal Processing最新文献_第6页

On Design of Linear Minimum-Entropy Predictor 线性最小熵预测器的设计

2007 IEEE 9th Workshop on Multimedia Signal Processing Pub Date : 2007-10-01 DOI: 10.1109/MMSP.2007.4412852

X. Wang, Xiaolin Wu

引用次数: 3

Wavelet-Based Multi-View Video Coding with Spatial Scalability 基于小波的空间可扩展性多视点视频编码

2007 IEEE 9th Workshop on Multimedia Signal Processing Pub Date : 2007-10-01 DOI: 10.1109/MMSP.2007.4412906

Jens-Uwe Garbas, A. Kaup

引用次数: 7

Cross-Layer Adaptive ARQ for Uplink Video Streaming in Tandem Wireless/Wireline Networks 无线/有线串联网络中上行视频流的跨层自适应ARQ

2007 IEEE 9th Workshop on Multimedia Signal Processing Pub Date : 2007-10-01 DOI: 10.1109/MMSP.2007.4412840

A. Argyriou

引用次数: 4

Symmetric Distributed Arithmetic Coding of Correlated Sources 相关源对称分布式算法编码

2007 IEEE 9th Workshop on Multimedia Signal Processing Pub Date : 2007-10-01 DOI: 10.1109/MMSP.2007.4412830

Marco Grangetto, E. Magli, G. Olmo

引用次数: 15

Multiscale Integral Invariants For Facial Landmark Detection in 2.5D Data 基于多尺度积分不变量的2.5D人脸标记检测

2007 IEEE 9th Workshop on Multimedia Signal Processing Pub Date : 2007-10-01 DOI: 10.1109/MMSP.2007.4412846

Adam Slater, Y. Hu, N. Boston

引用次数: 1

Content-based Video Signatures based on Projections of Difference Images 基于差分图像投影的基于内容的视频签名

2007 IEEE 9th Workshop on Multimedia Signal Processing Pub Date : 2007-10-01 DOI: 10.1109/MMSP.2007.4412886

R. Radhakrishnan, C. Bauer

引用次数: 35

Analysis of multimodal binary detection systems based on dependent/independent modalities 基于依赖/独立模态的多模态二元检测系统分析

2007 IEEE 9th Workshop on Multimedia Signal Processing Pub Date : 2007-10-01 DOI: 10.1109/MMSP.2007.4412820

O. Koval, S. Voloshynovskiy, T. Pun

引用次数: 11

Combining Vocal Source and MFCC Features for Enhanced Speaker Recognition Performance Using GMMs 结合声源和MFCC功能，使用GMMs增强说话人识别性能

2007 IEEE 9th Workshop on Multimedia Signal Processing Pub Date : 2007-10-01 DOI: 10.1109/MMSP.2007.4412892

Danoush Hosseinzadeh, S. Krishnan

{"title":"Combining Vocal Source and MFCC Features for Enhanced Speaker Recognition Performance Using GMMs","authors":"Danoush Hosseinzadeh, S. Krishnan","doi":"10.1109/MMSP.2007.4412892","DOIUrl":"https://doi.org/10.1109/MMSP.2007.4412892","url":null,"abstract":"This work presents seven novel spectral features for speaker recognition. These features are the spectral centroid (SC), spectral bandwidth (SBW), spectral band energy (SBE), spectral crest factor (SCF), spectral flatness measure (SFM), Shannon entropy (SE) and Renyi entropy (RE). The proposed spectral features can quantify some of the characteristics of the vocal source or the excitation component of speech. This is useful for speaker recognition since vocal source information is known to be complementary to the vocal tract transfer function, which is usually obtained using the Mel frequency cepstral coefficients (MFCC) or linear predication cepstral coefficients (LPCC). To evaluate the performance of the spectral features, experiments were performed using a text-independent cohort Gaussian mixture model (GMM) speaker identification system. Based on 623 users from the TIMIT database, the spectral features achieved an identification accuracy of 99.33% when combined with the MFCC based features and when using undistorted speech. This represents a 4.03% improvement over the baseline system trained with only MFCC and DeltaMFCC features.","PeriodicalId":225295,"journal":{"name":"2007 IEEE 9th Workshop on Multimedia Signal Processing","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132725997","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 62

Experiments in Automatic Genre Classification of Full-length Music Tracks using Audio Activity Rate 基于音频活动率的全长音乐自动类型分类实验

2007 IEEE 9th Workshop on Multimedia Signal Processing Pub Date : 2007-10-01 DOI: 10.1109/MMSP.2007.4412827

Shiva Sundaram, Shrikanth S. Narayanan

引用次数: 9

A Multimodal Image Registration and Fusion Methodology Applied to Drug Discovery Research 一种多模态图像配准与融合方法在药物发现研究中的应用

2007 IEEE 9th Workshop on Multimedia Signal Processing Pub Date : 2007-10-01 DOI: 10.1109/MMSP.2007.4412883

S. Makrogiannis, J. Wellen, Y. Wu, L. Bloy, S. Sarkar

引用次数: 4