{"title":"An integrated approach for efficient analysis of facial expressions","authors":"M. Ghayoumi, A. Bansal","doi":"10.5220/0005116702110219","DOIUrl":"https://doi.org/10.5220/0005116702110219","url":null,"abstract":"This paper describes a new automated facial expression analysis system that integrates Locality Sensitive Hashing (LSH) with Principal Component Analysis (PCA) and Linear Discriminant Analysis (LDA) to improve the execution efficiency of emotion classification and the continuous identification of unidentified facial expressions. Images are classified using feature vectors on the two most significant segments of the face: the eye segments and the mouth segment. LSH uses a family of hashing functions to map similar images into a set of collision buckets. Taking a representative image from each cluster reduces the image space by pruning redundant similar images in the collision buckets. The application of PCA and LDA reduces the dimension of the data space. We describe the overall architecture and the implementation. The performance results show that the integration of LSH with PCA and LDA significantly improves computational efficiency, and improves accuracy by reducing the frequency bias of similar images during the PCA and SVM stages. After classifying the images in the database, we tag the collision buckets with basic emotions, and apply LSH to new unidentified facial expressions to identify the emotions. This LSH-based identification is suitable for fast continuous recognition of unidentified facial expressions.","PeriodicalId":438702,"journal":{"name":"2014 International Conference on Signal Processing and Multimedia Applications (SIGMAP)","volume":"54 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126654433","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Auditory features analysis for BIC-based audio segmentation","authors":"T. Maka","doi":"10.5220/0005063800480053","DOIUrl":"https://doi.org/10.5220/0005063800480053","url":null,"abstract":"Audio segmentation is one of the stages in the audio processing chain whose accuracy plays a primary role in the final performance of audio recognition and processing tasks. This paper presents an analysis of auditory features for audio segmentation. A set of features is derived from a time-frequency representation of the input signal and is calculated based on properties of the human auditory system. The efficiency of several sets of audio features for BIC-based audio segmentation has been analysed. The obtained results show that auditory features derived from different frequency scales are competitive with the widely used MFCC features in terms of accuracy and the number of detected points.","PeriodicalId":438702,"journal":{"name":"2014 International Conference on Signal Processing and Multimedia Applications (SIGMAP)","volume":"55 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115926967","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"3D dual-tree discrete wavelet transform based multiple description video coding","authors":"J. Chen, Jie Liao, Yuhang Yang, C. Cai","doi":"10.3233/JCM-160704","DOIUrl":"https://doi.org/10.3233/JCM-160704","url":null,"abstract":"A 3D dual-tree discrete wavelet transform (DT-DWT) based multiple description video coding algorithm is proposed to combat transmission errors or packet loss caused by Internet or wireless network channel failure. Each description of the proposed multiple description coding scheme consists of a base layer and an enhancement layer. First, the input image sequence is encoded by a standard H.264 encoder at a low bit rate to form the base layer, which is then duplicated into each description. Second, the difference between the reconstructed base layer and the input image sequence is encoded by a 3D dual-tree wavelet encoder to produce four coefficient trees. After noise shaping, these four trees are partitioned into two groups, individually forming the enhancement layers of the two descriptions. Since the 3D DT-DWT provides 28 directional subbands, the enhancement layer can be coded without motion estimation. The rich directional selectivity of the DT-DWT solves the mismatch problem and improves coding efficiency. If all descriptions are available at the receiver, a high quality video can be reconstructed by a central decoder. If only one description is received, a side decoder can be used to reconstruct the source with acceptable quality. Simulation results show that the quality of video reconstructed by the proposed algorithm is superior to that of state-of-the-art multiple description video coding methods.","PeriodicalId":438702,"journal":{"name":"2014 International Conference on Signal Processing and Multimedia Applications (SIGMAP)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121278969","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Automatic letter/pillarbox detection for optimized display of digital TV","authors":"L. Carreira, Tiago Rosa Maria Paula Queluz","doi":"10.5220/0005064202810288","DOIUrl":"https://doi.org/10.5220/0005064202810288","url":null,"abstract":"In this paper we propose a method for the automatic detection of the true aspect ratio of digital video, by detecting the presence and width of horizontal and vertical black bars, also known as letterbox and pillarbox effects. If active format description (AFD) metadata is not present, the proposed method can be used to identify the right AFD and associate it with the video content. If AFD information is present, the method can be used to verify its correctness and to correct it in case of error. Additionally, the proposed method can detect whether relevant information (such as broadcaster logos and hard subtitles) is merged within the black bars and, in the case of subtitles, is able to extract it from the bars and relocate it to the active picture area (allowing letterbox removal).","PeriodicalId":438702,"journal":{"name":"2014 International Conference on Signal Processing and Multimedia Applications (SIGMAP)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122670241","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Lip tracking using particle filter and geometric model for visual speech recognition","authors":"Islem Jarraya, S. Werda, W. Mahdi","doi":"10.5220/0005045601720179","DOIUrl":"https://doi.org/10.5220/0005045601720179","url":null,"abstract":"Automatic lip-reading is a technology that helps in understanding messages exchanged in noisy environments or in the case of hearing impairment in the elderly. Such a system requires three subsystems: a lip locating and tracking system, a labial descriptor extraction system, and a classification and speech recognition system. In this work, we present a spatio-temporal approach to track and characterize lip movements for the automatic recognition of visemes of the French language. First, we segment the lips using color information and a geometric model of the lips. Then, we apply a particle filter to track lip movements. Finally, we propose to extract and classify the visual information to recognize the pronounced viseme. This approach is applied with multiple speakers in natural conditions.","PeriodicalId":438702,"journal":{"name":"2014 International Conference on Signal Processing and Multimedia Applications (SIGMAP)","volume":"79 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131076715","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Smoothed surface transitions for human motion synthesis","authors":"A. Doshi","doi":"10.5220/0005122400730079","DOIUrl":"https://doi.org/10.5220/0005122400730079","url":null,"abstract":"Multiview techniques to reconstruct an animation from 3D video have advanced in leaps and bounds in recent years. It is now possible to synthesise a 3D animation by fusing motions between different sequences. Prior work in this area has established methods to successfully identify inter-sequence transitions of different or similar actions. In some instances, however, the transitions at these nodes in the motion path cause an abrupt change between the motion sequences. Hence, this paper proposes a framework that smooths these inter-sequence transitions while preserving the detailed dynamics of the captured movement. Laplacian-based mesh deformation, in addition to shape- and appearance-based feature methods, including SIFT and MeshHOG features, is used to obtain temporally consistent meshes. These meshes are then interpolated within a temporal window and concatenated to reproduce a seamless transition between the motion sequences. A quantitative analysis of the inter-sequence transitions, evaluated using the three-dimensional shape-based Hausdorff distance, is presented for the synthesised 3D animations.","PeriodicalId":438702,"journal":{"name":"2014 International Conference on Signal Processing and Multimedia Applications (SIGMAP)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128964866","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Towards a wake-up and synchronization mechanism for Multiscreen applications using iBeacon","authors":"Louay Bassbouss, Görkem Güçlü, S. Steglich","doi":"10.5220/0005121800670072","DOIUrl":"https://doi.org/10.5220/0005121800670072","url":null,"abstract":"TV sets and companion devices (smartphones, tablets, etc.) have outgrown their original purpose and now play an important role together in offering the best user experience on multiple screens. However, the collaboration between TV and companion applications faces challenges that go beyond traditional single-screen applications. These range from discovery, wake-up and pairing of devices, to application launch, communication, synchronization and adaptation to the target device and screen size. In this position paper, we limit ourselves to two of these aspects and introduce an idea for a new wake-up and synchronization mechanism for Multiscreen applications using iBeacon technology.","PeriodicalId":438702,"journal":{"name":"2014 International Conference on Signal Processing and Multimedia Applications (SIGMAP)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130894512","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Methods and algorithms of cluster analysis in the mining industry: Solution of tasks for mineral rocks recognition","authors":"O. Baklanova, O. Shvets","doi":"10.5220/0005022901650171","DOIUrl":"https://doi.org/10.5220/0005022901650171","url":null,"abstract":"We describe an algorithm for the automatic segmentation of colour images of ores using cluster analysis methods, with examples illustrating its use in solving mineral rock recognition problems. Results of studies with k-means clustering are demonstrated for different colour spaces. A technique for pre-computing the centroid values is proposed, together with formulas for translating metrics to the HSV colour space. The effectiveness of the proposed method lies in the automatic identification of objects of interest in the overall image; the algorithm's tuning parameter is a number indicating the number of segments to be allocated. This paper thus gives a short description of a cluster analysis algorithm for mineral rock recognition in the mining industry.","PeriodicalId":438702,"journal":{"name":"2014 International Conference on Signal Processing and Multimedia Applications (SIGMAP)","volume":"32 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134211139","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Gender classification using M-estimator based radial basis function neural network","authors":"Chien-Cheng Lee","doi":"10.5220/0005117103020306","DOIUrl":"https://doi.org/10.5220/0005117103020306","url":null,"abstract":"A gender classification method using an M-estimator based radial basis function (RBF) neural network is proposed in this paper. In the proposed method, three types of effective features, including facial texture features, hair geometry features, and moustache features, are extracted from a face image. Then, an improved RBF neural network based on an M-estimator is proposed to classify the gender according to the extracted features. The improved RBF network uses an M-estimator to replace the traditional least-mean-square (LMS) criterion to deal with outliers in the data set. The FERET database is used to evaluate our method in the experiment. In the FERET data set, 600 images are chosen, of which 300 are used as training data and the rest as test data. The experimental results show that the proposed method produces good performance.","PeriodicalId":438702,"journal":{"name":"2014 International Conference on Signal Processing and Multimedia Applications (SIGMAP)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131455972","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"HMM-based breath and filled pauses elimination in ASR","authors":"Piotr Żelasko, T. Jadczyk, B. Ziółko","doi":"10.5220/0005023002550260","DOIUrl":"https://doi.org/10.5220/0005023002550260","url":null,"abstract":"The phenomena of filled pauses and breaths pose a challenge to Automatic Speech Recognition (ASR) systems dealing with spontaneous speech, including recognizer modules in Interactive Voice Response (IVR) systems. We suggest a method based on Hidden Markov Models (HMM), which is easily integrated into HMM-based ASR systems and allows detection of these disturbances without incorporating additional parameters. Our method involves training models of the disturbances and inserting them in the phrase Markov chain between word-final and word-initial phoneme models. Application of the method in our ASR shows improved recognition results on the Polish telephone speech corpus LUNA.","PeriodicalId":438702,"journal":{"name":"2014 International Conference on Signal Processing and Multimedia Applications (SIGMAP)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133732087","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}