Title: On the enhancement of dereverberation algorithms using multiple perceptual-evaluation criteria
Authors: Rafael Zambrano-Lopez, T. Prego, A. Lima, S. L. Netto
Venue: 2016 IEEE 18th International Workshop on Multimedia Signal Processing (MMSP), September 2016
DOI: https://doi.org/10.1109/MMSP.2016.7813373
Abstract: This paper describes an enhancement strategy for dereverberation algorithms based on several perceptual-assessment criteria. The complete procedure is applied to an algorithm for reverberant speech enhancement based on single-channel blind spectral subtraction. The enhancement was implemented by combining different quality measures, namely QAreverb, the speech-to-reverberation modulation energy ratio (SRMR), and the perceptual evaluation of speech quality (PESQ). Experimental results on a 4211-signal speech database indicate that the proposed modifications can improve the word error rate (WER) of speech recognition systems by an average of 20%.
Title: Robust sound event classification by using denoising autoencoder
Authors: Jianchao Zhou, Liqun Peng, Xiaoou Chen, Deshun Yang
Venue: 2016 IEEE 18th International Workshop on Multimedia Signal Processing (MMSP), September 2016
DOI: https://doi.org/10.1109/MMSP.2016.7813376
Abstract: Over the last decade, a great deal of research has been devoted to sound event classification, but a main problem is that performance degrades sharply in the presence of noise. Since spectrogram-based image features and denoising autoencoders reportedly perform well in noisy conditions, this paper proposes a new robust feature for sound event classification, the denoising autoencoder image feature (DIF), which is extracted from an image-like representation produced by a denoising autoencoder. The feature is evaluated in a classification experiment using an SVM classifier on audio examples with different noise levels, and is compared with baseline features including mel-frequency cepstral coefficients (MFCC) and the spectrogram image feature. The proposed DIF demonstrates better performance under noise-corrupted conditions.
{"title":"Perceptual video quality assessment: Spatiotemporal pooling strategies for different distortions and visual maps","authors":"Mohammed A. Aabed, G. Al-Regib","doi":"10.1109/MMSP.2016.7813336","DOIUrl":"https://doi.org/10.1109/MMSP.2016.7813336","url":null,"abstract":"In this paper, we investigate the challenge of distortion map feature selection and spatiotemporal pooling in perceptual video quality assessment (PVQA). We analyze three distortion maps representing different visual features spatially and temporally: squared error, local pixel-level SSIM, and absolute difference of optical flow magnitudes. We examine the performance of each of these maps with different spatial and temporal pooling strategies across three databases. We identify the most effective statistical pooling strategies spatially and temporally with respect to PVQA. We also show the most significant spatial and temporal features correlated with perception for every distortion/feature map. Our results show that varying the pooling strategy and distortion maps yields a significant improvement in perceptual quality estimation. We also deduce insights from our results to better understand the sensitivity of human vision to distortions. We aim for these findings to provide perceptual cues and guidelines to researchers during metric design, perceptual feature selection, HVS modeling and pooling selection/optimization. We further show that the same distortions across databases can yield different results in terms of PVQA evaluation and verification.","PeriodicalId":113192,"journal":{"name":"2016 IEEE 18th International Workshop on Multimedia Signal Processing (MMSP)","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130087249","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Face video based touchless blood pressure and heart rate estimation","authors":"Monika Jain, Sujay Deb, A. Subramanyam","doi":"10.1109/MMSP.2016.7813389","DOIUrl":"https://doi.org/10.1109/MMSP.2016.7813389","url":null,"abstract":"Hypertension (high blood pressure) is the leading cause for increasing number of premature deaths due to cardiovascular diseases. Continuous hypertension screening seems to be a promising approach in order to take appropriate steps to alleviate hypertension-related diseases. Many studies have shown that physiological signal like Photoplethysmogram (PPG) can be reliably used for predicting the Blood Pressure (BP) and Heart Rate (HR). However, the existing approaches use a transmission or reflective type wearable sensor to collect the PPG signal. These sensors are bulky and mostly require an assistance of a trained medical practitioner; which preclude these approaches from continuous BP monitoring outside the medical centers. In this paper, we propose a novel touchless approach that predicts BP and HR using the face video based PPG. Since the facial video can easily be captured using a consumer grade camera, this approach is a convenient way for continuous hypertension monitoring outside the medical centers. The approach is validated using the face video data collected in our lab, with the ground truth BP and HR measured using a clinically approved BP monitor OMRON HBP1300. Accuracy of the method is measured in terms of normalized mean square error, mean absolute error and error standard deviation; which complies with the standards mentioned by Association for the Advancement of Medical Instrumentation. Two-tailed dependent sample t-test is also conducted to verify that there is no statistically significant difference between the BP and HR predicted using the proposed approach and the BP and HR measured using OMRON.","PeriodicalId":113192,"journal":{"name":"2016 IEEE 18th International Workshop on Multimedia Signal Processing (MMSP)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116298706","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Title: Audiovisual quality study for videoconferencing on IP networks
Authors: Ines Saidi, Lu Zhang, Vincent Barriac, O. Déforges
Venue: 2016 IEEE 18th International Workshop on Multimedia Signal Processing (MMSP), September 2016
DOI: https://doi.org/10.1109/MMSP.2016.7813379
Abstract: In this paper, an audiovisual quality assessment experiment was conducted on audiovisual clips collected using a PC-based videoconferencing application connected over a local IP network. The analysis of the experimental results provides a better understanding of the influence of network impairments (packet loss, jitter, delay) on perceived audio and video quality, as well as of their interaction effect on the overall audiovisual quality in videoconferencing applications. We updated the human-perception acceptability limits of audio-video synchronization for videoconferencing. Further, we investigated the contribution of this synchronization to the audiovisual quality, both on its own and in combination with network impairments. Finally, we proposed an integration model to estimate the audiovisual quality in the studied context.
{"title":"A study of the perceptual relevance of the burst phase of stop consonants with implications in speech coding","authors":"Vincent Santini, P. Gournay, R. Lefebvre","doi":"10.1109/MMSP.2016.7813374","DOIUrl":"https://doi.org/10.1109/MMSP.2016.7813374","url":null,"abstract":"Stop consonants are an important constituent of the speech signal. They contribute significantly to its intelligibility and subjective quality. However, because of their dynamic and unpredictable nature, they tend to be difficult to encode using conventional approaches such as linear predictive coding and transform coding. This paper presents a system to detect, segment, and modify stop consonants in a speech signal. This system is then used to assess the following hypothesis: Muting the burst phase of stop consonants has a negligible impact on the subjective quality of speech. The muting operation is implemented and its impact on subjective quality is evaluated on a database of speech signals. The results show that this apparently drastic alteration has in reality very little perceptual impact. The implications for speech coding are then discussed.","PeriodicalId":113192,"journal":{"name":"2016 IEEE 18th International Workshop on Multimedia Signal Processing (MMSP)","volume":"PP 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126709610","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A drift compensated reversible watermarking scheme for H.265/HEVC","authors":"S. Gaj, Shuvendu Rana, A. Sur, P. Bora","doi":"10.1109/MMSP.2016.7813358","DOIUrl":"https://doi.org/10.1109/MMSP.2016.7813358","url":null,"abstract":"In this paper, a compressed domain drift compensated reversible watermarking scheme is proposed with a high embedding capacity and the least amount of visual quality degradation for H.265/HEVC videos. Using compressed domain syntax elements, such as motion vector and transformed residual, a set of 4 × 4 Transform Blocks (TB) of similar texture are chosen from consecutive I Frames for watermark embedding. Due to texture similarity of these selected TBs, the differences between the transformed coefficients are equal or close to zero. Utilizing this difference statistics, a multilevel watermarking is inserted in the compressed video by altering near zeros values in the difference transformed coefficients. A comprehensive set of experiments have been carried out to justify the efficacy of the proposed scheme over existing literature.","PeriodicalId":113192,"journal":{"name":"2016 IEEE 18th International Workshop on Multimedia Signal Processing (MMSP)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126825295","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Fast mode decision for HEVC intra coding with efficient mode skipping and improved RMD","authors":"Xin Lu, Nan Xiao, Yue Hu, Zhilu Wu, G. Martin","doi":"10.1109/MMSP.2016.7813355","DOIUrl":"https://doi.org/10.1109/MMSP.2016.7813355","url":null,"abstract":"HEVC employs a quad-tree based Coding Unit (CU) structure to achieve a significant improvement in coding efficiency compared with previous standards. However, the computational complexity is greatly increased. We proposed a fast mode decision algorithm to reduce intra coding complexity. Firstly, an initial candidate list of intra modes is constructed for each Prediction Unit (PU). The prediction mode correlation between adjacent quad-tree coding levels and between temporal neighbouring frames is used to predict the most likely coding mode. The number of prediction mode that need to be evaluated in residual quad-tree (RQT) process is further reduced by taking the Hadamard cost of prediction mode into consideration. Simulation results show that the proposed algorithm saves encoding time by up to 51% compared with the HM 13.0 implementation, while having a negligible impact on rate distortion.","PeriodicalId":113192,"journal":{"name":"2016 IEEE 18th International Workshop on Multimedia Signal Processing (MMSP)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130458022","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Title: A quantitative real time data analysis in vehicular speech environment with varying SNR
Authors: Sai Prithvi Gadde, Sam Tabaja, Philip Olivier, N. Jaber, Mahdi Ali, R. Chabaan, Scott Bone
Venue: 2016 IEEE 18th International Workshop on Multimedia Signal Processing (MMSP), September 2016
DOI: https://doi.org/10.1109/MMSP.2016.7813375
Abstract: The purpose of this paper is to compare the performance of two common filters operating on noisy speech recorded in automobiles travelling at various speeds. The filters are based on Spectral Subtraction (SS) and Kalman Filtering (KF). The literature contains studies based on simulated data, whereas this paper uses real-time data collected in cars in search of an optimal solution. The comparisons are based on real recorded samples containing noisy speech signals, each approximately 2 minutes long. Different noise-level cases representing the most common situations experienced by drivers were created, with settings that include varying car speed (e.g., 40 mph, 70 mph), fan power, and window position. The study was carried out using three different car models. The measured noisy speech signals were filtered using the two techniques, and the resulting filtered signals were compared in the time domain and the frequency domain, both quantitatively and psychometrically. Furthermore, the quantitative analysis approach was applied to the results for more accurate interpretation. Results show that SS outperforms KF in noise reduction, with much less speech distortion at the different Signal-to-Noise Ratios (SNRs) tested. The results of the human listening tests are comparable with the simulation results. Overall, SS showed superior performance over KF for vehicular hands-free speech applications.
{"title":"Image coding using parametric texture synthesis","authors":"Uday Singh Thakur, Bappaditya Ray","doi":"10.1109/MMSP.2016.7813339","DOIUrl":"https://doi.org/10.1109/MMSP.2016.7813339","url":null,"abstract":"Visual textures like grass, water etc. consist of dense and random variations in contrast that are perceptually indistinguishable by a human eye. Such textures are costly to encode using image and video codecs. For example, in the state-of-the-art compression standard High Efficiency Video Coding (HEVC), detailed textures typically show relatively strong blurring artifacts at low rates (high QPs). Texture synthesis is a process whereby one can obtain a reconstruction of a visually equivalent texture with decent visual quality, given a set of parameters. In this paper, texture synthesis is used as a tool in combination with HEVC, exploiting Human Visual Perception (HVP) properties by creating an artificial textured content using model parameters at the decoder side. A novel scheme for compression (prediction and quantization) of parameters for complex wavelet based texture synthesis is introduced. The compressed parameters are sufficient to synthesize high quality texture content at the decoder side. Simulation results have shown, that with same rates, both the subjective and the objective quality is enhanced, compared to HEVC.","PeriodicalId":113192,"journal":{"name":"2016 IEEE 18th International Workshop on Multimedia Signal Processing (MMSP)","volume":"62 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133435424","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}