2012 IEEE 14th International Workshop on Multimedia Signal Processing (MMSP)最新文献_第3页

Snakes assisted food image segmentation 蛇辅助食物图像分割

2012 IEEE 14th International Workshop on Multimedia Signal Processing (MMSP) Pub Date : 2012-11-12 DOI: 10.1109/MMSP.2012.6343437

Y. He, N. Khanna, C. Boushey, E. Delp

引用次数: 10

Nonlinear additive model based saliency map weighting strategy for image quality assessment 基于非线性加性模型的显著性图加权图像质量评价策略

2012 IEEE 14th International Workshop on Multimedia Signal Processing (MMSP) Pub Date : 2012-11-12 DOI: 10.1109/MMSP.2012.6343461

Ke Gu, Guangtao Zhai, Xiaokang Yang, Li Chen, Wenjun Zhang

{"title":"Nonlinear additive model based saliency map weighting strategy for image quality assessment","authors":"Ke Gu, Guangtao Zhai, Xiaokang Yang, Li Chen, Wenjun Zhang","doi":"10.1109/MMSP.2012.6343461","DOIUrl":"https://doi.org/10.1109/MMSP.2012.6343461","url":null,"abstract":"Most state-of-the-art image quality metrics are based on the two-step approach: local distortion/fidelity measurement and pooling. During the pooling stage, many weighting strategies have been proposed incorporating properties of the distortion itself, various masking effects and visual attention. Recently, researchers have devoted great enthusiasm and effort to the improvement of image quality assessment using visual saliency models. In this research, it is noticed that visual saliency features of both the original image and the distorted one have impacts on the process of image quality assessment. To reduce the overlapping effects, a nonlinear additive model is proposed to integrate saliency features from the original and distorted images towards improved error weighting results. Our extensive experimental studies on four publicly available image databases (LIVE, TID2008, CSIQ and A57) indicate that the proposed improved nonlinear additive model based saliency map weighting strategy constantly leads to higher prediction accuracy for image quality assessment than traditional methods.","PeriodicalId":325274,"journal":{"name":"2012 IEEE 14th International Workshop on Multimedia Signal Processing (MMSP)","volume":"85 10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134127837","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 25

A novel local audio fingerprinting algorithm 一种新的局部音频指纹识别算法

2012 IEEE 14th International Workshop on Multimedia Signal Processing (MMSP) Pub Date : 2012-11-12 DOI: 10.1109/MMSP.2012.6343429

Mani Malekesmaeili, R. Ward

引用次数: 12

Regularized sequential selection and backtracking removal for CS atom matching CS原子匹配的正则化顺序选择和回溯去除

2012 IEEE 14th International Workshop on Multimedia Signal Processing (MMSP) Pub Date : 2012-11-12 DOI: 10.1109/MMSP.2012.6343442

Chunyan Zeng, Lihong Ma, Ming-hui Du, Jing Tian

{"title":"Regularized sequential selection and backtracking removal for CS atom matching","authors":"Chunyan Zeng, Lihong Ma, Ming-hui Du, Jing Tian","doi":"10.1109/MMSP.2012.6343442","DOIUrl":"https://doi.org/10.1109/MMSP.2012.6343442","url":null,"abstract":"Atom selection is crucial to compressive sensing (CS) reconstruction by orthogonal matching pursuit (OMP), where the look-ahead (LA) OMP algorithm (LAOMP) evaluated final effects of all the LA atoms before they were included into a support set, certainly, a high computation burden has to be suffered. This paper modifies LAOMP method by two folds: 1) Regularization (R-LAOMP) is introduced to restrict the atom selection by similar small residuals, while mutual effects of new selected atoms are considered to alleviate the high computation costs. 2) Backtracking-based (LA-BOMP) atom pruning is employed to remove the most mismatching atoms in support sets to balance the accuracy and the random disturbance in optimization procedures. Accordingly this regularized forward atom evaluation combining backward atom deleting method (R-LA-BOMP) leads to a significant improvement in LAOMP, while a trade-off between performance and complexity is achieved. Experiments of the regularized atom selection and the backtracking pruning algorithms are performed on Gaussian sparse signals, 0-1 sparse signals and speech voices and the results are given.","PeriodicalId":325274,"journal":{"name":"2012 IEEE 14th International Workshop on Multimedia Signal Processing (MMSP)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133436449","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Analysis of mesh-based motion compensation in wavelet lifting of dynamical 3-D+t CT data 三维CT动态数据小波提升中基于网格的运动补偿分析

2012 IEEE 14th International Workshop on Multimedia Signal Processing (MMSP) Pub Date : 2012-11-12 DOI: 10.1109/MMSP.2012.6343432

Wolfgang Schnurrer, T. Richter, Jürgen Seiler, André Kaup

引用次数: 6

Quality-optimized encoding of JPEG images using transform domain sparsification 使用变换域稀疏的JPEG图像的质量优化编码

2012 IEEE 14th International Workshop on Multimedia Signal Processing (MMSP) Pub Date : 2012-11-12 DOI: 10.1109/MMSP.2012.6343453

Junichi Ishida, Gene Cheung, Akira Kubota, Antonio Ortega

{"title":"Quality-optimized encoding of JPEG images using transform domain sparsification","authors":"Junichi Ishida, Gene Cheung, Akira Kubota, Antonio Ortega","doi":"10.1109/MMSP.2012.6343453","DOIUrl":"https://doi.org/10.1109/MMSP.2012.6343453","url":null,"abstract":"To account for the unique characteristics and limitations of the human visual system (HVS) when perceiving images, a variety of perceptual quality metrics have been proposed in the literature. Tailoring rate-distortion (RD) optimization for each metric is cumbersome and time-consuming. In this paper, we propose a general RD-optimization strategy called “transform domain bounding box” (BB) that can easily adapt to different quality metrics for JPEG-like block-based encoding of images. First, we define an objective function that is a weighted sum of the l0-norm of the transform coefficients (a proxy for rate) and distortion from the transform domain representation. Next, for a given distortion target τ, we define a don't care region (DCR) that specifies a search region of representations with distortion ≤τ. We then show that the sparsest transform domain representation (lowest encoding rate) inside a BB that tightly contains the DCR can be constructed efficiently. Varying τ to induce different DCRs and corresponding BBs results in a set of constructed sparse representations of different sparsity counts, and the one that optimally trades off rate and distortion can be easily identified as solution to our objective. We show that our proposed BB strategy can be easily re-targeted for three common quality metrics: MSE, MSE-HVS-M and SSIM. Experimental results show that our BB strategy outperformed unoptimized JPEG compression by up to 1dB in PSNR when distortion metric is MSE, up to 2dB when metric is MSE-HVS-M, and up to 0.005 when metric is SSIM.","PeriodicalId":325274,"journal":{"name":"2012 IEEE 14th International Workshop on Multimedia Signal Processing (MMSP)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122373951","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Enhancing recommended video lists for Youtube-like social media 增强youtube类社交媒体的推荐视频列表

2012 IEEE 14th International Workshop on Multimedia Signal Processing (MMSP) Pub Date : 2012-11-12 DOI: 10.1109/MMSP.2012.6343448

Xiaoqiang Ma, Haiyang Wang, Haitao Li, Jiangchuan Liu, Hongbo Jiang

{"title":"Enhancing recommended video lists for Youtube-like social media","authors":"Xiaoqiang Ma, Haiyang Wang, Haitao Li, Jiangchuan Liu, Hongbo Jiang","doi":"10.1109/MMSP.2012.6343448","DOIUrl":"https://doi.org/10.1109/MMSP.2012.6343448","url":null,"abstract":"Youtube-like video sharing sites (VSSes) have gained increasing popularity in recent years. Meanwhile, Facebook-like online social networks (OSNs), have seen their tremendous success in connecting people of common interests. These two new generation of networked services are now bridged in that many users of OSNs share video contents originating from VSSes with their friends, and it has been shown that a significant portion of views of VSSes are attributed to this sharing scheme of social networks. To understand how the video sharing behavior, which is largely based on social relationship, impacts users' viewing pattern, we have conducted a long-term measurement with RenRen and YouKu, the largest online social network and the largest video sharing site in China, respectively. We show that social friends are more likely to have common interests and their sharing behaviors provide guidance to enhance recommended video lists. In this paper, we take a first step toward learning OSN video sharing patterns for VSS video recommendation. An auto-encoder model is developed to learn the social similarity of different videos in terms of their sharing in OSN. We therefore propose a similarity-based strategy to enhance recommended video lists for VSSes. Evaluation results demonstrate that this strategy can remarkably improve the precision in VSSes, as compared to state-of-the-art strategies without social information.","PeriodicalId":325274,"journal":{"name":"2012 IEEE 14th International Workshop on Multimedia Signal Processing (MMSP)","volume":"70 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125098969","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3

Efficient binary representation of delta Quantization Parameter for High Efficiency Video Coding 高效视频编码中delta量化参数的有效二进制表示

2012 IEEE 14th International Workshop on Multimedia Signal Processing (MMSP) Pub Date : 2012-11-12 DOI: 10.1109/MMSP.2012.6343414

K. Chono

引用次数: 0

Improved endoscope distortion correction does not necessarily enhance mucosa-classification based medical decision support systems 改进的内窥镜畸变校正并不一定会增强基于粘膜分类的医疗决策支持系统

2012 IEEE 14th International Workshop on Multimedia Signal Processing (MMSP) Pub Date : 2012-11-12 DOI: 10.1109/MMSP.2012.6343433

Michael Gschwandtner, Jutta Hämmerle-Uhl, Y. Höller, M. Liedlgruber, A. Uhl, A. Vécsei

引用次数: 9

Scalable depth maps with R-D optimized embedding 可扩展深度地图与R-D优化嵌入

2012 IEEE 14th International Workshop on Multimedia Signal Processing (MMSP) Pub Date : 2012-11-12 DOI: 10.1109/MMSP.2012.6343452

R. Mathew, D. Taubman, P. Zanuttigh

引用次数: 2