{"title":"Bilateral depth-discontinuity filter for novel view synthesis","authors":"Ismaël Daribo, H. Saito","doi":"10.1109/MMSP.2010.5662009","DOIUrl":"https://doi.org/10.1109/MMSP.2010.5662009","url":null,"abstract":"In this paper, a new filtering technique addresses the disocclusions problem issued from the depth image based rendering (DIBR) technique within 3DTV framework. An inherent problem with DIBR is to fill in the newly exposed areas (holes) caused by the image warping process. In opposition with multiview video (MVV) systems, such as free viewpoint television (FTV), where multiple reference views are used for recovering the disocclusions, we consider in this paper a 3DTV system based on a video-plus-depth sequence which provides only one reference view of the scene. To overcome this issue, disocclusion removal can be achieved by pre-processing the depth video and/or post-processing the warped image through hole-filling techniques. Specifically, we propose in this paper a pre-processing of the depth video based on a bilateral filtering according to the strength of the depth discontinuity. Experimental results are shown to illustrate the efficiency of the proposed method compared to the traditional methods.","PeriodicalId":105774,"journal":{"name":"2010 IEEE International Workshop on Multimedia Signal Processing","volume":"75 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127209054","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Rate-distortion optimized low-delay 3D video communications","authors":"E. Masala","doi":"10.1109/MMSP.2010.5661998","DOIUrl":"https://doi.org/10.1109/MMSP.2010.5661998","url":null,"abstract":"This paper focuses on the rate-distortion optimization of low-delay 3D video communications based on the latest H.264/MVC video coding standard. The first part of the work proposes a new low-complexity model for distortion estimation suitable for low-delay stereoscopic video communication scenarios such as 3D videoconferencing. The distortion introduced by the loss of a given frame is investigated and a model is designed in order to accurately estimate the impact that the loss of each frame would have on future frames. The model is then employed in a rate-distortion optimized framework for video communications over a generic QoS-enabled network. Simulations results show consistent performance gains, up to 1.7 dB PSNR, with respect to a traditional a priori technique based on frame dependency information only. Moreover, the performance is shown to be consistently close to the one of the prescient technique that has perfect knowledge of the distortion characteristics of future frames.","PeriodicalId":105774,"journal":{"name":"2010 IEEE International Workshop on Multimedia Signal Processing","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116827026","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Challenging the security of Content-Based Image Retrieval systems","authors":"Thanh-Toan Do, Ewa Kijak, T. Furon, L. Amsaleg","doi":"10.1109/MMSP.2010.5661993","DOIUrl":"https://doi.org/10.1109/MMSP.2010.5661993","url":null,"abstract":"Content-Based Image Retrieval (CBIR) has been recently used as a filtering mechanism against the piracy of multimedia contents. Many publications in the last few years have proposed very robust schemes where pirated contents are detected despite severe modifications. As none of these systems have addressed the piracy problem from a security perspective, it is time to check whether they are secure: Can pirates mount violent attacks against CBIR systems by carefully studying the technology they use? This paper is an initial analysis of the security flaws of the typical technology blocks used in state-of-the-art CBIR systems. It is so far too early to draw any definitive conclusion about their inherent security, but it motivates and encourages further studies on this topic.","PeriodicalId":105774,"journal":{"name":"2010 IEEE International Workshop on Multimedia Signal Processing","volume":"53 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129772428","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Robust background subtraction method based on 3D model projections with likelihood","authors":"Hiroshi Sankoh, A. Ishikawa, S. Naito, S. Sakazawa","doi":"10.1109/MMSP.2010.5662014","DOIUrl":"https://doi.org/10.1109/MMSP.2010.5662014","url":null,"abstract":"We propose a robust background subtraction method for multi-view images, which is essential for realizing free viewpoint video where an accurate 3D model is required. Most of the conventional methods determine background using only visual information from a single camera image, and the precise silhouette cannot be obtained. Our method employs an approach of integrating multi-view images taken by multiple cameras, in which the background region is determined using a 3D model generated by multi-view images. We apply the likelihood of background to each pixel of camera images, and derive an integrated likelihood for each voxel in a 3D model. Then, the background region is determined based on the minimization of energy functions of the voxel likelihood. Furthermore, the proposed method also applies a robust refining process, where a foreground region obtained by a projection of a 3D model is improved according to geometric information as well as visual information. A 3D model is finally reconstructed using the improved foreground silhouettes. 
Experimental results show the effectiveness of the proposed method compared with conventional works.","PeriodicalId":105774,"journal":{"name":"2010 IEEE International Workshop on Multimedia Signal Processing","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128415250","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Fast environment extraction for lighting and occlusion of virtual objects in real scenes","authors":"François Fouquet, Jean-Philippe Farrugia, Brice Michoud, S. Brandel","doi":"10.1109/MMSP.2010.5662007","DOIUrl":"https://doi.org/10.1109/MMSP.2010.5662007","url":null,"abstract":"Augmented reality aims to insert virtual objects in real scenes. In order to obtain a coherent and realistic integration, these objects have to be relighted according to their positions and real light conditions. They also have to deal with occlusion by nearest parts of the real scene. To achieve this, we have to extract photometry and geometry from the real scene. In this paper, we adapt high dynamic range reconstruction and depth estimation methods to deal with real-time constraint and consumer devices. We present their limitations along with significant parameters influencing computing time and image quality. We tune these parameters to accelerate computation and evaluate their impact on the resulting quality. To fit with the augmented reality context, we propose a real-time extraction of these information from video streams, in a single pass.","PeriodicalId":105774,"journal":{"name":"2010 IEEE International Workshop on Multimedia Signal Processing","volume":"68 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130525906","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Gaussian mixture vector quantization-based video summarization using independent component analysis","authors":"Junfeng Jiang, Xiao-Ping Zhang","doi":"10.1109/MMSP.2010.5662062","DOIUrl":"https://doi.org/10.1109/MMSP.2010.5662062","url":null,"abstract":"In this paper, we propose a new Gaussian mixture vector quantization (GMVQ)-based method to summarize the video content. In particular, in order to explore the semantic characteristics of video data, we present a new feature extraction method using independent component analysis (ICA) and color histogram difference to build a compact 3D feature space first. A new GMVQ method is then developed to find the optimized quantization codebook. The optimal codebook size is determined by Bayes information criterion (BIC). The video frames that are the nearest-neighbours to the quanta in the GMVQ quantization codebook are sampled to summarize the video content. A kD-tree-based nearest-neighbour search strategy is employed to accelerate the search procedure. Experimental results show that our method is computationally efficient and practically effective to build a content-based video summarization system.","PeriodicalId":105774,"journal":{"name":"2010 IEEE International Workshop on Multimedia Signal Processing","volume":"134 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123900958","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A resilient and low-delay P2P streaming system based on network coding with random multicast trees","authors":"Marco Toldo, E. Magli","doi":"10.1109/MMSP.2010.5662054","DOIUrl":"https://doi.org/10.1109/MMSP.2010.5662054","url":null,"abstract":"Network coding is known to provide increased throughput and reduced delay for communications over networks. In this paper we propose a peer-to-peer video streaming system that exploits network coding in order to achieve low start-up delay, high streaming rate, and high resiliency to peers' dynamics. In particular, we introduce the concept of random multicast trees as overlay topology. This topology offers all benefits of tree-based overlays, notably a short start-up delay, but is much more efficient at distributing data and recovering from ungraceful peers departures. We develop a push-based streaming system that leverages network coding to efficiently distribute the information in the overlay without using buffer maps. We show performance results of the proposed system and compare it with an optimized pull systems based on Coolstreaming, showing significant improvement.","PeriodicalId":105774,"journal":{"name":"2010 IEEE International Workshop on Multimedia Signal Processing","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123914599","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Time-space acoustical feature for fast video copy detection","authors":"Y. Itoh, Masahiro Erokuumae, K. Kojima, M. Ishigame, Kazuyo Tanaka","doi":"10.1109/MMSP.2010.5662070","DOIUrl":"https://doi.org/10.1109/MMSP.2010.5662070","url":null,"abstract":"We propose a new time-space acoustical feature for fast video copy detection to search a video segment for a number of video streams to find illegal video copies on Internet video site and so on. We extract a small number of feature vectors from acoustically peculiar points that express the point of local maximum/minimum in the time sequence of acoustical power envelopes in video data. The relative values of the feature points are extracted, so called time-space acoustical feature, because the volume in the video stream differs in different recording environments. The features can be obtained quickly compared with representative features such as MFCC, and they require a short processing time for matching because the number and the dimension of each feature vector are both small. The accuracy and the computation time of the proposed method is evaluated using recorded TV movie programs for input data, and a 30 sec. −3 min. segment in DVD for reference data, assuming a copyright holder of a movie searches the illegal copies for video streams. 
We could confirm that the proposed method completed all processes within the computation time of the former feature extraction with 93.2% of F-measure in 3 minutes video segment detection.","PeriodicalId":105774,"journal":{"name":"2010 IEEE International Workshop on Multimedia Signal Processing","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114642681","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Measuring errors for massive triangle meshes","authors":"Anis Meftah, Arnaud Roquel, F. Payan, M. Antonini","doi":"10.1109/MMSP.2010.5662050","DOIUrl":"https://doi.org/10.1109/MMSP.2010.5662050","url":null,"abstract":"Our proposal is a method for computing the distance between two surfaces modeled by massive triangle meshes which can not be both loaded entirely in memory. The method consists in loading at each step a small part of the two meshes and computing the symmetrical distance for these areas. These areas are chosen in such a way as the orthogonal projection, used to compute this distance, have to be in it. For this, one of the two meshes is simplified and then a correspondence between the simplified mesh and the triangles of the input meshes is done. The experiments show that the proposed method is very efficient in terms of memory cost, while producing results comparable to the existent tools for the small and medium size meshes. Moreover, the proposed method enables us to compute the distance for massive meshes.","PeriodicalId":105774,"journal":{"name":"2010 IEEE International Workshop on Multimedia Signal Processing","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127740408","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Video super-resolution for dual-mode digital cameras via scene-matched learning","authors":"Guangtao Zhai, Xiaolin Wu","doi":"10.1109/MMSP.2010.5662061","DOIUrl":"https://doi.org/10.1109/MMSP.2010.5662061","url":null,"abstract":"Many consumer digital cameras support dual shooting mode of both low-resolution (LR) video and high-resolution (HR) image. By periodically switching between the video and image modes, this type of cameras make it possible to super-resolve the LR video with the assistance of neighboring HR still images. We propose a model-based video super-resolution (VSR) technique for the above dual-mode cameras. A HR video frame is modeled as a 2D piecewise autoregressive (PAR) process. The PAR model parameters are learnt from the HR still images inserted between LR video frames. By registering the LR video frames and the HR still images, we base the learning on sample statistics that matches the scene to be constructed. The resulting PAR model is more accurate and robust than if the model parameters are estimated from the LR video frames without referring to the HR images or from a training set. Aided by the powerful scene-matched model the LR video frame is upsampled to the resolution of the HR image via adaptive interpolation. As such, the proposed VSR technique does not require explicit motion estimation of subpixel precision nor the solution of a large-scale inverse problem. 
The new VSR technique is competitive in visual quality against existing techniques with a fraction of the computational cost.","PeriodicalId":105774,"journal":{"name":"2010 IEEE International Workshop on Multimedia Signal Processing","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126209511","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}