2012 IEEE 14th International Workshop on Multimedia Signal Processing (MMSP): Latest Publications

Visual conditioning for augmented-reality-assisted video conferencing
2012 IEEE 14th International Workshop on Multimedia Signal Processing (MMSP) | Pub Date: 2012-12-31 | DOI: 10.1109/MMSP.2012.6343418
O. Guleryuz, T. Kalker
{"title":"Visual conditioning for augmented-reality-assisted video conferencing","authors":"O. Guleryuz, T. Kalker","doi":"10.1109/MMSP.2012.6343418","DOIUrl":"https://doi.org/10.1109/MMSP.2012.6343418","url":null,"abstract":"Typical video conferencing scenarios bring together individuals from disparate environments. Unless one commits to expensive tele-presence rooms, conferences involving many individuals result in a cacophony of visuals and backgrounds. Ideally one would like to separate participant visuals from their respective environments and render them over visually pleasing backgrounds that enhance immersion for all. Yet available image/video segmentation techniques are limited and result in significant artifacts even with recently popular commodity depth sensors. In this paper we present a technique that accomplishes robust and visually pleasing rendering of segmented participants over adaptively-designed virtual backgrounds. Our method works by determining virtual backgrounds that match and highlight participant visuals and uses directional textures to hide segmentation artifacts due to noisy segmentation boundaries, missing regions, etc. Taking advantage of simple computations and look-up-tables, our work leads to fast, real-time implementations that can run on mobile and other computationally-limited platforms.","PeriodicalId":325274,"journal":{"name":"2012 IEEE 14th International Workshop on Multimedia Signal Processing (MMSP)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130174511","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 1
A P300-based BCI classification algorithm using median filtering and Bayesian feature extraction
2012 IEEE 14th International Workshop on Multimedia Signal Processing (MMSP) | Pub Date: 2012-11-12 | DOI: 10.1109/MMSP.2012.6343459
Xiaoou Li, Feng Wang, Xu Chen, R. Ward
{"title":"A P300-based BCI classification algorithm using median filtering and Bayesian feature extraction","authors":"Xiaoou Li, Feng Wang, Xu Chen, R. Ward","doi":"10.1109/MMSP.2012.6343459","DOIUrl":"https://doi.org/10.1109/MMSP.2012.6343459","url":null,"abstract":"A brain computer interface (BCI) system translates a person's brain activity into useful control or communication signals. In this paper, an effective P300-based BCI identification algorithm using median filtering and Bayesian classifier is proposed to improve the classification accuracy and computation efficiency of P300-based BCI. Median filtering is firstly applied to remove noises and Bayesian Linear Discriminant Analysis (BLDA) is then employed for classification. Testing on the P300 speller paradigm in dataset II of 2004 BCI Competition III, we show that a 90% average classification accuracy can be achieved and the highest accuracy is 100%. The proposed method is also computationally efficient and thus it represents a practical implementation for man-computer communication control, especially for on-line applications.","PeriodicalId":325274,"journal":{"name":"2012 IEEE 14th International Workshop on Multimedia Signal Processing (MMSP)","volume":"40 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115386608","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 3
A Lagrangian framework for video analytics
2012 IEEE 14th International Workshop on Multimedia Signal Processing (MMSP) | Pub Date: 2012-11-12 | DOI: 10.1109/MMSP.2012.6343474
A. Kuhn, T. Senst, I. Keller, T. Sikora, H. Theisel
{"title":"A Lagrangian framework for video analytics","authors":"A. Kuhn, T. Senst, I. Keller, T. Sikora, H. Theisel","doi":"10.1109/MMSP.2012.6343474","DOIUrl":"https://doi.org/10.1109/MMSP.2012.6343474","url":null,"abstract":"The extraction of motion patterns from image sequences based on the optical flow methodology is an important and timely topic among visual multi media applications. In this work we will present a novel framework that combines the optical flow methodology from image processing with methods developed for the Lagrangian analysis of time-dependent vector fields. The Lagrangian approach has been proven to be a valuable and powerful tool to capture the complex dynamic motion behavior within unsteady vector fields. To come up with a compact and applicable framework, this paper will provide concepts on how to compute trajectory-based Lagrangian measures in series of optical flow fields, a set of basic measures to capture the essence of the motion behavior within the image, and a compact hierarchical, feature-based description of the resulting motion features. The resulting framework will bee shown to be suitable for an automated image analysis as well as compact visual analysis of image sequences in its spatio-temporal context. We show its applicability for the task of motion feature description and extraction on different temporal scales, crowd motion analysis, and automated detection of abnormal events within video sequences.","PeriodicalId":325274,"journal":{"name":"2012 IEEE 14th International Workshop on Multimedia Signal Processing (MMSP)","volume":"32 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125159090","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 22
An optimal client buffer model for multiplexing HTTP streams
2012 IEEE 14th International Workshop on Multimedia Signal Processing (MMSP) | Pub Date: 2012-11-12 | DOI: 10.1109/MMSP.2012.6343455
Saayan Mitra, Viswanathan Swaminathan
{"title":"An optimal client buffer model for multiplexing HTTP streams","authors":"Saayan Mitra, Viswanathan Swaminathan","doi":"10.1109/MMSP.2012.6343455","DOIUrl":"https://doi.org/10.1109/MMSP.2012.6343455","url":null,"abstract":"The basic tenet of HTTP streaming is to deliver fragments of video and audio that are individually addressable chunks of content over HTTP. Some media players consume incoming video and audio data only in a time ordered multiplexed format. If alternate tracks need to be added post packaging of the media, it has to be repackaged that involves duplication resulting in multiple multiplexed files. Additionally for adaptive streaming, a set of all those files need to be added for each bitrate. Alternatively, it is more efficient to store component tracks separately, fetching only the required tracks and multiplexing audio and video in the client before sending the data to the decoder. To deliver an optimal viewing experience, the client has to take care of the seemingly conflicting constraints viz., handling the network jitter, minimizing the time to switch to an alternate track and minimizing the live latency. For instance, to absorb more network jitter more data should be available in the buffers but this would increase the switching latency. We introduce a formal buffer model for a client that gathers video and audio fragments and multiplexes them on the fly. This model uses separate video and audio buffers, a multiplexed buffer in the application, and decoding buffer associated with the decoder. We model the buffer sizes, their thresholds to request data from the network, and the rate of transfer of data between buffers. We show that these buffers can be designed varying these parameters to optimize for the above constraints. This buffer model can also be leveraged for deciding when to switch in adaptive bitrate streaming. We further validate these by experimental results from our implementation.","PeriodicalId":325274,"journal":{"name":"2012 IEEE 14th International Workshop on Multimedia Signal Processing (MMSP)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114279101","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 5
Human action recognition using Lagrangian descriptors
2012 IEEE 14th International Workshop on Multimedia Signal Processing (MMSP) | Pub Date: 2012-11-12 | DOI: 10.1109/MMSP.2012.6343469
Esra Acar, T. Senst, A. Kuhn, I. Keller, H. Theisel, S. Albayrak, T. Sikora
{"title":"Human action recognition using Lagrangian descriptors","authors":"Esra Acar, T. Senst, A. Kuhn, I. Keller, H. Theisel, S. Albayrak, T. Sikora","doi":"10.1109/MMSP.2012.6343469","DOIUrl":"https://doi.org/10.1109/MMSP.2012.6343469","url":null,"abstract":"Human action recognition requires the description of complex motion patterns in image sequences. In general, these patterns span varying temporal scales. In this context, Lagrangian methods have proven to be valuable for crowd analysis tasks such as crowd segmentation. In this paper, we show that, besides their potential in describing large scale motion patterns, Lagrangian methods are also well suited to model complex individual human activities over variable time intervals. We use Finite Time Lyapunov Exponents and time-normalized arc length measures in a linear SVM classification scheme. We evaluated our method on the Weizmann and KTH datasets. The results demonstrate that our approach is promising and that human action recognition performance is improved by fusing Lagrangian measures.","PeriodicalId":325274,"journal":{"name":"2012 IEEE 14th International Workshop on Multimedia Signal Processing (MMSP)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122032581","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 17
Interactive mobile visual search for social activities completion using query image contextual model
2012 IEEE 14th International Workshop on Multimedia Signal Processing (MMSP) | Pub Date: 2012-11-12 | DOI: 10.1109/MMSP.2012.6343447
Ning Zhang, Tao Mei, Xiansheng Hua, L. Guan, Shipeng Li
{"title":"Interactive mobile visual search for social activities completion using query image contextual model","authors":"Ning Zhang, Tao Mei, Xiansheng Hua, L. Guan, Shipeng Li","doi":"10.1109/MMSP.2012.6343447","DOIUrl":"https://doi.org/10.1109/MMSP.2012.6343447","url":null,"abstract":"Mobile devices are ubiquitous. People use their phones as a personal concierge not only discovering information but also searching for particular interest on-the-go and making decisions. This brings a new horizon for multimedia retrieval on mobile. While existing efforts have predominantly focused on understanding textual or a voice query, this paper presents a new perspective which understands visual queries captured by the built-in camera such that mobile-based social activities can be recommended for users to complete. In this work, a query image-based contextual model is proposed for visual search. A mobile user can take a photo and naturally indicate an object-of-interest within the photo via circle based gesture called “O” gesture. Both selected object-of-interest region as well as surrounding visual context in photo are used in achieving a search-based recognition by retrieving similar images based on a large-scale of visual vocabulary tree. Consequently, social activities such as visiting contextually relevant entities (i.e., local businesses) are recommended to the users based on their visual queries and GPS location. Along with the proposed method, an exemplary real application has been developed on Windows Phone 7 devices and evaluated with a wide variety of scenarios on million-scale image database. To test the performance of proposed mobile visual search model, extensive experimentation has been conducted and compared with state-of-the-art algorithms in content-based image retrieval (CBIR) domain.","PeriodicalId":325274,"journal":{"name":"2012 IEEE 14th International Workshop on Multimedia Signal Processing (MMSP)","volume":"52 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116879288","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 7
Video jitter analysis for automatic bootleg detection
2012 IEEE 14th International Workshop on Multimedia Signal Processing (MMSP) | Pub Date: 2012-11-12 | DOI: 10.1109/MMSP.2012.6343423
M. V. Scarzanella, P. Dragotti
{"title":"Video jitter analysis for automatic bootleg detection","authors":"M. V. Scarzanella, P. Dragotti","doi":"10.1109/MMSP.2012.6343423","DOIUrl":"https://doi.org/10.1109/MMSP.2012.6343423","url":null,"abstract":"This paper presents a novel technique for the automatic detection of recaptured videos with applications to video forensics. The proposed technique uses scene jitter as a cue for classification: when recapturing planar surfaces approximately parallel to the imaging plane, any added motion due to jitter will result in approximately uniform high-frequency 2D motion fields. The inter-frame motion trajectories are retrieved with feature tracking techniques, while local and global feature motion are decoupled through a 2-level wavelet decomposition. A normalised cross-correlation matrix is then populated with the similarities between the high-frequency components of the tracked features' trajectories. The correlation distribution is then compared with trained models for classification. Experiments with original and recaptured standard datasets show the validity of the proposed technique.","PeriodicalId":325274,"journal":{"name":"2012 IEEE 14th International Workshop on Multimedia Signal Processing (MMSP)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128386191","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 27
Improved seam carving for semantic video coding
2012 IEEE 14th International Workshop on Multimedia Signal Processing (MMSP) | Pub Date: 2012-11-12 | DOI: 10.1109/MMSP.2012.6343415
M. Decombas, F. Dufaux, E. Renan, B. Pesquet-Popescu, F. Capman
{"title":"Improved seam carving for semantic video cod","authors":"M. Decombas, F. Dufaux, E. Renan, B. Pesquet-Popescu, F. Capman","doi":"10.1109/MMSP.2012.6343415","DOIUrl":"https://doi.org/10.1109/MMSP.2012.6343415","url":null,"abstract":"Traditional video codecs like H.264/AVC encode video sequences to minimize the Mean Squared Error (MSE)at a given bitrate. Seam carving is a content-aware resizing method. In this paper, we propose a semantic video compression scheme based on seam carving. Its principle is to suppress non salient parts of the video by seam carving. The reduced sequence is then encoded with H.264/AVC and the seams are represented and encoded with our proposed approach. The main idea is to encode the seams by regrouping them. Compared to our earlier work, the main contributions of this paper are: a new energy map with better temporal robustness, a new way to define groups of seams using k-median clustering, and an improved background synthesis. Experiments show that, compared to a traditional H.264/AVC encoding, we reach a bitrate saving between 10% and 24% % with the same quality of the salient objects.","PeriodicalId":325274,"journal":{"name":"2012 IEEE 14th International Workshop on Multimedia Signal Processing (MMSP)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121471831","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 11
Learning based screen image compression
2012 IEEE 14th International Workshop on Multimedia Signal Processing (MMSP) | Pub Date: 2012-11-12 | DOI: 10.1109/MMSP.2012.6343419
Huan Yang, Weisi Lin, Chenwei Deng
{"title":"Learning based screen image compression","authors":"Huan Yang, Weisi Lin, Chenwei Deng","doi":"10.1109/MMSP.2012.6343419","DOIUrl":"https://doi.org/10.1109/MMSP.2012.6343419","url":null,"abstract":"There are usually two components in computer screen images: textual and pictorial parts. The pictorial part can be compressed efficiently by classical coding approaches (e.g. JPEG, JPEG2000), while the compression of the textual part is still far away from being satisfactory for the reason that the textual content is usually of high-frequency. In this paper, a learning approach is used to construct a tailored dictionary for text representation. Based on the learned dictionary, a novel screen image compression algorithm is proposed through adopting different basis functions for the textual and pictorial components respectively. The screen images are firstly segmented into textual and pictorial parts. Then we employ traditional discrete cosine transformation (DCT) to facilitate the compression of pictorial part, while the learned dictionary is used to represent the textual part in screen images. Experimental results demonstrate the effectiveness of the proposed compression algorithm.","PeriodicalId":325274,"journal":{"name":"2012 IEEE 14th International Workshop on Multimedia Signal Processing (MMSP)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127649971","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 13
Block-based compressed sampling with non-linear coding for image transmission
2012 IEEE 14th International Workshop on Multimedia Signal Processing (MMSP) | Pub Date: 2012-11-12 | DOI: 10.1109/MMSP.2012.6343416
B. Liu, Wei Qiao, Zixiang Xiong, G. Arce, J. Garcia-Frías
{"title":"Block-based compressed sampling with non-linear coding for image transmission","authors":"B. Liu, Wei Qiao, Zixiang Xiong, G. Arce, J. Garcia-Frías","doi":"10.1109/MMSP.2012.6343416","DOIUrl":"https://doi.org/10.1109/MMSP.2012.6343416","url":null,"abstract":"We propose a novel block-based image transmission system, which exploits the a prior information existing in the DCT domain of images and combines both linear and non-linear coding schemes accommodated to a block-based DCT domain compressed sampling method. An image is firstly divided into blocks and each block is separately sampled in DCT domain. Different coding schemes are used to transmit the samples based on their properties. With block-based strategy, each image block can be processed and transmitted separately, which reduces a lot of latency. Besides, an efficient system optimization algorithm is proposed by jointly optimizing the power allocation scheme and the transmission parameters to search for the maximum peak signal-to-noise ratio (PSNR) of the reconstructed image. Simulation results show that the proposed system provides a good performance with less latency.","PeriodicalId":325274,"journal":{"name":"2012 IEEE 14th International Workshop on Multimedia Signal Processing (MMSP)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114802921","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 0