{"title":"Social stances by virtual smiles","authors":"M. Ochs, C. Pelachaud, K. Prepin","doi":"10.1109/WIAMIS.2013.6616144","DOIUrl":"https://doi.org/10.1109/WIAMIS.2013.6616144","url":null,"abstract":"When two persons participate to a discussion, they not only exchange about the concepts and ideas they are dis-cussing, but they also express stances with regard to content of their speech (called epistemic stances) and to convey their interpersonal relationship (called interpersonal stances). The stances can be expressed through non-verbal behaviors, for instance smiles. Stances are also co-constructed by their interactants through simultaneous or sequential behaviors such as the alignment of speaker's and listener's smiles. In this paper, we present several studies exploring the stances (epistemic, interpersonal, and co-constructed) that the social signal of smile may convey. We propose to analyze different contextual levels to highlight how users' engagement and discourse context influence their perception of the virtual characters' stances.","PeriodicalId":408077,"journal":{"name":"2013 14th International Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129470435","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A nested infinite Gaussian mixture model for identifying known and unknown audio events","authors":"Y. Sasaki, Kazuyoshi Yoshii, S. Kagami","doi":"10.1109/WIAMIS.2013.6616152","DOIUrl":"https://doi.org/10.1109/WIAMIS.2013.6616152","url":null,"abstract":"This paper presents a novel statistical method that can classify given audio events into known classes or recognize them as an unknown class. We propose a nested infinite Gaussian mixture model (iGMM) to represent varied audio events in real environment. One of the main problems of conventional classification methods is that we need to specify a fixed number of classes in advance. Therefore, all audio events are forced to be classified into known classes. To solve the problem, the proposed method formulates a infinite Gaussian mixture model (iGMM) in which the number of classes are allowed to increase without bound. Another problem is that the complexity of each audio event is different. Then, the nested iGMM using nonparametric Bayesian approach is applied to adjust the needed dimension of each audio model. Experimental results show the effectiveness for these two problems to represent the given audio events.","PeriodicalId":408077,"journal":{"name":"2013 14th International Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS)","volume":"158 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121392425","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"On coding and resampling of video in 4:2:2 chroma format for cascaded coding applications","authors":"Andrea Gabriellini, M. Mrak","doi":"10.1109/WIAMIS.2013.6616153","DOIUrl":"https://doi.org/10.1109/WIAMIS.2013.6616153","url":null,"abstract":"Throughout the broadcasting chain 4:2:2 chroma format is widely used even if some parts of the chain require other formats (4:2:0 or 4:4:4). This paper presents an approach to coding video content in 4:2:2 chroma format using resampling of chroma samples. All subsequent video coding operations are then carried out at the new chroma format. The choice of filter for resampling the reconstructed video signal is sent to the decoder in the compressed bit-stream. This paper investigates choices of resampling filters and coding parameters associated with the proposed approach with a goal to minimise conversion losses. Coding performance of possible solutions are reported for two reversible resampling filter pairs when applied in the emerging HEVC video coding standard.","PeriodicalId":408077,"journal":{"name":"2013 14th International Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS)","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114025345","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Event-driven retrieval in collaborative photo collections","authors":"M. Brenner, E. Izquierdo","doi":"10.1109/WIAMIS.2013.6616121","DOIUrl":"https://doi.org/10.1109/WIAMIS.2013.6616121","url":null,"abstract":"We present an approach to retrieve photos relating to social events in collaborative photo collections. Compared to traditional approaches that typically consider only the visual features of photos as a source of information, we incorporate multiple additional contextual cues like date and time, location and usernames to improve retrieval performance. Experiments based on the MediaEval Social Event Detection Dataset demonstrate the effectiveness of our approach.","PeriodicalId":408077,"journal":{"name":"2013 14th International Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS)","volume":"108 1-3","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131519749","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Introducing motion information in dense feature classifiers","authors":"Claudiu Tanase, B. Mérialdo","doi":"10.1109/WIAMIS.2013.6616132","DOIUrl":"https://doi.org/10.1109/WIAMIS.2013.6616132","url":null,"abstract":"Semantic concept detection in large scale video collections is mostly achieved through a static analysis of selected keyframes. A popular choice for representing the visual content of an image is based on the pooling of local descriptors such as Dense SIFT. However, simple motion features such as optic flow can be extracted relatively easy from such keyframes. In this paper we propose an efficient addition to the DSIFT approach by including information derived from optic flow. Based on optic flow magnitude, we can estimate for each DSIFT patch whether it is static or moving. We modify the bag of words model used traditionally with DSIFT by creating two separate occurrence histograms instead of one: one for static patches and one for dynamic patches. We further refine this method by studying different separation thresholds and soft assign-ment, as well as different normalization techniques. Classifier score fusion is used to maximize the average precision of all these variants. Experimental results on the TRECVID Semantic Indexing collection show that by means of classifier fusion our method increases overall mean average precision of the DSIFT classifier from 0.061 to 0.106.","PeriodicalId":408077,"journal":{"name":"2013 14th International Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS)","volume":"62 11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131027801","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Affine invariant salient patch descriptors for image retrieval","authors":"F. Isikdogan, A. A. Salah","doi":"10.1109/WIAMIS.2013.6616136","DOIUrl":"https://doi.org/10.1109/WIAMIS.2013.6616136","url":null,"abstract":"Image description constitutes a major part of matching-based tasks in computer vision. The size of descriptors becomes more important for retrieval tasks in large datasets. In this paper, we propose a compact and robust image description algorithm for image retrieval, which consists of three main stages: salient patch extraction, affine invariant feature computation over concentric elliptical tracks on the patch, and global feature incorporation. We evaluate the performance of our algorithm for region-based image retrieval and image reuse detection, a special case of image retrieval. We present a novel synthetic image reuse dataset, which is generated by superimposing objects on different background images with systematic transformations. Our results show that the proposed descriptor is effective for this problem.","PeriodicalId":408077,"journal":{"name":"2013 14th International Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS)","volume":"67 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127662969","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Tapped delay multiclass support vector machines for industrial workflow recognition","authors":"Eftychios E. Protopapadakis, A. Doulamis, N. Doulamis","doi":"10.1109/WIAMIS.2013.6616141","DOIUrl":"https://doi.org/10.1109/WIAMIS.2013.6616141","url":null,"abstract":"In this paper, a tapped delay multiclass support vector machine scheme is used for supervised job classification, based on video data taken from Nissan factory. The procedure is based on multiclass SVMs enhanced with the time dimension by incorporating additional information of n-th previous frames and allowing for user feedback when necessary. Such methodology will support the visual supervision of industrial environments by providing essential information to the supervisors and supporting their job.","PeriodicalId":408077,"journal":{"name":"2013 14th International Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126468455","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"An application framework for implicit sentiment human-centered tagging using attributed affect","authors":"K. C. Apostolakis, P. Daras","doi":"10.1109/WIAMIS.2013.6616145","DOIUrl":"https://doi.org/10.1109/WIAMIS.2013.6616145","url":null,"abstract":"In this paper, a novel framework for implicit sentiment image tagging and retrieval is presented, based on the concept of attributed affect. The user's affective response is recorded and analyzed to provide an appropriate affective label, while eye gaze is monitored in order to identify a specific object depicted in the scene, which is attributed as the cause of the user's current state of core affect. Through this procedure, automatic tagging of content, as well as retrieval based on personal preferences is possible. Our experiments show that our framework successfully channels behavioral tags (in the form of affective labels) to the data tagging and retrieval loop, even when applied in the context of a cost-efficient, widely available hardware setup, that uses a single low resolution webcam mounted on a standard modern computer system.","PeriodicalId":408077,"journal":{"name":"2013 14th International Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121572885","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Footstep detection and classification using distributed microphones","authors":"K. Nakadai, Yuta Fujii, S. Sugano","doi":"10.1109/WIAMIS.2013.6616127","DOIUrl":"https://doi.org/10.1109/WIAMIS.2013.6616127","url":null,"abstract":"This paper addresses footstep detection and classification with multiple microphones distributed on the floor. We propose to introduce geometrical features such as position and velocity of a sound source for classification which is estimated by amplitude-based localization. It does not require precise inter-microphone time synchronization unlike a conventional microphone array technique. To classify various types of sound events, we introduce four types of features, i.e., time-domain, spectral and Cepstral features in addition to the geometrical features. We constructed a prototype system for footstep detection and classification based on the proposed ideas with eight microphones aligned in a 2-by-4 grid manner. Preliminary classification experiments showed that classification accuracy for four types of sound sources such as a walking footstep, running footstep, handclap, and utterance maintains over 70% even when the signal-to-noise ratio is low, like 0 dB. We also confirmed two advantages with the proposed footstep detection and classification. One is that the proposed features can be applied to classification of other sound sources besides footsteps. The other is that the use of a multichannel approach further improves noise-robustness by selecting the best microphone among the microphones, and providing geometrical information on a sound source.","PeriodicalId":408077,"journal":{"name":"2013 14th International Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS)","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132249181","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A heuristic for distance fusion in cover song identification","authors":"Alessio Degani, M. Dalai, R. Leonardi, P. Migliorati","doi":"10.1109/WIAMIS.2013.6616128","DOIUrl":"https://doi.org/10.1109/WIAMIS.2013.6616128","url":null,"abstract":"In this paper, we propose a method to integrate the results of different cover song identification algorithms into one single measure which, on the average, gives better results than initial algorithms. The fusion of the different distance measures is made by projecting all the measures in a multi-dimensional space, where the dimensionality of this space is the number of the considered distances. In our experiments, we test two distance measures, namely the Dynamic Time Warping and the Qmax measure when applied in different combinations to two features, namely a Salience feature and a Harmonic Pitch Class Profile (HPCP). While the HPCP is meant to extract purely harmonic descriptions, in fact, the Salience allows to better discern melodic differences. It is shown that the combination of two or more distance measure improves the overall performance.","PeriodicalId":408077,"journal":{"name":"2013 14th International Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS)","volume":"185 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115701028","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}