2012 IEEE 14th International Workshop on Multimedia Signal Processing (MMSP)最新文献_第2页

Quality enhancement of procam system by radiometric compensation 利用辐射补偿提高节目系统质量

2012 IEEE 14th International Workshop on Multimedia Signal Processing (MMSP) Pub Date : 2012-11-12 DOI: 10.1109/MMSP.2012.6343439

Tai-Hsiang Huang, Chen-Tai Kao, Homer H. Chen

引用次数: 9

Consistent spatio-temporal filling of disocclusions in the multiview-video-plus-depth format 多视点视频加深度格式中咬合的一致时空填充

2012 IEEE 14th International Workshop on Multimedia Signal Processing (MMSP) Pub Date : 2012-11-12 DOI: 10.1109/MMSP.2012.6343410

Martin Köppel, Xi Wang, D. Doshkov, T. Wiegand, P. Ndjiki-Nya

引用次数: 20

Affect recognition using EEG signal 利用脑电信号影响识别

2012 IEEE 14th International Workshop on Multimedia Signal Processing (MMSP) Pub Date : 2012-11-12 DOI: 10.1109/MMSP.2012.6343458

Haiyan Xu, K. Plataniotis

{"title":"Affect recognition using EEG signal","authors":"Haiyan Xu, K. Plataniotis","doi":"10.1109/MMSP.2012.6343458","DOIUrl":"https://doi.org/10.1109/MMSP.2012.6343458","url":null,"abstract":"Emotion states greatly influence many areas in our daily lives, such as: learning, decision making and interaction with others. Therefore, the ability to detect and recognize one's emotional states is essential in intelligence Human Machine Interaction (HMI). The aim of this study was to develop a new system that can sense and communicate emotion changes expressed by the Central Nervous System (CNS) through the use of EEG signals. More specifically, this study was carried out to develop an EEG-based subject-dependent affect recognition system to quantitatively measure and categorize three affect states: Positively excited, neutral and negatively excited. In this paper, we discussed implementation issues associated with each key stage of a fully automated affect recognition system: emotion elicitation protocol, feature extraction and classification. EEG recordings from 5 subjects with IAPS images as stimuli from the eNTERFACE06 database were used for simulation purposes. Discriminating features were extracted in both time and frequency domains (statistical, narrow-band, HOC, and wavelet entropy) to better understand the oscillatory nature of the brain waves. Through the use of k Nearest Neighbor classifier (kNN), we obtained mean correct classification rates of 90.77% on the three emotion classes when K equals 5. This demonstrated the feasibility of brain waves as a mean to categorize a user's emotion state. Secondly, we also assessed the suitability of commercially available EEG headsets such as Emotive Epoc for emotion recognition applications. This study was carried out by comparing the sensor location, signal integrity with those of Biosemi Active II. A new set of recognition performance was presented with reduced number of channels.","PeriodicalId":325274,"journal":{"name":"2012 IEEE 14th International Workshop on Multimedia Signal Processing (MMSP)","volume":"171 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121253717","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 44

Features for comparing tune similarity of songs across different languages 比较不同语言歌曲曲调相似度的功能

2012 IEEE 14th International Workshop on Multimedia Signal Processing (MMSP) Pub Date : 2012-11-12 DOI: 10.1109/MMSP.2012.6343464

Naveen Kumar, A. Tsiartas, Shrikanth S. Narayanan

引用次数: 2

Sharing the trees among random forests for effective and efficient concept detection 在随机森林中共享树以实现有效和高效的概念检测

2012 IEEE 14th International Workshop on Multimedia Signal Processing (MMSP) Pub Date : 2012-11-12 DOI: 10.1109/MMSP.2012.6343445

Tzu-Hsuan Chiu, Guan-Long Wu, Yu-Chuan Su, Winston H. Hsu

引用次数: 3

Incorporation of the ASR output in speaker segmentation and clustering within the task of speaker diarization of broadcast streams 将ASR输出集成到广播流的说话人分割和聚类任务中

2012 IEEE 14th International Workshop on Multimedia Signal Processing (MMSP) Pub Date : 2012-11-12 DOI: 10.1109/MMSP.2012.6343426

J. Silovský, J. Zdánský, J. Nouza, P. Cerva, J. Prazak

{"title":"Incorporation of the ASR output in speaker segmentation and clustering within the task of speaker diarization of broadcast streams","authors":"J. Silovský, J. Zdánský, J. Nouza, P. Cerva, J. Prazak","doi":"10.1109/MMSP.2012.6343426","DOIUrl":"https://doi.org/10.1109/MMSP.2012.6343426","url":null,"abstract":"In this paper we study the effect of incorporation of automatic transcriptions in the speaker diarization process. We aim to improve both the diarization accuracy as evaluated by standard objective measures and quality of the diarization output from user's perspective. Although the presented approach relies on output of an automatic speech recognizer, it makes no use of lexical information. Instead, we use information about word boundaries and classification of non-speech events occurring in the processed stream. The former information is used as constraining condition for speaker change-point candidates and the latter facilitate to neglect various vocal noise sounds that carry no speaker-specific information (considering representation of the signal by cepstral features) and thus harm the speaker's representation. The experimental evaluation of the presented approach was carried out using the COST278 multilingual broadcast news database. We demonstrate that the approach yields improvement in terms of both speaker diarization and segmentation performance measures. Furthermore, we show that the number of change-points detected within words (and not at their boundaries) is significantly reduced.","PeriodicalId":325274,"journal":{"name":"2012 IEEE 14th International Workshop on Multimedia Signal Processing (MMSP)","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132141863","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 13

ConCor+: Robust and confident video synchronization using consensus-based Cross-correlation ConCor+:使用基于共识的相互关联进行稳健和自信的视频同步

2012 IEEE 14th International Workshop on Multimedia Signal Processing (MMSP) Pub Date : 2012-11-12 DOI: 10.1109/MMSP.2012.6343420

Anas Al-Nuaimi, Burak Cizmeci, F. Schweiger, Roman Katz, S. Taifour, E. Steinbach, M. Fahrmair

引用次数: 2

Low bitrate coding schemes for local image descriptors 局部图像描述符的低比特率编码方案

2012 IEEE 14th International Workshop on Multimedia Signal Processing (MMSP) Pub Date : 2012-11-12 DOI: 10.1109/MMSP.2012.6343427

A. Redondi, M. Cesana, M. Tagliasacchi

{"title":"Low bitrate coding schemes for local image descriptors","authors":"A. Redondi, M. Cesana, M. Tagliasacchi","doi":"10.1109/MMSP.2012.6343427","DOIUrl":"https://doi.org/10.1109/MMSP.2012.6343427","url":null,"abstract":"Efficient coding of local image descriptors is of paramount importance when they need to be transmitted to a remote destination on bandwidth constrained networks. This is a case that arises, e.g., in mobile visual search and visual wireless sensor networks. In this work we consider SURF, a popular descriptor suitable for low-complexity devices, and we provide a comparative study of lossy coding schemes operating at low bitrate (e.g., less than 128 bits / descriptor). Our investigation covers schemes that address both intra- and inter-descriptor redundancy, including methods that have not been tested before in this context, e.g., sparse coding, lifting-based coding on trees, and hybrid intra and inter-descriptor coding. The experimental evaluation is carried out on two publicly available datasets, in terms of both rate-distortion and rate-accuracy, for the specific task of object recognition. Our results show that a rate saving of 15-30% can be achieved by exploiting intra-descriptor redundancy. On the other side, addressing inter-descriptor redundancy does not lead to substantial gains when applied alone, whereas it leads to marginal gains (up to 3%) when used in hybrid schemes jointly with intra-descriptor coding.","PeriodicalId":325274,"journal":{"name":"2012 IEEE 14th International Workshop on Multimedia Signal Processing (MMSP)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131668947","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 22

Extracting social values and group identities from social media text data 从社交媒体文本数据中提取社会价值和群体身份

2012 IEEE 14th International Workshop on Multimedia Signal Processing (MMSP) Pub Date : 2012-11-12 DOI: 10.1109/MMSP.2012.6343446

David A. Broniatowski

引用次数: 3

Blob detection and filtering for character segmentation of license plates 车牌字符分割中的斑点检测与滤波

2012 IEEE 14th International Workshop on Multimedia Signal Processing (MMSP) Pub Date : 2012-11-12 DOI: 10.1109/MMSP.2012.6343467

Youngwoo Yoon, Kyu-Dae Ban, H. Yoon, Jaehong Kim

引用次数: 18