2007 IEEE 9th Workshop on Multimedia Signal Processing最新文献

Joint Analysis of the Emotional Fingerprint in the Face and Speech: A single subject study 面部和言语情感指纹的联合分析:单受试者研究

2007 IEEE 9th Workshop on Multimedia Signal Processing Pub Date : 2007-12-01 DOI: 10.1109/MMSP.2007.4412814

C. Busso, Shrikanth S. Narayanan

引用次数: 21

Systematic comparison of BIC-based speaker segmentation systems 基于bic的说话人分割系统的系统比较

2007 IEEE 9th Workshop on Multimedia Signal Processing Pub Date : 2007-12-01 DOI: 10.1109/MMSP.2007.4412819

V. Moschou, M. Kotti, Emmanouil Benetos, Constantine Kotropoulos

引用次数: 7

Unequal Growth Codes: Intermediate Performance and Unequal Error Protection for Video Streaming 不等增长码:视频流的中间性能和不等错误保护

2007 IEEE 9th Workshop on Multimedia Signal Processing Pub Date : 2007-12-01 DOI: 10.1109/MMSP.2007.4412829

A. Dimakis, Jiajun Wang, K. Ramchandran

引用次数: 43

Dual-Mode Wideband Speech Compression 双模宽带语音压缩

2007 IEEE 9th Workshop on Multimedia Signal Processing Pub Date : 2007-12-01 DOI: 10.1109/MMSP.2007.4412816

Visar Berisha, A. Spanias

引用次数: 0

Grid-based Template Matching for People Counting 基于网格的人口计数模板匹配

2007 IEEE 9th Workshop on Multimedia Signal Processing Pub Date : 2007-12-01 DOI: 10.1109/MMSP.2007.4412881

J. Hsieh, Cheng-Shuang Peng, Kuo-Chin Fan

{"title":"Grid-based Template Matching for People Counting","authors":"J. Hsieh, Cheng-Shuang Peng, Kuo-Chin Fan","doi":"10.1109/MMSP.2007.4412881","DOIUrl":"https://doi.org/10.1109/MMSP.2007.4412881","url":null,"abstract":"This paper presents a novel template matching method to detect and track pedestrians for people counting in real-time. Firstly, a novel background subtraction method is proposed for extracting all foreground objects from background. Then, a shadow elimination method is used to remove unwanted shadow from the background. In order to identify pedestrians from non-pedestrian objects, this paper proposed a novel grid-based template matching scheme to robustly verify each pedestrian. Usually, a pedestrian will have different appearances at different positions. The grid-based approach can effectively reduce the perspective effects into a minimum since it uses different templates to record the appearance changes at each grid. When more templates are used, the detection process will become more inefficient. To speed up its efficiency, an integral image is used to filter out all impossible candidates in advance. Lastly, a tracking method is applied to tracking the direction of each moving pedestrian so that the real number of passing people per direction can be counted more accurately. Experimental results have proved that the proposed method is robust, accurate, and powerful in people counting.","PeriodicalId":225295,"journal":{"name":"2007 IEEE 9th Workshop on Multimedia Signal Processing","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126470480","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 19

Robust Image Watermarking Based on Local Zernike Moments 基于局部泽尼克矩的鲁棒图像水印

2007 IEEE 9th Workshop on Multimedia Signal Processing Pub Date : 2007-12-01 DOI: 10.1109/MMSP.2007.4412901

Nitin Singhal, Young-Yoon Lee, Chang-Su Kim, Sang Uk Lee

引用次数: 15

New Directions in Image and Video Quality Assessment Plenary Talk 图像和视频质量评估的新方向

2007 IEEE 9th Workshop on Multimedia Signal Processing Pub Date : 2007-12-01 DOI: 10.1109/MMSP.2007.4412802

A. Bovik

引用次数: 1

Multimodal Meeting Monitoring: Improvements on Speaker Tracking and Segmentation through a Modified Mixture Particle Filter 多模态会议监控:改进的混合粒子滤波对说话人跟踪和分割的影响

2007 IEEE 9th Workshop on Multimedia Signal Processing Pub Date : 2007-12-01 DOI: 10.1109/MMSP.2007.4412818

Viktor Rozgic, C. Busso, P. Georgiou, Shrikanth S. Narayanan

{"title":"Multimodal Meeting Monitoring: Improvements on Speaker Tracking and Segmentation through a Modified Mixture Particle Filter","authors":"Viktor Rozgic, C. Busso, P. Georgiou, Shrikanth S. Narayanan","doi":"10.1109/MMSP.2007.4412818","DOIUrl":"https://doi.org/10.1109/MMSP.2007.4412818","url":null,"abstract":"In this paper we address improvements to our multimodal system for tracking of meeting participants and speaker segmentation with a focus on the microphone array modality. We propose an algorithm that uses Directions-of-Arrival estimated for each microphone pair as observations and performs tracking of an unknown number of acoustically-active meeting participants and subsequent speaker segmentation. We propose modified mixture particle fillter (mMPF) for tracking of acoustic sources in the track-before-detection (TbD) framework. Trajectories of sound sources are reconstructed by the optimal assignment of posterior mixture components produced by mMPF in consecutive frames. Further, we propose a sequential optimal change-point detection algorithm which discovers speech segments in the reconstructed trajectories i.e., performs speaker segmentation. The algorithm is tested on a multi-participant meeting dataset both separately and as a part of the multimodal system. On the task of speaker detection in the multimodal setup we report significant improvement over our previous state of the art implementation.","PeriodicalId":225295,"journal":{"name":"2007 IEEE 9th Workshop on Multimedia Signal Processing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130015153","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 10

Semantics Interpretation of Superimposed Captions in Sports Videos 体育录像中叠加字幕的语义解释

2007 IEEE 9th Workshop on Multimedia Signal Processing Pub Date : 2007-10-01 DOI: 10.1109/MMSP.2007.4412861

H. Shih, Chung-Lin Huang

引用次数: 4

Relevant Feature Selection for Audio-Visual Speech Recognition 视听语音识别的相关特征选择

2007 IEEE 9th Workshop on Multimedia Signal Processing Pub Date : 2007-10-01 DOI: 10.1109/MMSP.2007.4412847

Thomas Drugman, Mihai Gurban, J. Thiran

引用次数: 26