{"title":"Syntactic matching of pedestrian trajectories for behavioral analysis","authors":"Nicola Piotto, N. Conci, F. D. Natale","doi":"10.1109/MMSP.2008.4665197","DOIUrl":"https://doi.org/10.1109/MMSP.2008.4665197","url":null,"abstract":"In the present work we propose a new approach to dynamically characterize trajectories for a syntactic spatio-temporal alignment that can be applied in the context of behavioral analysis and anomalous activity detection. The developed architecture is based on a symbolic representation of the trajectory, exploiting the framework of the so-called edit-distance. The acquired trajectory samples are filtered to identify the most significant spatio-temporal discontinuities: these key points are converted into a string-based domain where the matching of trajectory pairs can be expressed in terms of global alignment between symbols, similarly to DNA string matching algorithms. The extraction, characterization and alignment of trajectories have been tested in different environments, demonstrating the reliability of the achieved results and the viability of the solution for video surveillance and domotics applications.","PeriodicalId":402287,"journal":{"name":"2008 IEEE 10th Workshop on Multimedia Signal Processing","volume":"320 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132334999","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Motion compensated prediction in transform domain Distributed Video Coding","authors":"S. Borchert, R. Westerlaken, R. K. Gunnewiek, R. Lagendijk","doi":"10.1109/MMSP.2008.4665099","DOIUrl":"https://doi.org/10.1109/MMSP.2008.4665099","url":null,"abstract":"The ongoing research in distributed video coding (DVC) for low complexity encoding is trying to shorten the substantial performance gap to well known state-of-the-art coders. One way of reducing this gap is to improve the quality of the motion compensated prediction. In this paper we investigate which motion estimation method to apply in DVC, comparing possible methods to produce a motion compensated prediction. We use interpolation as well as extrapolation methods. Our results show that even with a very simple DCT scheme for the Wyner Ziv frames, extrapolation can outperform the widely used interpolation.","PeriodicalId":402287,"journal":{"name":"2008 IEEE 10th Workshop on Multimedia Signal Processing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130875050","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Digital camera identification based on canonical correlation analysis","authors":"Chi Zhang, Hongbin Zhang","doi":"10.1109/MMSP.2008.4665178","DOIUrl":"https://doi.org/10.1109/MMSP.2008.4665178","url":null,"abstract":"In this paper, we introduce a new method for digital camera identification from its color images using image sensor noise. We first compute the two noise reference patterns by averaging the noise component from two groups of color images taken with a camera. Then we use canonical correlation analysis (CCA) to calculate the projection directions of the two noise reference patterns. Finally, we calculate the correlation coefficient between the projection of the noise from a specific color image onto one projection direction and the projection of one of noise reference patterns onto another projection direction, then use this coefficient to decide whether the specific color image was taken by the camera or not. Experimental results show that the presented method provides higher accuracy than other methods on the condition of using a few images to compute reference pattern.","PeriodicalId":402287,"journal":{"name":"2008 IEEE 10th Workshop on Multimedia Signal Processing","volume":"52 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126888443","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Coding structure optimization for interactive multiview streaming in virtual world observation","authors":"Gene Cheung, Antonio Ortega, Takashi Sakamoto","doi":"10.1109/MMSP.2008.4665121","DOIUrl":"https://doi.org/10.1109/MMSP.2008.4665121","url":null,"abstract":"While most multiview coding techniques focus on compressing all frames in a multiview video sequence in a rate-distortion optimal manner, in this paper we address the problem of interactive multiview streaming, where we minimize the expected transmission rate of an interactive multiview video stream, where the observer can select the view of the next frame, subject to a storage constraint. We show that gains can be achieved by optimizing the trade-off between overall storage and transmission rate, i.e., by storing a more redundant multiview representation (where some frames are encoded more than once, each time using a different reference frame) it is possible to reduce the overall bandwidth needed for online interactive viewing. We show that our proposed redundant representation can reduce the transmission cost of interactive multiview streaming by up to 65% as compared to a good non-redundant representation for the same storage constraint.","PeriodicalId":402287,"journal":{"name":"2008 IEEE 10th Workshop on Multimedia Signal Processing","volume":"91 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122311132","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Facial color adaptive technique based on the theory of emotion-color association and analysis of animation","authors":"Kyu-ho Park, Taeyong Kim","doi":"10.1109/MMSP.2008.4665194","DOIUrl":"https://doi.org/10.1109/MMSP.2008.4665194","url":null,"abstract":"Graphical expressions and artificial intelligence-driven (AI) game characters have been continuously improving, spurred by the astonishing growth of the game technology industry. Despite such improvements, users are still demanding a more natural gaming environment and reflections of true human emotions. However, the emotions that can currently be expressed are strictly limited because the facial colors and expressions of present game characters are hardly noticeable. Such restrictions can prevent the users from getting fully absorbed in the game. To address this, we developed the facial color change technique, which is a combination emotional model based on human cultural theory, emotional expression pattern using colors, and emotional reaction speed function, as opposed to past methods that expressed emotion through blood flow, pulse, or skin temperature. The reflection of the game characterpsilas emotion on itpsilas skin color will increase user immersion into the game and enrich the playerpsilas experience.","PeriodicalId":402287,"journal":{"name":"2008 IEEE 10th Workshop on Multimedia Signal Processing","volume":"160 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115971794","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Error resilient transcoding of Scalable Video bitstreams","authors":"Yi Guo, Houqiang Li, Ye-Kui Wang, C. Chen","doi":"10.1109/MMSP.2008.4665094","DOIUrl":"https://doi.org/10.1109/MMSP.2008.4665094","url":null,"abstract":"We propose in this paper a novel error resilient transcoding scheme that can be placed at the boundary between wired and wireless networks via heterogeneous network links. This error resilient transcoder shall seamlessly complement the standard scalable video coding (SVC) bitstream to offer additional error resilient adaptation capability for receiving devices. The novel error resilient transcoding scheme consists of three different modules; each is designed to meet various levels of complexity need. The three modules are all based on the loss-aware rate-distortion optimization (LA-RDO) mode decision algorithm we have previously developed for SVC. However, each individual module can be tailored to different complexity requirements depending on whether and how the LA-RDO mode decision is implemented. Another innovation of this approach is the design of a fast rate control algorithm in order to maintain consistent bitrates between input and output of the transcoder. This rate control algorithm only needs picture-level bit information for training target quantization parameters. Simulation results demonstrate that, comparing with standard SVC, the proposed approach is able to achieve up to 4 dB gain for the enhancement layer video and up to 1 dB gain for the base layer video.","PeriodicalId":402287,"journal":{"name":"2008 IEEE 10th Workshop on Multimedia Signal Processing","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116639618","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Music emotion annotation by machine learning","authors":"W. Cheung, Guojun Lu","doi":"10.1109/MMSP.2008.4665144","DOIUrl":"https://doi.org/10.1109/MMSP.2008.4665144","url":null,"abstract":"Music emotion annotation is a task of attaching emotional terms to musical works. As volume of online musical contents expands rapidly in recent years, demands for retrieval by emotion are emerging. Currently, literature on music retrieval using emotional terms is rare. Emotion annotated data are scarce in existing music databases because annotation is still a manual task. Automating music emotion annotation is an essential prerequisite to research in music retrieval by emotion, for without which even sophisticated retrieval methods may not be very useful in a data deficient environment. This paper describes a machine learning approach to annotate music using a large number of emotional terms. We also estimate the training data size requirements for a workable annotation system. Our empirical result shows that 1) the task of music emotion annotation could be modelled using machine learning techniques to support a large number of emotional terms, 2) the combination of sampling method and data-driven detection threshold is highly effective in optimizing the use of existing annotated data in training machine learning models, 3) synonymous relationships enhance the annotation performance and 4) the training data size requirement is within reach for a workable annotation system. Essentially, automatic music emotion annotation enables music retrieval by emotion to be performed as a text retrieval task.","PeriodicalId":402287,"journal":{"name":"2008 IEEE 10th Workshop on Multimedia Signal Processing","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116771937","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Capturing high dynamic range images with partial re-exposures","authors":"B. Guthier, S. Kopf, W. Effelsberg","doi":"10.1109/MMSP.2008.4665082","DOIUrl":"https://doi.org/10.1109/MMSP.2008.4665082","url":null,"abstract":"In this paper we present an optimized approach to capture high dynamic range (HDR) images. It is based on existing methods of creating HDR images by fusing a set of differently exposed low dynamic range (LDR) images. We optimize the capturing process of LDR images towards improved capture speed by using partial re-exposures. That is, we make use of the idea that it is not always necessary to capture full size images when only small portions of the scene require HDR. By analyzing captured images for badly exposed regions and re-exposing selectively, we save overall capture time and increase the frame rate when image sequences are recorded.","PeriodicalId":402287,"journal":{"name":"2008 IEEE 10th Workshop on Multimedia Signal Processing","volume":"70 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131276429","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Mining temporal information and web-casting text for automatic sports event detection","authors":"Minh-Son Dao, N. Babaguchi","doi":"10.1109/MMSP.2008.4665150","DOIUrl":"https://doi.org/10.1109/MMSP.2008.4665150","url":null,"abstract":"In this paper, the generic framework for automatically detecting event based on Allen temporal algebra and external text information support is presented. The motivation of the proposed method is (1) to relax the need of domain knowledge that requires significant human interference; and (2) to take into account the temporal information that has been paid less attention though it is critical to convey event meaning. In order to solve two these problems, in the proposed method, the temporal information is captured by presenting events as the temporal sequences using a lexicon of Allen-based non-ambiguous temporal patterns. These sequences are then used to mine temporal patterns with web-casting text supports by using technique of mining class association rules. Then, the results of previous steps are tailored to build the event detector. Thorough experimental results and comparisons that are carried on more than 30 hours of soccer video corpus captured at different broadcasters and conditions demonstrates that the proposed method meets two aforementioned motivations with high efficiency, effectiveness, and robustness.","PeriodicalId":402287,"journal":{"name":"2008 IEEE 10th Workshop on Multimedia Signal Processing","volume":"3 5","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131436955","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Fast video object segmentation using Markov random field","authors":"C. Mak, W. Cham","doi":"10.1109/MMSP.2008.4665101","DOIUrl":"https://doi.org/10.1109/MMSP.2008.4665101","url":null,"abstract":"A fast video object segmentation algorithm is proposed in this paper. The algorithm utilizes the motion vectors from blocks with variable block sizes to identify background motion model and moving objects. Markov random field is used to model the foreground field to enhance spatial and temporal continuity of objects. To speed up the segmentation time, time-consuming spatial segmentation techniques are avoided. Instead, spatial information in the form of Walsh Hadamard transform coefficients is utilized to improve segmentation accuracy. Experimental results show that the proposed algorithm can effectively extract moving objects from different kind of video sequences. The computation time of the segmentation process is merely about 75 ms per CIF frame using a normal PC, allowing the algorithm to be applied in real-time applications such as video surveillance and conferencing.","PeriodicalId":402287,"journal":{"name":"2008 IEEE 10th Workshop on Multimedia Signal Processing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133909142","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}