MULTIMEDIA '04: Latest Publications

Practical voltage scaling for mobile multimedia devices
MULTIMEDIA '04 | Pub Date: 2004-10-10 | DOI: 10.1145/1027527.1027737
Wanghong Yuan, K. Nahrstedt
Abstract: This paper presents the design, implementation, and evaluation of a practical voltage scaling (PDVS) algorithm for mobile devices primarily running multimedia applications. PDVS seeks to minimize the total energy of the whole device while meeting multimedia timing requirements. To do this, PDVS extends traditional real-time scheduling by deciding at what speed to execute applications, in addition to when to execute them. PDVS makes these decisions based on the discrete speed levels of the CPU, the total power of the device at different speeds, and the probability distribution of the CPU demand of multimedia applications. We have implemented PDVS in the Linux kernel and evaluated it on an HP laptop. Our experimental results show that PDVS saves energy substantially without affecting multimedia performance: it reduces energy by 14.4% to 37.2% compared to scheduling algorithms without voltage scaling, and by up to 10.4% compared to previous voltage scaling algorithms that assume an ideal CPU with continuous speeds and a cubic power-speed relationship.
Citations: 65

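
The speed-selection idea the abstract describes (pick a discrete speed level using the device's power at each speed and the job's demand distribution) can be sketched as follows. This is a hypothetical illustration, not the authors' implementation: the speed levels, power figures, demand distribution, and deadline are made-up values, and PDVS additionally handles kernel-level scheduling of multiple applications.

```python
def expected_energy(speed, power, demand_dist):
    """Expected energy (joules) of one job executed at `speed` (cycles/s).

    demand_dist: list of (cycles, probability) pairs for the job's CPU demand.
    """
    return sum(p * power[speed] * (cycles / speed) for cycles, p in demand_dist)

def choose_speed(speeds, power, demand_dist, deadline):
    """Among discrete speeds that meet the deadline even in the worst case,
    pick the one with the lowest expected energy."""
    worst_case = max(cycles for cycles, _ in demand_dist)
    feasible = [s for s in speeds if worst_case / s <= deadline]
    return min(feasible, key=lambda s: expected_energy(s, power, demand_dist))

speeds = [300e6, 600e6, 1000e6]                  # discrete CPU speed levels (cycles/s)
power = {300e6: 6.0, 600e6: 9.0, 1000e6: 16.0}   # total device power at each speed (W)
demand = [(150e6, 0.7), (250e6, 0.3)]            # (CPU cycles needed, probability)
best = choose_speed(speeds, power, demand, deadline=0.5)
```

Note that the slowest feasible speed is not always the cheapest for the whole device: idle power of other components can make a faster speed win, which is why the expected total energy, not CPU energy alone, is minimized.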
Nonparametric motion model with applications to camera motion pattern classification
MULTIMEDIA '04 | Pub Date: 2004-10-10 | DOI: 10.1145/1027527.1027603
Ling-yu Duan, Mingliang Xu, Q. Tian, Changsheng Xu
Abstract: Motion information is a powerful cue for visual perception. In the context of video indexing and retrieval, motion content serves as a useful source for compact video representation. There is a large literature on parametric motion models; however, it is hard to secure a proper parametric assumption across a wide range of video scenarios. Diverse camera shots and frequent occurrences of bad optical flow estimation motivate us to develop nonparametric motion models. In this paper, we employ the mean shift procedure to propose a novel nonparametric motion representation. With this compact representation, various motion characterization tasks can be achieved by machine learning. Such a learning mechanism can not only capture domain-independent parametric constraints, but also acquire domain-dependent knowledge to tolerate the influence of bad dense optical flow vectors or block-based MPEG motion vector fields (MVFs). The proposed nonparametric motion model has been applied to camera motion pattern classification on 23,191 MVFs extracted from the MPEG-7 dataset.
Citations: 11

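
The mean shift procedure this representation builds on can be shown in miniature. This is a generic flat-kernel mean shift over 2-D motion vectors, not the paper's representation; the vectors, start point, and bandwidth are invented for illustration.

```python
import numpy as np

def mean_shift_mode(points, start, bandwidth=1.0, iters=50):
    """Shift `start` toward the local density mode of `points` (flat kernel)."""
    x = np.asarray(start, dtype=float)
    for _ in range(iters):
        dist = np.linalg.norm(points - x, axis=1)
        window = points[dist <= bandwidth]     # neighbors inside the bandwidth
        if len(window) == 0:
            break
        new_x = window.mean(axis=0)            # mean shift step
        if np.linalg.norm(new_x - x) < 1e-6:   # converged to a mode
            break
        x = new_x
    return x

# Motion vectors of a block-based MVF: a dominant pan cluster plus one outlier.
vectors = np.array([[2.0, 0.0], [2.1, 0.1], [1.9, -0.1], [8.0, 8.0]])
mode = mean_shift_mode(vectors, start=[2.5, 0.5])
```

Because the mode is found from local density rather than a fitted global model, the outlier vector does not drag the estimate, which is the robustness property the abstract appeals to.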
Incremental semi-supervised subspace learning for image retrieval
MULTIMEDIA '04 | Pub Date: 2004-10-10 | DOI: 10.1145/1027527.1027530
Xiaofei He
Abstract: Subspace learning techniques are widespread in pattern recognition research; they include Principal Component Analysis (PCA), Locality Preserving Projection (LPP), and others. These techniques are generally unsupervised, which allows them to model data in the absence of labels or categories. In a relevance-feedback-driven image retrieval system, the information provided by the user can be used to better describe the intrinsic semantic relationships between images. In this paper, we propose a semi-supervised subspace learning algorithm that incrementally learns an adaptive subspace by preserving the semantic structure of the image space, based on user interactions in a relevance-feedback-driven query-by-example system. Our algorithm is capable of accumulating knowledge from users, which can yield new feature representations for images in the database so that the system's future retrieval performance is enhanced. Experiments on a large collection of images have shown the effectiveness and efficiency of the proposed algorithm.
Citations: 96

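
For reference, a minimal sketch of the unsupervised baseline the abstract names (PCA): project images onto the top principal directions of their feature matrix. The data here is a made-up toy; the paper's algorithm is semi-supervised and incremental, which this sketch does not attempt.

```python
import numpy as np

def pca_subspace(X, k):
    """Top-k principal directions of the data matrix X (n samples x d features)."""
    Xc = X - X.mean(axis=0)
    _, _, Vt = np.linalg.svd(Xc, full_matrices=False)  # rows of Vt: directions
    return Vt[:k].T                                    # d x k projection matrix

# Toy feature vectors whose variance lies almost entirely along one direction.
X = np.array([[0.0, 0.0], [1.0, 0.1], [2.0, 0.2], [3.0, 0.3]])
W = pca_subspace(X, k=1)
low_dim = (X - X.mean(axis=0)) @ W  # images embedded in the learned subspace
```

Relevance feedback enters the picture by reshaping such a subspace so that semantically related images, as judged by the user, stay close after projection.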
Real-time background music monitoring based on content-based retrieval
MULTIMEDIA '04 | Pub Date: 2004-10-10 | DOI: 10.1145/1027527.1027550
Yoshiharu Suga, N. Kosugi, M. Morimoto
Abstract: In this paper, we describe music monitoring of TV broadcasts based on content-based retrieval. A segment of the audio signal is sequentially extracted from the broadcast as a retrieval key, a music database (DB) storing a great number of musical pieces is queried with this key, and the musical piece is identified sequentially; in this way, music monitoring is carried out. Three requirements are essential to realizing such monitoring: robustness against non-stationary noise, real-time retrieval over a large-scale music DB, and high granularity of the retrieval key. For robustness against non-stationary noise, we propose a partially similar retrieval method that improves retrieval accuracy by exploiting the moments during which no superfluous noise is present. For real-time retrieval over a large-scale music DB, we adopt a coarse-to-fine strategy and propose a spectral peaks hashing method that performs high-speed candidate refinement; the hash value is computed from the frequency-channel numbers of the spectral peaks. High granularity of the retrieval key requires solving the accuracy degradation that accompanies finer granularity; to improve accuracy, we propose a detection-by-continuity method that exploits the continuity of music. Moreover, using this continuity to correct the starting and ending points of a musical piece within the broadcast improves retrieval accuracy further.
To evaluate the effectiveness of the proposed methods, we performed experiments using a music DB storing over 28,000 musical pieces (over 1,800 hours) and TV broadcast audio signals containing music and background music (BGM). The granularity of the retrieval key was set at about 0.5 seconds. Through these experiments, we verified that music monitoring was possible for over 90% of the total time of music and BGM used in the broadcasts, and that real-time processing was possible.
Citations: 6

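
The spectral peaks hashing idea (a hash value computed from the frequency-channel numbers of the strongest peaks) can be sketched as follows. This is a hypothetical toy, not the paper's method: the sample rate, FFT length, frame, and peak count are all invented.

```python
import numpy as np

def peak_channels(frame, n_peaks=3, n_fft=256):
    """Frequency-channel numbers of the n_peaks strongest spectral peaks."""
    spectrum = np.abs(np.fft.rfft(frame, n=n_fft))
    return tuple(int(c) for c in np.sort(np.argsort(spectrum)[-n_peaks:]))

def spectral_peak_hash(frame, n_peaks=3, n_fft=256):
    """Hash a short audio frame by its peak channel numbers, so a large music
    DB can be narrowed to a few candidates before fine matching."""
    return hash(peak_channels(frame, n_peaks, n_fft))

sr = 8000
t = np.arange(256) / sr
# Toy "music" frame with three tones at 500, 1000, and 1500 Hz.
frame = (np.sin(2 * np.pi * 500 * t) + 0.8 * np.sin(2 * np.pi * 1000 * t)
         + 0.6 * np.sin(2 * np.pi * 1500 * t))
```

An index mapping each hash to (piece, offset) pairs then gives constant-time coarse lookup per query frame, which is what makes the coarse-to-fine strategy real-time over a large DB.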
GURU: a multimedia distance-learning framework for users with disabilities
MULTIMEDIA '04 | Pub Date: 2004-10-10 | DOI: 10.1145/1027527.1027698
Vidhya Balasubramanian, N. Venkatasubramanian
Abstract: GURU is a distance-learning environment that renders multimedia information to users with disabilities in an accessible manner. It is an implementation framework developed as part of an effort to provide accessible multimedia information to end users with perceptual (visual and auditory), cognitive, or motor impairments. GURU is based on the MPEG-4 standard, and it modifies MP4 content and the presentation of the different objects in the scene dynamically, based on users' visual, auditory, and motor abilities. This paper briefly describes the implementation of the prototype framework and illustrates sample adaptations as implemented in this framework.
Citations: 1

Interactive manipulation of replay speed while listening to speech recordings
MULTIMEDIA '04 | Pub Date: 2004-10-10 | DOI: 10.1145/1027527.1027645
Wolfgang Hürst, T. Lauer, Georg Götz
Abstract: Today's interfaces for time-scaled audio replay have limitations, especially for highly interactive tasks such as skimming and searching, which require quick, temporary speed changes. Motivated by this shortcoming, we introduce a new interaction technique for speech skimming based on the so-called rubber-band metaphor. We propose an "elastic" audio slider that is especially useful for temporary manipulation of replay speed and that integrates seamlessly into standard interface designs. The feasibility of this concept is demonstrated by an initial user study.
Citations: 9

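
The rubber-band metaphor can be illustrated with a hypothetical displacement-to-speed mapping: dragging the slider stretches the speed away from its resting value, and releasing it snaps back. The mapping, stiffness constant, and speed limits below are invented, not taken from the paper.

```python
def elastic_speed(displacement, base=1.0, stiffness=0.5, lo=0.25, hi=3.0):
    """Rubber-band mapping from slider displacement to replay speed.

    displacement = 0 (released slider) returns the base speed, so speed
    changes are temporary by construction, which suits skimming tasks.
    """
    return max(lo, min(hi, base + stiffness * displacement))
```

For example, `elastic_speed(2.0)` doubles the replay speed while the slider is held, and `elastic_speed(0.0)` restores normal speed on release.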
Nonparametric motion model
MULTIMEDIA '04 | Pub Date: 2004-10-10 | DOI: 10.1145/1027527.1027700
Ling-yu Duan, Mingliang Xu, Q. Tian, Changsheng Xu
Abstract: Motion information is a powerful cue for visual perception. In the context of video indexing and retrieval, motion content serves as a useful source for compact video representation. There is a large literature on parametric motion models; however, it is hard to secure a proper parametric assumption across a wide range of video scenarios. Diverse camera shots and frequent occurrences of improper optical flow estimation or block matching motivate us to develop nonparametric motion models. In this demonstration, we present a novel nonparametric motion model. Its unique features are: 1) instead of computationally expensive and vulnerable parametric regression, the proposed model bases motion characterization on the classification of motion patterns; 2) we employ machine learning to capture the knowledge needed to recognize camera motion patterns from bad motion vector fields (MVFs); and 3) with mean shift filtering, the proposed motion representation elegantly incorporates spatial-range information for noise removal and discontinuity-preserving smoothing of MVFs.
Promising results have been achieved on two tasks: 1) camera motion pattern recognition on 23,191 MVFs, and 2) recognition of the intensity of motion activity on 622 video segments culled from the MPEG-7 dataset.
Citations: 2

A semi-naïve Bayesian method incorporating clustering with pair-wise constraints for auto image annotation
MULTIMEDIA '04 | Pub Date: 2004-10-10 | DOI: 10.1145/1027527.1027605
Wanjun Jin, Rui Shi, Tat-Seng Chua
Abstract: We propose a novel approach to automatic image annotation. In our approach, we first segment images into regions and cluster the regions, then learn the relationship between concepts and region clusters from a set of training images with pre-assigned concepts. The focus of this paper is two-fold. First, in the learning stage, we cluster regions into region clusters while incorporating pair-wise constraints derived from the language model underlying the annotations of the training images. Second, in the annotation stage, we employ a semi-naïve Bayes model to compute the posterior probability of concepts given the region clusters. Experimental results show that our proposed system, using these two strategies, outperforms state-of-the-art techniques in annotating a large image collection.
Citations: 24

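
The annotation-stage computation (posterior probability of a concept given the observed region clusters) can be sketched with a plain naive-Bayes factorization. The paper's semi-naïve variant additionally groups dependent region clusters, which this sketch omits; the concepts, priors, and likelihoods below are made-up toy values.

```python
import math

def log_posterior(concept, clusters, prior, likelihood):
    """log P(concept | clusters) up to a constant, assuming the observed
    region clusters are conditionally independent given the concept."""
    return math.log(prior[concept]) + sum(
        math.log(likelihood[concept].get(c, 1e-9)) for c in clusters)

def annotate(clusters, prior, likelihood, top_k=2):
    """Rank concepts by posterior and return the top_k as annotations."""
    scores = {c: log_posterior(c, clusters, prior, likelihood) for c in prior}
    return sorted(scores, key=scores.get, reverse=True)[:top_k]

# Toy model learned from "training images": P(concept) and P(cluster | concept).
prior = {"tiger": 0.3, "sky": 0.7}
likelihood = {"tiger": {"stripes": 0.6, "grass": 0.3},
              "sky": {"cloud": 0.8, "grass": 0.1}}
labels = annotate(["stripes", "grass"], prior, likelihood)
```

The small floor value (1e-9) stands in for smoothing of unseen concept-cluster pairs, which any real annotator needs.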
A comparative study on attributed relational graph matching algorithms for perceptual 3-D shape descriptor in MPEG-7
MULTIMEDIA '04 | Pub Date: 2004-10-10 | DOI: 10.1145/1027527.1027686
Duck Hoon Kim, I. Yun, Sang Uk Lee
Abstract: The demand for user-friendly querying interfaces such as query-by-sketch and query-by-editing is an important issue for content-based retrieval systems over 3-D object databases. In MPEG-7 in particular, the Perceptual 3-D Shape (P3DS) descriptor has been developed to provide user-friendly querying that is not covered by existing international standards for the description and browsing of 3-D object databases. Since the P3DS descriptor is based on a part-based representation of the 3-D object, it is a kind of attributed relational graph (ARG), so ARG matching naturally follows as the core procedure for similarity matching of P3DS descriptors. In this paper, given a P3DS database derived from the corresponding 3-D object database, we investigate the pros and cons of the target ARG matching algorithms. To provide objective evidence for our conclusions, we conducted experiments on a database of 480 3-D objects in 33 categories, in terms of the bull's eye performance, the average normalized modified retrieval rate, and the precision/recall curve.
Citations: 11

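
ARG matching, the core procedure named above, can be illustrated with a brute-force toy: find the node correspondence that minimizes the total node-attribute and edge-attribute difference. The matchers compared in the study are far more sophisticated, and the node/edge attributes below are invented; brute force is only viable because part-based shape graphs are small.

```python
from itertools import permutations

def arg_distance(nodes_a, edges_a, nodes_b, edges_b):
    """Minimum matching cost between two ARGs of equal node count.

    nodes_*: list of scalar node attributes (e.g. relative part volume).
    edges_*: {(i, j): scalar relation attribute} (e.g. distance between parts).
    """
    n = len(nodes_a)
    best = float("inf")
    for perm in permutations(range(n)):       # every node correspondence
        cost = sum(abs(nodes_a[i] - nodes_b[perm[i]]) for i in range(n))
        cost += sum(abs(w - edges_b.get((perm[i], perm[j]), 0.0))
                    for (i, j), w in edges_a.items())
        best = min(best, cost)
    return best

nodes = [1.0, 2.0, 3.0]             # made-up part attributes
edges = {(0, 1): 0.5, (1, 2): 0.2}  # made-up relations between parts
```

A graph matched against itself, or against a relabeled copy, scores zero, which is the invariance that makes part-based retrieval insensitive to node ordering.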
When code is content: experiments with a whistling machine
MULTIMEDIA '04 | Pub Date: 2004-10-10 | DOI: 10.1145/1027527.1027761
M. Böhlen, J. Rinker
Abstract: The Universal Whistling Machine (U.W.M.) senses the presence of people in its vicinity and attracts them with a signature whistle. Given a response whistle, U.W.M. counters with its own composition, based on a time-frequency analysis of the original.
Citations: 1