Multimedia edges: finding hierarchy in all dimensions

MULTIMEDIA '01 Pub Date : 2001-10-01 DOI:10.1145/500141.500149

M. Slaney, D. Ponceleón, James Kaufman

引用次数: 22

Abstract

This paper describes a new unified representation for the informa¿tion in a video. We reduce the dimensionality of the signal with either a singular-value decomposition (on the semantic and image data) or mel-frequency cepstral coefficients (on the audio data) and then concatenate the vectors to form a multi-dimensional represen¿tation of the video. Using scale-space techniques we find large jumps in the video's path, which we call edges. We use these tech¿niques to analyze the temporal properties of the audio and image data in a video. This analysis creates a hierarchical segmentation of the video, or a table-of-contents, from the audio, semantic and image data.

查看原文本刊更多论文

多媒体边:在所有维度中寻找层次结构

本文描述了视频信息的一种新的统一表示。我们通过奇值分解(在语义和图像数据上)或梅尔频率倒谱系数(在音频数据上)来降低信号的维数，然后将向量连接起来，形成视频的多维表示。使用比例空间技术，我们在视频的路径中找到了大的跳跃，我们称之为边缘。我们使用这些技术来分析视频中音频和图像数据的时间属性。这种分析从音频、语义和图像数据中创建了视频的分层分割或目录。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

MULTIMEDIA '01

自引率

0.00%

发文量