A full-text retrieval approach to content-based audio identification

2002 IEEE Workshop on Multimedia Signal Processing. Pub Date : 2002-12-09 DOI:10.1109/MMSP.2002.1203280

Andreas Ribbrock, F. Kurth

引用次数: 13

Abstract

We give an overview on a novel framework for content-based multimedia retrieval. In this paper, we present an implementation for audio identification. This framework consists of an index-based search combining algebraic methods with classical full-text retrieval. In the main part of the paper, we propose several feature extractors which may be used for indexing the PCM audio data. We give an overview on our test results containing performance data (e.g. query response times), memory requirements (e.g., index size), and robustness issues. The size of our index turns out to be only a 1/1000th to about 1/15000th of the original PCM material depending on the required granularity for identifying a piece of audio.

查看原文本刊更多论文

基于内容的音频识别的全文检索方法

本文概述了一种基于内容的多媒体检索框架。在本文中，我们提出了一个音频识别的实现。该框架包括基于索引的搜索，将代数方法与经典全文检索相结合。在论文的主要部分，我们提出了几种可用于PCM音频数据索引的特征提取器。我们概述了我们的测试结果，包括性能数据(例如查询响应时间)、内存需求(例如索引大小)和鲁棒性问题。根据识别音频片段所需的粒度，我们的索引大小仅为原始PCM材料的1/1000到1/15000。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2002 IEEE Workshop on Multimedia Signal Processing.

自引率

0.00%

发文量