{"title":"A full-text retrieval approach to content-based audio identification","authors":"Andreas Ribbrock, F. Kurth","doi":"10.1109/MMSP.2002.1203280","DOIUrl":null,"url":null,"abstract":"We give an overview on a novel framework for content-based multimedia retrieval. In this paper, we present an implementation for audio identification. This framework consists of an index-based search combining algebraic methods with classical full-text retrieval. In the main part of the paper, we propose several feature extractors which may be used for indexing the PCM audio data. We give an overview on our test results containing performance data (e.g. query response times), memory requirements (e.g., index size), and robustness issues. The size of our index turns out to be only a 1/1000th to about 1/15000th of the original PCM material depending on the required granularity for identifying a piece of audio.","PeriodicalId":398813,"journal":{"name":"2002 IEEE Workshop on Multimedia Signal Processing.","volume":"47 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2002-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"13","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2002 IEEE Workshop on Multimedia Signal Processing.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MMSP.2002.1203280","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 13
Abstract
We give an overview on a novel framework for content-based multimedia retrieval. In this paper, we present an implementation for audio identification. This framework consists of an index-based search combining algebraic methods with classical full-text retrieval. In the main part of the paper, we propose several feature extractors which may be used for indexing the PCM audio data. We give an overview on our test results containing performance data (e.g. query response times), memory requirements (e.g., index size), and robustness issues. The size of our index turns out to be only a 1/1000th to about 1/15000th of the original PCM material depending on the required granularity for identifying a piece of audio.