一种大型音频流中无监督发现信号基序的有效方法

2011 9th International Workshop on Content-Based Multimedia Indexing (CBMI) Pub Date : 2011-06-13 DOI:10.1109/CBMI.2011.5972536

Armando Muscariello, G. Gravier, F. Bimbot

{"title":"一种大型音频流中无监督发现信号基序的有效方法","authors":"Armando Muscariello, G. Gravier, F. Bimbot","doi":"10.1109/CBMI.2011.5972536","DOIUrl":null,"url":null,"abstract":"Providing effective tools to navigate and access through long audio archives, or monitor and classify broadcast streams, proves to be an extremely challenging task. Main issues originate from the varied nature of patterns of interest in a composite audio environment, the massive size of such databases, and the capability of performing when prior knowledge on audio content is scarce or absent. This paper proposes a computational architecture aimed at discovering occurrences of repeating patterns in audio streams by means of unsupervised learning. The targeted repetitions (or motifs) are called signalling, by analogy with a biological nomenclature, as referring to a broad class of audio patterns (as jingles, songs, advertisements, etc…) frequently occurring in broadcast audio. We adapt a system originally developed for word discovery applications, and demonstrate its effectiveness in a song discovery scenario. The adaption consists in speeding up critical parts of the computations, mostly based on audio feature coarsening, to deal with the large occurrence period of repeating songs in radio streams.","PeriodicalId":358337,"journal":{"name":"2011 9th International Workshop on Content-Based Multimedia Indexing (CBMI)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-06-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":"{\"title\":\"An efficient method for the unsupervised discovery of signalling motifs in large audio streams\",\"authors\":\"Armando Muscariello, G. Gravier, F. Bimbot\",\"doi\":\"10.1109/CBMI.2011.5972536\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Providing effective tools to navigate and access through long audio archives, or monitor and classify broadcast streams, proves to be an extremely challenging task. Main issues originate from the varied nature of patterns of interest in a composite audio environment, the massive size of such databases, and the capability of performing when prior knowledge on audio content is scarce or absent. This paper proposes a computational architecture aimed at discovering occurrences of repeating patterns in audio streams by means of unsupervised learning. The targeted repetitions (or motifs) are called signalling, by analogy with a biological nomenclature, as referring to a broad class of audio patterns (as jingles, songs, advertisements, etc…) frequently occurring in broadcast audio. We adapt a system originally developed for word discovery applications, and demonstrate its effectiveness in a song discovery scenario. The adaption consists in speeding up critical parts of the computations, mostly based on audio feature coarsening, to deal with the large occurrence period of repeating songs in radio streams.\",\"PeriodicalId\":358337,\"journal\":{\"name\":\"2011 9th International Workshop on Content-Based Multimedia Indexing (CBMI)\",\"volume\":\"21 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2011-06-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"10\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2011 9th International Workshop on Content-Based Multimedia Indexing (CBMI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CBMI.2011.5972536\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 9th International Workshop on Content-Based Multimedia Indexing (CBMI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CBMI.2011.5972536","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 10

摘要

提供有效的工具来导航和访问长音频档案，或监控和分类广播流，被证明是一项极具挑战性的任务。主要问题源于复合音频环境中感兴趣的模式的不同性质，此类数据库的巨大规模，以及在缺乏或缺乏音频内容的先验知识时执行的能力。本文提出了一种计算架构，旨在通过无监督学习发现音频流中重复模式的出现。目标重复(或母题)被称为信号，与生物学命名法类似，指的是广播音频中经常出现的一大类音频模式(如叮当声、歌曲、广告等)。我们采用了最初为单词发现应用程序开发的系统，并在歌曲发现场景中演示了其有效性。这种适应包括加速关键部分的计算，主要基于音频特征粗化，以处理广播流中重复歌曲的大量出现周期。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

An efficient method for the unsupervised discovery of signalling motifs in large audio streams

Providing effective tools to navigate and access through long audio archives, or monitor and classify broadcast streams, proves to be an extremely challenging task. Main issues originate from the varied nature of patterns of interest in a composite audio environment, the massive size of such databases, and the capability of performing when prior knowledge on audio content is scarce or absent. This paper proposes a computational architecture aimed at discovering occurrences of repeating patterns in audio streams by means of unsupervised learning. The targeted repetitions (or motifs) are called signalling, by analogy with a biological nomenclature, as referring to a broad class of audio patterns (as jingles, songs, advertisements, etc…) frequently occurring in broadcast audio. We adapt a system originally developed for word discovery applications, and demonstrate its effectiveness in a song discovery scenario. The adaption consists in speeding up critical parts of the computations, mostly based on audio feature coarsening, to deal with the large occurrence period of repeating songs in radio streams.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2011 9th International Workshop on Content-Based Multimedia Indexing (CBMI)

自引率

0.00%

发文量