Searching for dominant high-level features for Music Information Retrieval

2012 Proceedings of the 20th European Signal Processing Conference (EUSIPCO) Pub Date : 2012-10-18 DOI:10.5281/ZENODO.52248

M. Zanoni, Daniele Ciminieri, A. Sarti, S. Tubaro

引用次数: 13

Abstract

Music Information Retrieval systems are often based on the analysis of a large number of low-level audio features. When dealing with problems of musical genre description and visualization, however, it would be desirable to work with a very limited number of highly informative and discriminant macro-descriptors. In this paper we focus on a specific class of training-based descriptors, which are obtained as the log-likelihood of a Gaussian Mixture Model trained with short musical excerpts that selectively exhibit a certain semantic homogeneity. As these descriptors are critically dependent on the training sets, we approach the problem of how to automatically generate suitable training sets and optimize the associated macro-features in terms of discriminant power and informative impact. We then show the application of a set of three identified macro-features to genre visualization, tracking and classification.

查看原文本刊更多论文

音乐信息检索的主要高级特征搜索

音乐信息检索系统往往是基于对大量低级音频特征的分析。然而，在处理音乐类型描述和可视化问题时，最好使用数量非常有限的高信息量和判别性宏观描述符。在本文中，我们专注于一类特定的基于训练的描述符，这些描述符是由高斯混合模型的对数似然模型得到的，该模型使用有选择性地表现出一定的语义同质性的短音乐片段进行训练。由于这些描述符严重依赖于训练集，我们研究了如何自动生成合适的训练集并在判别能力和信息影响方面优化相关的宏观特征的问题。然后，我们展示了一组三个已识别的宏观特征在类型可视化、跟踪和分类中的应用。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2012 Proceedings of the 20th European Signal Processing Conference (EUSIPCO)

自引率

0.00%

发文量