Deploying Deep Belief Nets for content based audio music similarity

IISA 2014, The 5th International Conference on Information, Intelligence, Systems and Applications Pub Date : 2014-07-07 DOI:10.1109/IISA.2014.6878797

Aggelos Gkiokas, V. Katsouros, G. Carayannis

引用次数: 1

Abstract

In this paper a method for computing an audio based similarity between music excerpts is presented. The method consists of three main parts, with the first step being feature extraction, which involves the calculation of three feature sets that correspond to music timbre, rhythm and harmony. Next, for each feature set a Deep Belief Network was trained without supervision on a large music collection. The respective distances of the output units of the Deep Belief Networks between two music excerpts are computed, normalized and finally combined to form the distance measure. The proposed method was evaluated on the MIREX 2013 Audio Music Similarity task. Results are encouraging, however, they indicate that the harmonic similarity component degrades the performance.

查看原文本刊更多论文

为基于内容的音频音乐相似度部署深度信念网

本文提出了一种基于音频的音乐节选相似度计算方法。该方法主要包括三个部分，第一步是特征提取，计算三个特征集，分别对应音乐的音色、节奏和和声。接下来，对于每个特征集，深度信念网络在没有监督的情况下对大型音乐集合进行训练。计算两个音乐节选之间深度信念网络输出单元的各自距离，并进行归一化，最后组合形成距离度量。在MIREX 2013音频音乐相似性任务上对该方法进行了评估。结果令人鼓舞，然而，他们表明谐波相似分量降低了性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IISA 2014, The 5th International Conference on Information, Intelligence, Systems and Applications

自引率

0.00%

发文量