Deploying Deep Belief Nets for content based audio music similarity

Aggelos Gkiokas, V. Katsouros, G. Carayannis
{"title":"Deploying Deep Belief Nets for content based audio music similarity","authors":"Aggelos Gkiokas, V. Katsouros, G. Carayannis","doi":"10.1109/IISA.2014.6878797","DOIUrl":null,"url":null,"abstract":"In this paper a method for computing an audio based similarity between music excerpts is presented. The method consists of three main parts, with the first step being feature extraction, which involves the calculation of three feature sets that correspond to music timbre, rhythm and harmony. Next, for each feature set a Deep Belief Network was trained without supervision on a large music collection. The respective distances of the output units of the Deep Belief Networks between two music excerpts are computed, normalized and finally combined to form the distance measure. The proposed method was evaluated on the MIREX 2013 Audio Music Similarity task. Results are encouraging, however, they indicate that the harmonic similarity component degrades the performance.","PeriodicalId":298835,"journal":{"name":"IISA 2014, The 5th International Conference on Information, Intelligence, Systems and Applications","volume":"196 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-07-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IISA 2014, The 5th International Conference on Information, Intelligence, Systems and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IISA.2014.6878797","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

In this paper a method for computing an audio based similarity between music excerpts is presented. The method consists of three main parts, with the first step being feature extraction, which involves the calculation of three feature sets that correspond to music timbre, rhythm and harmony. Next, for each feature set a Deep Belief Network was trained without supervision on a large music collection. The respective distances of the output units of the Deep Belief Networks between two music excerpts are computed, normalized and finally combined to form the distance measure. The proposed method was evaluated on the MIREX 2013 Audio Music Similarity task. Results are encouraging, however, they indicate that the harmonic similarity component degrades the performance.
为基于内容的音频音乐相似度部署深度信念网
本文提出了一种基于音频的音乐节选相似度计算方法。该方法主要包括三个部分,第一步是特征提取,计算三个特征集,分别对应音乐的音色、节奏和和声。接下来,对于每个特征集,深度信念网络在没有监督的情况下对大型音乐集合进行训练。计算两个音乐节选之间深度信念网络输出单元的各自距离,并进行归一化,最后组合形成距离度量。在MIREX 2013音频音乐相似性任务上对该方法进行了评估。结果令人鼓舞,然而,他们表明谐波相似分量降低了性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信