基于高分辨倒谱调制谱的音乐类型分类

A. Nagathil, Timo Gerkmann, Rainer Martin
{"title":"基于高分辨倒谱调制谱的音乐类型分类","authors":"A. Nagathil, Timo Gerkmann, Rainer Martin","doi":"10.5281/ZENODO.41849","DOIUrl":null,"url":null,"abstract":"We propose new features for musical genre classification which are based on the modulation spectrum of cepstral coefficients, and investigate the impact of the modulation frequency resolution on the classification accuracy. We compare the performance of the novel feature set which is derived from a high-resolution modulation spectrum to that of two feature sets which are either based on a coarsely resolved modulation spectrum or roughly summarize the modulation energy in a few bands. From the results of a 5-class musical genre classification experiment it can be concluded that a high modulation frequency resolution is crucial for representing the harmonic modulation structure of Electronic music in particular. The proposed features outperform the two competing methods with an overall detection rate of 81%. After computing the cepstral modulation spectrum with efficient FFT operations, the computational complexity for feature extraction is fairly low as only 22 low-level features need to be computed.","PeriodicalId":409817,"journal":{"name":"2010 18th European Signal Processing Conference","volume":"3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":"{\"title\":\"Musical genre classification based on a highly-resolved cepstral modulation spectrum\",\"authors\":\"A. Nagathil, Timo Gerkmann, Rainer Martin\",\"doi\":\"10.5281/ZENODO.41849\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We propose new features for musical genre classification which are based on the modulation spectrum of cepstral coefficients, and investigate the impact of the modulation frequency resolution on the classification accuracy. We compare the performance of the novel feature set which is derived from a high-resolution modulation spectrum to that of two feature sets which are either based on a coarsely resolved modulation spectrum or roughly summarize the modulation energy in a few bands. From the results of a 5-class musical genre classification experiment it can be concluded that a high modulation frequency resolution is crucial for representing the harmonic modulation structure of Electronic music in particular. The proposed features outperform the two competing methods with an overall detection rate of 81%. After computing the cepstral modulation spectrum with efficient FFT operations, the computational complexity for feature extraction is fairly low as only 22 low-level features need to be computed.\",\"PeriodicalId\":409817,\"journal\":{\"name\":\"2010 18th European Signal Processing Conference\",\"volume\":\"3 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2010-08-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"10\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2010 18th European Signal Processing Conference\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.5281/ZENODO.41849\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 18th European Signal Processing Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5281/ZENODO.41849","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 10

摘要

提出了基于倒谱系数调制谱的音乐类型分类新特征,并研究了调制频率分辨率对分类精度的影响。我们将基于高分辨率调制频谱的新特征集的性能与基于粗分辨调制频谱或粗略总结几个波段的调制能量的两种特征集的性能进行了比较。通过对5类音乐类型的分类实验,得出了高调制频率分辨率对表征电子音乐的谐波调制结构至关重要的结论。所提出的特征优于两种竞争方法,总检测率为81%。利用高效FFT运算计算倒谱调制谱后,特征提取的计算复杂度相当低,只需要计算22个底层特征。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Musical genre classification based on a highly-resolved cepstral modulation spectrum
We propose new features for musical genre classification which are based on the modulation spectrum of cepstral coefficients, and investigate the impact of the modulation frequency resolution on the classification accuracy. We compare the performance of the novel feature set which is derived from a high-resolution modulation spectrum to that of two feature sets which are either based on a coarsely resolved modulation spectrum or roughly summarize the modulation energy in a few bands. From the results of a 5-class musical genre classification experiment it can be concluded that a high modulation frequency resolution is crucial for representing the harmonic modulation structure of Electronic music in particular. The proposed features outperform the two competing methods with an overall detection rate of 81%. After computing the cepstral modulation spectrum with efficient FFT operations, the computational complexity for feature extraction is fairly low as only 22 low-level features need to be computed.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信