{"title":"基于高分辨倒谱调制谱的音乐类型分类","authors":"A. Nagathil, Timo Gerkmann, Rainer Martin","doi":"10.5281/ZENODO.41849","DOIUrl":null,"url":null,"abstract":"We propose new features for musical genre classification which are based on the modulation spectrum of cepstral coefficients, and investigate the impact of the modulation frequency resolution on the classification accuracy. We compare the performance of the novel feature set which is derived from a high-resolution modulation spectrum to that of two feature sets which are either based on a coarsely resolved modulation spectrum or roughly summarize the modulation energy in a few bands. From the results of a 5-class musical genre classification experiment it can be concluded that a high modulation frequency resolution is crucial for representing the harmonic modulation structure of Electronic music in particular. The proposed features outperform the two competing methods with an overall detection rate of 81%. After computing the cepstral modulation spectrum with efficient FFT operations, the computational complexity for feature extraction is fairly low as only 22 low-level features need to be computed.","PeriodicalId":409817,"journal":{"name":"2010 18th European Signal Processing Conference","volume":"3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":"{\"title\":\"Musical genre classification based on a highly-resolved cepstral modulation spectrum\",\"authors\":\"A. Nagathil, Timo Gerkmann, Rainer Martin\",\"doi\":\"10.5281/ZENODO.41849\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We propose new features for musical genre classification which are based on the modulation spectrum of cepstral coefficients, and investigate the impact of the modulation frequency resolution on the classification accuracy. We compare the performance of the novel feature set which is derived from a high-resolution modulation spectrum to that of two feature sets which are either based on a coarsely resolved modulation spectrum or roughly summarize the modulation energy in a few bands. From the results of a 5-class musical genre classification experiment it can be concluded that a high modulation frequency resolution is crucial for representing the harmonic modulation structure of Electronic music in particular. The proposed features outperform the two competing methods with an overall detection rate of 81%. After computing the cepstral modulation spectrum with efficient FFT operations, the computational complexity for feature extraction is fairly low as only 22 low-level features need to be computed.\",\"PeriodicalId\":409817,\"journal\":{\"name\":\"2010 18th European Signal Processing Conference\",\"volume\":\"3 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2010-08-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"10\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2010 18th European Signal Processing Conference\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.5281/ZENODO.41849\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 18th European Signal Processing Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5281/ZENODO.41849","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Musical genre classification based on a highly-resolved cepstral modulation spectrum
We propose new features for musical genre classification which are based on the modulation spectrum of cepstral coefficients, and investigate the impact of the modulation frequency resolution on the classification accuracy. We compare the performance of the novel feature set which is derived from a high-resolution modulation spectrum to that of two feature sets which are either based on a coarsely resolved modulation spectrum or roughly summarize the modulation energy in a few bands. From the results of a 5-class musical genre classification experiment it can be concluded that a high modulation frequency resolution is crucial for representing the harmonic modulation structure of Electronic music in particular. The proposed features outperform the two competing methods with an overall detection rate of 81%. After computing the cepstral modulation spectrum with efficient FFT operations, the computational complexity for feature extraction is fairly low as only 22 low-level features need to be computed.