{"title":"Music genre recognition using spectrograms with harmonic-percussive sound separation","authors":"R. L. Aguiar, Yandre M. G. Costa, L. Nanni","doi":"10.1109/SCCC.2016.7836027","DOIUrl":null,"url":null,"abstract":"In this work we assesses the music genre classification using spectrograms taken from the original signal, percussive content signal, and harmonic content signal. The rationale behind this is that classifiers obtained from this three different representation of the signal may present some complementarity to each other. By this way, one can improve the recognition rates already obtained in previous works which has explored only the original signal content. LBP texture features were used to represent the spectrogram content, and the classification step was supported by SVM. The spectrogram images were zoned taking to account a perceptual scale, and a specific classifier was created for each zone, which has led us to combine classifiers outputs to get the final decision. The performance of our approach reaches the recognition rate about 88.56% which, to the best of our knowledge, is the best rate ever obtained on the LMD dataset using artist filter constraint.","PeriodicalId":432676,"journal":{"name":"2016 35th International Conference of the Chilean Computer Science Society (SCCC)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 35th International Conference of the Chilean Computer Science Society (SCCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SCCC.2016.7836027","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
In this work we assesses the music genre classification using spectrograms taken from the original signal, percussive content signal, and harmonic content signal. The rationale behind this is that classifiers obtained from this three different representation of the signal may present some complementarity to each other. By this way, one can improve the recognition rates already obtained in previous works which has explored only the original signal content. LBP texture features were used to represent the spectrogram content, and the classification step was supported by SVM. The spectrogram images were zoned taking to account a perceptual scale, and a specific classifier was created for each zone, which has led us to combine classifiers outputs to get the final decision. The performance of our approach reaches the recognition rate about 88.56% which, to the best of our knowledge, is the best rate ever obtained on the LMD dataset using artist filter constraint.