{"title":"一种用于全球和区域音乐数据集评估的鲁棒音乐类型分类方法","authors":"Jefferson Martins de Sousa, E. Pereira, L. Veloso","doi":"10.1109/ICDSP.2016.7868526","DOIUrl":null,"url":null,"abstract":"This paper deals with two problems: (1) the selection of a set of music features in order to achieve high genre classification accuracies; (2) the absence of a representative music dataset of regional Brazilian music. In this paper, we propose a set of features to classify genres of music. The features proposed were obtained by a methodical selection of important features used in the literature of Music Information Retrieval (MIR) and Music Emotion Recognition (MER). Besides, we propose a new music dataset called BMD (Brazilian Music Dataset)1, containing 120 songs labeled in 7 musical genres: FoFFó, Rock, Repente, MPB(Música Popular Brasileira — Brazilian Popular Music), Brega, Sertanejo and Disco. An important characteristic of this new dataset compared with others, is the presence of three popular genres in Brazil Northeast region: Repente, Brega and a characteristic genre similar to MPB, which we also call as MPB. We evaluated our proposed features on both datasets: GTZAN and BMD. The proposed approach achieved average accuracy (after 30 runs of 5-fold-cross-validations) of 79.7% for GTZAN and 86.11% for the BMD. Another important contribution of this work is random repetition of cross-validation executions. Most of the papers performs only a single n-fold cross-validation. We criticize that practice and propose, at least, 30 random executions to compute the average accuracy.","PeriodicalId":206199,"journal":{"name":"2016 IEEE International Conference on Digital Signal Processing (DSP)","volume":"82 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"19","resultStr":"{\"title\":\"A robust music genre classification approach for global and regional music datasets evaluation\",\"authors\":\"Jefferson Martins de Sousa, E. Pereira, L. Veloso\",\"doi\":\"10.1109/ICDSP.2016.7868526\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper deals with two problems: (1) the selection of a set of music features in order to achieve high genre classification accuracies; (2) the absence of a representative music dataset of regional Brazilian music. In this paper, we propose a set of features to classify genres of music. The features proposed were obtained by a methodical selection of important features used in the literature of Music Information Retrieval (MIR) and Music Emotion Recognition (MER). Besides, we propose a new music dataset called BMD (Brazilian Music Dataset)1, containing 120 songs labeled in 7 musical genres: FoFFó, Rock, Repente, MPB(Música Popular Brasileira — Brazilian Popular Music), Brega, Sertanejo and Disco. An important characteristic of this new dataset compared with others, is the presence of three popular genres in Brazil Northeast region: Repente, Brega and a characteristic genre similar to MPB, which we also call as MPB. We evaluated our proposed features on both datasets: GTZAN and BMD. The proposed approach achieved average accuracy (after 30 runs of 5-fold-cross-validations) of 79.7% for GTZAN and 86.11% for the BMD. Another important contribution of this work is random repetition of cross-validation executions. Most of the papers performs only a single n-fold cross-validation. We criticize that practice and propose, at least, 30 random executions to compute the average accuracy.\",\"PeriodicalId\":206199,\"journal\":{\"name\":\"2016 IEEE International Conference on Digital Signal Processing (DSP)\",\"volume\":\"82 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"19\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 IEEE International Conference on Digital Signal Processing (DSP)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICDSP.2016.7868526\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 IEEE International Conference on Digital Signal Processing (DSP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDSP.2016.7868526","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 19
摘要
本文主要解决两个问题:(1)选择一组音乐特征以达到较高的体裁分类准确率;(2)缺乏具有代表性的巴西区域音乐数据集。在本文中,我们提出了一套特征来分类音乐的类型。所提出的特征是通过系统地选择音乐信息检索(MIR)和音乐情感识别(MER)文献中使用的重要特征得到的。此外,我们提出了一个名为BMD(巴西音乐数据集)1的新音乐数据集,其中包含120首歌曲,分为7种音乐类型:FoFFó、Rock、Repente、MPB(Música Popular Brasileira—巴西流行音乐)、Brega、Sertanejo和Disco。与其他数据集相比,这个新数据集的一个重要特征是巴西东北地区存在三种流行的类型:Repente, Brega和一种类似MPB的特征类型,我们也称之为MPB。我们在两个数据集上评估了我们提出的特征:GTZAN和BMD。该方法的平均准确率(经过30次5次交叉验证)为79.7%的GTZAN和86.11%的BMD。这项工作的另一个重要贡献是随机重复交叉验证执行。大多数论文只进行单一的n倍交叉验证。我们批评了这种做法,并建议至少随机执行30次来计算平均精度。
A robust music genre classification approach for global and regional music datasets evaluation
This paper deals with two problems: (1) the selection of a set of music features in order to achieve high genre classification accuracies; (2) the absence of a representative music dataset of regional Brazilian music. In this paper, we propose a set of features to classify genres of music. The features proposed were obtained by a methodical selection of important features used in the literature of Music Information Retrieval (MIR) and Music Emotion Recognition (MER). Besides, we propose a new music dataset called BMD (Brazilian Music Dataset)1, containing 120 songs labeled in 7 musical genres: FoFFó, Rock, Repente, MPB(Música Popular Brasileira — Brazilian Popular Music), Brega, Sertanejo and Disco. An important characteristic of this new dataset compared with others, is the presence of three popular genres in Brazil Northeast region: Repente, Brega and a characteristic genre similar to MPB, which we also call as MPB. We evaluated our proposed features on both datasets: GTZAN and BMD. The proposed approach achieved average accuracy (after 30 runs of 5-fold-cross-validations) of 79.7% for GTZAN and 86.11% for the BMD. Another important contribution of this work is random repetition of cross-validation executions. Most of the papers performs only a single n-fold cross-validation. We criticize that practice and propose, at least, 30 random executions to compute the average accuracy.