{"title":"Hybrid Method for Automatic Music Labeling","authors":"Irapuru Florido, R. T. Raittz","doi":"10.1109/CLEI.2018.00038","DOIUrl":null,"url":null,"abstract":"Automatic music labeling on large-scale bases is a premise to provide systems of music recommendation, importante subject in digital world, demanding countless research in music information retrieval field. Although there are large-scale musical bases, such as Million Song Dataset (MSD), that have lowlevel label signal audio, descendant from audio signal, they are weakly labelled, that is, songs labels may be incomplete at a high level regarding emotion, vocals and instrument. This work aims to present the Music Label Miner (MLM), a hybrid method based on grouping, genetic algorithm and statistical correlation, which generates mappings and ossible inferences of high-level labels based on audio signal, through the relationship of a Large-scale base, MSD, with a lower-dimensional Ground Truth reference base. By applying the proposed method, it will be possible to label songs automatically, which contain only low-level labels and, by the models generated from the method, reach high-level labels. The method is composed by: (i) selection and preprocessing of MSD high and low level data, (ii) reference data set called CAL500exp, (iii) MSD data grouping, (iv) CAL500exp vectorization, (v) relationship of vectorized and grouping datasets, (vi) statistical correlation, (vii) mapping, (viii) visualization of selected data characteristics, (ix) generation of models and (x) inference of high and low-level labels.","PeriodicalId":379986,"journal":{"name":"2018 XLIV Latin American Computer Conference (CLEI)","volume":"24 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 XLIV Latin American Computer Conference (CLEI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CLEI.2018.00038","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Automatic music labeling on large-scale bases is a premise to provide systems of music recommendation, importante subject in digital world, demanding countless research in music information retrieval field. Although there are large-scale musical bases, such as Million Song Dataset (MSD), that have lowlevel label signal audio, descendant from audio signal, they are weakly labelled, that is, songs labels may be incomplete at a high level regarding emotion, vocals and instrument. This work aims to present the Music Label Miner (MLM), a hybrid method based on grouping, genetic algorithm and statistical correlation, which generates mappings and ossible inferences of high-level labels based on audio signal, through the relationship of a Large-scale base, MSD, with a lower-dimensional Ground Truth reference base. By applying the proposed method, it will be possible to label songs automatically, which contain only low-level labels and, by the models generated from the method, reach high-level labels. The method is composed by: (i) selection and preprocessing of MSD high and low level data, (ii) reference data set called CAL500exp, (iii) MSD data grouping, (iv) CAL500exp vectorization, (v) relationship of vectorized and grouping datasets, (vi) statistical correlation, (vii) mapping, (viii) visualization of selected data characteristics, (ix) generation of models and (x) inference of high and low-level labels.