Music genre classification with word and document vectors
Authors: Onder Coban, Isil Karabey
Published in: 2017 25th Signal Processing and Communications Applications Conference (SIU), 2017-05-15
DOI: 10.1109/SIU.2017.7960145 (https://doi.org/10.1109/SIU.2017.7960145)
Music genre classification (MGC) is currently a popular research field. The main goal of MGC studies is to detect the genre of a piece of music (e.g., rap, rock) automatically. In the literature, features for this task are generally extracted from either the melodic content or the lyrics. In this study, we perform lyrics-based MGC on a Turkish dataset. We use lyrics as the only feature source and treat MGC as a classical text classification problem. We represent the features with word (word2vec) and document (doc2vec) vector methods, which have recently become popular, and compare them with the traditional Bag of Words (BoW) feature model. We also investigate the impact of preprocessing steps and vector dimension on both word and document vectors. All experiments use the Support Vector Machine algorithm. Our experimental results show that word vectors can be employed for feature representation.
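The contrast between the BoW model and word-vector features can be illustrated with a small sketch. It is not the authors' pipeline. The abstract does not say how word vectors are combined into a song-level feature, so averaging is an assumption here, and the two-dimensional embeddings, the English toy words, and the function names are all hypothetical stand-ins for a trained word2vec model.

```python
from collections import Counter


def bow_vector(tokens, vocabulary):
    """Bag of Words: one count per vocabulary term, in vocabulary order."""
    counts = Counter(tokens)
    return [counts[term] for term in vocabulary]


def mean_word_vector(tokens, embeddings, dim):
    """Average the embeddings of the tokens that have one (assumed pooling)."""
    known = [embeddings[t] for t in tokens if t in embeddings]
    if not known:
        # No known token: fall back to a zero vector of the right size.
        return [0.0] * dim
    return [sum(vec[i] for vec in known) / len(known) for i in range(dim)]


# Hypothetical hand-written embeddings, for illustration only.
# A real setup would load vectors learned by word2vec from the lyrics corpus.
TOY_EMBEDDINGS = {
    "street": [1.0, 0.0],
    "beat": [0.8, 0.2],
    "guitar": [0.0, 1.0],
    "loud": [0.2, 0.8],
}

lyrics = "street beat beat unknownword".split()
vocabulary = sorted(TOY_EMBEDDINGS)  # beat, guitar, loud, street

bow = bow_vector(lyrics, vocabulary)                     # length = vocabulary size
dense = mean_word_vector(lyrics, TOY_EMBEDDINGS, dim=2)  # length = vector dimension

print(bow)
print(dense)
```

The BoW vector grows with the vocabulary, while the averaged vector keeps a fixed length set by the chosen dimension, which is the parameter whose impact the study investigates. Either representation could then be passed to an SVM classifier.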