{"title":"A novel similarity measure: Voronoi audio similarity for genre classification","authors":"Prafulla Kalapatapu, N. Tejas, Siddharth Dalmia, Prakhar Gupta, Bhaswant Inguva, Aruna Malapati","doi":"10.1504/IJISTA.2017.10008859","DOIUrl":null,"url":null,"abstract":"One of the major challenges in genre classification, recommender systems is to find similarity between the query song and songs in a database. In this paper, we propose a novel similarity measure called Voronoi audio similarity (VAS). We extracted the Content-based features from the audio signal of the song split in frames over a particular time period and we represented each song as a point in 2D space. The proposed system is a two-level classification process, where songs are first clustered by K-means clustering and then a Voronoi diagram is created using centroids from the resulting K-means, which is called the template Voronoi diagram (TVD). This approach learns the decision boundary used for genre classification. The genre of the song could thus be predicted as the genre with the maximum normalised area overlap. Empirical results performed with 10 cross-fold validations on million song subsets of 500 songs showed 78% accuracy.","PeriodicalId":420808,"journal":{"name":"Int. J. Intell. Syst. Technol. Appl.","volume":"31 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-11-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Int. J. Intell. Syst. Technol. Appl.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1504/IJISTA.2017.10008859","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
One of the major challenges in genre classification, recommender systems is to find similarity between the query song and songs in a database. In this paper, we propose a novel similarity measure called Voronoi audio similarity (VAS). We extracted the Content-based features from the audio signal of the song split in frames over a particular time period and we represented each song as a point in 2D space. The proposed system is a two-level classification process, where songs are first clustered by K-means clustering and then a Voronoi diagram is created using centroids from the resulting K-means, which is called the template Voronoi diagram (TVD). This approach learns the decision boundary used for genre classification. The genre of the song could thus be predicted as the genre with the maximum normalised area overlap. Empirical results performed with 10 cross-fold validations on million song subsets of 500 songs showed 78% accuracy.