Comparison of music genre classification using Nearest Centroid Classifier and k-Nearest Neighbours

Elizabeth Nurmiyati Tamatjita, Aditya W. Mahastama
{"title":"Comparison of music genre classification using Nearest Centroid Classifier and k-Nearest Neighbours","authors":"Elizabeth Nurmiyati Tamatjita, Aditya W. Mahastama","doi":"10.1109/ICIMTECH.2016.7930314","DOIUrl":null,"url":null,"abstract":"Music genre is getting complex from time to time. As the size of digital media grows along with amount of data, manual search of digital audio files according to its genre is considered impractical and inefficient; therefore a classification mechanism is needed to improve searching. Zero Crossing Rate (ZCR), Average Energy (E) and Silent Ratio (SR) are a few of features that can be extracted from digital audio files to classify its genre. This research is conducted to classify music from digital audio (songs) into 12 genres: Ballad, Blues, Classic, Harmony, Hip Hop, Jazz, Keroncong, Latin, Pop, Electronic, Reggae and Rock using above mentioned features, extracted from WAV audio files. Classification is performed several times using selected 3, 6, 9 and 12 genres respectively. The result shows that classification of 3 music genres (Ballad, Blues, Classic) has the highest accuracy (96.67%), followed by 6 genres (Ballad, Blues, Classic, Harmony, Hip Hop, Jazz) with 70%, and 9 genres (Ballad, Blues, Classic, Harmony, Hip Hop, Jazz, Keroncong, Latin, Pop) with 53.33% accuracy. Classification of all 12 music genres yields the lowest accuracy of 33.33%. The test results with the k-Nearest Neighbours algorithm to 120 songs for k = 3 accuracy reaches 22.5%, k = 5 accuracy reaches 22.5%, k = 7 accuracy reaching 26.7% and k = 9 accuracy reaches 26.7 %. Results showed that genre classification by matching the shortest distance through the centre of the class, yields better results than using the k-NN algorithm.","PeriodicalId":327070,"journal":{"name":"2016 International Conference on Information Management and Technology (ICIMTech)","volume":"109 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 International Conference on Information Management and Technology (ICIMTech)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIMTECH.2016.7930314","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 9

Abstract

Music genre is getting complex from time to time. As the size of digital media grows along with amount of data, manual search of digital audio files according to its genre is considered impractical and inefficient; therefore a classification mechanism is needed to improve searching. Zero Crossing Rate (ZCR), Average Energy (E) and Silent Ratio (SR) are a few of features that can be extracted from digital audio files to classify its genre. This research is conducted to classify music from digital audio (songs) into 12 genres: Ballad, Blues, Classic, Harmony, Hip Hop, Jazz, Keroncong, Latin, Pop, Electronic, Reggae and Rock using above mentioned features, extracted from WAV audio files. Classification is performed several times using selected 3, 6, 9 and 12 genres respectively. The result shows that classification of 3 music genres (Ballad, Blues, Classic) has the highest accuracy (96.67%), followed by 6 genres (Ballad, Blues, Classic, Harmony, Hip Hop, Jazz) with 70%, and 9 genres (Ballad, Blues, Classic, Harmony, Hip Hop, Jazz, Keroncong, Latin, Pop) with 53.33% accuracy. Classification of all 12 music genres yields the lowest accuracy of 33.33%. The test results with the k-Nearest Neighbours algorithm to 120 songs for k = 3 accuracy reaches 22.5%, k = 5 accuracy reaches 22.5%, k = 7 accuracy reaching 26.7% and k = 9 accuracy reaches 26.7 %. Results showed that genre classification by matching the shortest distance through the centre of the class, yields better results than using the k-NN algorithm.
最近质心分类器与k近邻音乐类型分类的比较
音乐类型越来越复杂。随着数字媒体的规模和数据量的增长,按类型手动搜索数字音频文件被认为是不切实际和低效的;因此,需要一种分类机制来提高搜索效率。过零率(Zero Crossing Rate, ZCR)、平均能量(Average Energy, E)和无声比(Silent Ratio, SR)是可以从数字音频文件中提取的几个特征,可以用来对其类型进行分类。本研究利用上述特征,从WAV音频文件中提取音乐,将数字音频(歌曲)中的音乐分为12种类型:民谣、布鲁斯、古典、和声、嘻哈、爵士、Keroncong、拉丁、流行、电子、雷鬼和摇滚。分别使用选定的3、6、9和12个类型进行多次分类。结果表明,对3种音乐类型(Ballad、Blues、Classic)的分类准确率最高(96.67%),其次是6种类型(Ballad、Blues、Classic、Harmony、Hip Hop、Jazz),准确率为70%,9种类型(Ballad、Blues、Classic、Harmony、Hip Hop、Jazz、Keroncong、Latin、Pop),准确率为53.33%。对所有12种音乐类型进行分类的准确率最低,为33.33%。k =最近邻算法对120首歌曲的测试结果,k = 3准确率达到22.5%,k = 5准确率达到22.5%,k = 7准确率达到26.7%,k = 9准确率达到26.7%。结果表明,通过类中心匹配最短距离的类型分类比使用k-NN算法的分类效果更好。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信