Genre classification of symbolic music with SMBGT

Alexios Kotsifakos, Evangelos E. Kotsifakos, P. Papapetrou, V. Athitsos
{"title":"Genre classification of symbolic music with SMBGT","authors":"Alexios Kotsifakos, Evangelos E. Kotsifakos, P. Papapetrou, V. Athitsos","doi":"10.1145/2504335.2504382","DOIUrl":null,"url":null,"abstract":"Automatic music genre classification is a task that has attracted the interest of the music community for more than two decades. Music can be of high importance within the area of assistive technologies as it can be seen as an assistive technology with high therapeutic and educational functionality for children and adults with disabilities. Several similarity methods and machine learning techniques have been applied in the literature to deal with music genre classification, and as a result data mining and Music Information Retrieval (MIR) are strongly interconnected. In this paper, we deal with music genre classification for symbolic music, and specifically MIDI, by combining the recently proposed novel similarity measure for sequences, SMBGT, with the k-Nearest Neighbor (k-NN) classifier. For all MIDI songs we first extract all of their channels and then transform each channel into a sequence of 2D points, providing information for pitch and duration of their music notes. The similarity between two songs is found by computing the SMBGT for all pairs of the songs' channels and getting the maximum pairwise channel score as their similarity. Each song is treated as a query to which k-NN is applied, and the returned genre of the classifier is the one with the majority of votes in the k neighbors. Classification accuracy results indicate that there is room for improvement, especially due to the ambiguous definitions of music genres that make it hard to clearly discriminate them. Using this framework can also help us analyze and understand potential disadvantages of SMBGT, and thus identify how it can be improved when used for classification of real-time sequences.","PeriodicalId":91811,"journal":{"name":"The ... International Conference on PErvasive Technologies Related to Assistive Environments : PETRA ... International Conference on PErvasive Technologies Related to Assistive Environments","volume":"364 1","pages":"44:1-44:7"},"PeriodicalIF":0.0000,"publicationDate":"2013-05-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"16","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"The ... International Conference on PErvasive Technologies Related to Assistive Environments : PETRA ... International Conference on PErvasive Technologies Related to Assistive Environments","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2504335.2504382","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 16

Abstract

Automatic music genre classification is a task that has attracted the interest of the music community for more than two decades. Music can be of high importance within the area of assistive technologies as it can be seen as an assistive technology with high therapeutic and educational functionality for children and adults with disabilities. Several similarity methods and machine learning techniques have been applied in the literature to deal with music genre classification, and as a result data mining and Music Information Retrieval (MIR) are strongly interconnected. In this paper, we deal with music genre classification for symbolic music, and specifically MIDI, by combining the recently proposed novel similarity measure for sequences, SMBGT, with the k-Nearest Neighbor (k-NN) classifier. For all MIDI songs we first extract all of their channels and then transform each channel into a sequence of 2D points, providing information for pitch and duration of their music notes. The similarity between two songs is found by computing the SMBGT for all pairs of the songs' channels and getting the maximum pairwise channel score as their similarity. Each song is treated as a query to which k-NN is applied, and the returned genre of the classifier is the one with the majority of votes in the k neighbors. Classification accuracy results indicate that there is room for improvement, especially due to the ambiguous definitions of music genres that make it hard to clearly discriminate them. Using this framework can also help us analyze and understand potential disadvantages of SMBGT, and thus identify how it can be improved when used for classification of real-time sequences.
用SMBGT进行符号音乐的类型分类
自动音乐类型分类是二十多年来吸引音乐界兴趣的一项任务。音乐在辅助技术领域非常重要,因为它可以被视为一种辅助技术,对残疾儿童和成人具有很高的治疗和教育功能。一些相似度方法和机器学习技术已经在文献中被应用于音乐类型分类,因此数据挖掘和音乐信息检索(MIR)是紧密联系在一起的。在本文中,我们通过结合最近提出的新的序列相似性度量SMBGT和k-最近邻(k-NN)分类器来处理符号音乐,特别是MIDI的音乐类型分类。对于所有MIDI歌曲,我们首先提取所有的通道,然后将每个通道转换为2D点的序列,提供音高和音乐音符持续时间的信息。两首歌曲之间的相似度是通过计算所有歌曲通道对的SMBGT并获得最大成对通道分数作为它们的相似度来发现的。每首歌都被视为应用k- nn的查询,分类器返回的类型是在k个邻居中获得多数投票的类型。分类精度结果表明,还有改进的空间,特别是由于音乐类型的定义模糊,使得很难清楚地区分它们。使用该框架还可以帮助我们分析和理解SMBGT的潜在缺点,从而确定如何在用于实时序列分类时对其进行改进。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信