On Generalist and Domain-Specific Music Classification Models and Their Impacts on Brazilian Music Genre Recognition

Diego Furtado Silva, A. Silva, Luís Felipe Ortolan, R. Marcacini
{"title":"On Generalist and Domain-Specific Music Classification Models and Their Impacts on Brazilian Music Genre Recognition","authors":"Diego Furtado Silva, A. Silva, Luís Felipe Ortolan, R. Marcacini","doi":"10.5753/sbcm.2021.19427","DOIUrl":null,"url":null,"abstract":"Deep learning has become the standard procedure to deal with Music Information Retrieval problems. This category of machine learning algorithms has achieved state-of-the-art results in several tasks, such as classification and auto-tagging. However, obtaining a good-performing model requires a significant amount of data. At the same time, most of the music datasets available lack cultural diversity. Therefore, the performance of the currently most used pre-trained models on underrepresented music genres is unknown. If music models follow the same direction that language models in Natural Language Processing, they should have poorer performance on music styles that are not present in the data used to train them. To verify this assumption, we use a well-known music model designed for auto-tagging in the task of genre recognition. We trained this model from scratch using a large general-domain dataset and two subsets specifying different domains. We empirically show that models trained on specific-domain data perform better than generalist models to classify music in the same domain, even trained with a smaller dataset. This outcome is distinctly observed in the subset that mainly contains Brazilian music, including several usually underrepresented genres.","PeriodicalId":292360,"journal":{"name":"Anais do XVIII Simpósio Brasileiro de Computação Musical (SBCM 2021)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Anais do XVIII Simpósio Brasileiro de Computação Musical (SBCM 2021)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5753/sbcm.2021.19427","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Deep learning has become the standard procedure to deal with Music Information Retrieval problems. This category of machine learning algorithms has achieved state-of-the-art results in several tasks, such as classification and auto-tagging. However, obtaining a good-performing model requires a significant amount of data. At the same time, most of the music datasets available lack cultural diversity. Therefore, the performance of the currently most used pre-trained models on underrepresented music genres is unknown. If music models follow the same direction that language models in Natural Language Processing, they should have poorer performance on music styles that are not present in the data used to train them. To verify this assumption, we use a well-known music model designed for auto-tagging in the task of genre recognition. We trained this model from scratch using a large general-domain dataset and two subsets specifying different domains. We empirically show that models trained on specific-domain data perform better than generalist models to classify music in the same domain, even trained with a smaller dataset. This outcome is distinctly observed in the subset that mainly contains Brazilian music, including several usually underrepresented genres.
通才和特定领域音乐分类模型及其对巴西音乐类型识别的影响
深度学习已经成为处理音乐信息检索问题的标准程序。这类机器学习算法在分类和自动标记等几个任务中取得了最先进的结果。然而,获得一个性能良好的模型需要大量的数据。与此同时,大多数可用的音乐数据集缺乏文化多样性。因此,目前最常用的预训练模型在代表性不足的音乐类型上的表现是未知的。如果音乐模型遵循与自然语言处理中的语言模型相同的方向,那么它们在没有出现在用于训练它们的数据中的音乐风格上的表现应该更差。为了验证这一假设,我们使用了一个著名的音乐模型,用于自动标记类型识别任务。我们使用一个大型通用领域数据集和两个指定不同领域的子集从头开始训练这个模型。我们的经验表明,在特定领域数据上训练的模型比通用模型在同一领域的音乐分类上表现得更好,即使是用更小的数据集训练。这一结果在主要包含巴西音乐的子集中明显观察到,其中包括几个通常未被充分代表的流派。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信