On Generalist and Domain-Specific Music Classification Models and Their Impacts on Brazilian Music Genre Recognition

Anais do XVIII Simpósio Brasileiro de Computação Musical (SBCM 2021) Pub Date : 2021-10-24 DOI:10.5753/sbcm.2021.19427

Diego Furtado Silva, A. Silva, Luís Felipe Ortolan, R. Marcacini

{"title":"On Generalist and Domain-Specific Music Classification Models and Their Impacts on Brazilian Music Genre Recognition","authors":"Diego Furtado Silva, A. Silva, Luís Felipe Ortolan, R. Marcacini","doi":"10.5753/sbcm.2021.19427","DOIUrl":null,"url":null,"abstract":"Deep learning has become the standard procedure to deal with Music Information Retrieval problems. This category of machine learning algorithms has achieved state-of-the-art results in several tasks, such as classification and auto-tagging. However, obtaining a good-performing model requires a significant amount of data. At the same time, most of the music datasets available lack cultural diversity. Therefore, the performance of the currently most used pre-trained models on underrepresented music genres is unknown. If music models follow the same direction that language models in Natural Language Processing, they should have poorer performance on music styles that are not present in the data used to train them. To verify this assumption, we use a well-known music model designed for auto-tagging in the task of genre recognition. We trained this model from scratch using a large general-domain dataset and two subsets specifying different domains. We empirically show that models trained on specific-domain data perform better than generalist models to classify music in the same domain, even trained with a smaller dataset. This outcome is distinctly observed in the subset that mainly contains Brazilian music, including several usually underrepresented genres.","PeriodicalId":292360,"journal":{"name":"Anais do XVIII Simpósio Brasileiro de Computação Musical (SBCM 2021)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Anais do XVIII Simpósio Brasileiro de Computação Musical (SBCM 2021)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5753/sbcm.2021.19427","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Deep learning has become the standard procedure to deal with Music Information Retrieval problems. This category of machine learning algorithms has achieved state-of-the-art results in several tasks, such as classification and auto-tagging. However, obtaining a good-performing model requires a significant amount of data. At the same time, most of the music datasets available lack cultural diversity. Therefore, the performance of the currently most used pre-trained models on underrepresented music genres is unknown. If music models follow the same direction that language models in Natural Language Processing, they should have poorer performance on music styles that are not present in the data used to train them. To verify this assumption, we use a well-known music model designed for auto-tagging in the task of genre recognition. We trained this model from scratch using a large general-domain dataset and two subsets specifying different domains. We empirically show that models trained on specific-domain data perform better than generalist models to classify music in the same domain, even trained with a smaller dataset. This outcome is distinctly observed in the subset that mainly contains Brazilian music, including several usually underrepresented genres.

查看原文本刊更多论文

通才和特定领域音乐分类模型及其对巴西音乐类型识别的影响

深度学习已经成为处理音乐信息检索问题的标准程序。这类机器学习算法在分类和自动标记等几个任务中取得了最先进的结果。然而，获得一个性能良好的模型需要大量的数据。与此同时，大多数可用的音乐数据集缺乏文化多样性。因此，目前最常用的预训练模型在代表性不足的音乐类型上的表现是未知的。如果音乐模型遵循与自然语言处理中的语言模型相同的方向，那么它们在没有出现在用于训练它们的数据中的音乐风格上的表现应该更差。为了验证这一假设，我们使用了一个著名的音乐模型，用于自动标记类型识别任务。我们使用一个大型通用领域数据集和两个指定不同领域的子集从头开始训练这个模型。我们的经验表明，在特定领域数据上训练的模型比通用模型在同一领域的音乐分类上表现得更好，即使是用更小的数据集训练。这一结果在主要包含巴西音乐的子集中明显观察到，其中包括几个通常未被充分代表的流派。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Anais do XVIII Simpósio Brasileiro de Computação Musical (SBCM 2021)

自引率

0.00%

发文量