Auto-tagging system based on song’s latent representations for inferring contextual user information

2022 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT) Pub Date : 2022-11-01 DOI:10.1109/WI-IAT55865.2022.00040

Á. L. Murciego, Diego M. Jiménez-Bravo, André Sales Mendes, V. F. L. Batista, M. M. García

{"title":"Auto-tagging system based on song’s latent representations for inferring contextual user information","authors":"Á. L. Murciego, Diego M. Jiménez-Bravo, André Sales Mendes, V. F. L. Batista, M. M. García","doi":"10.1109/WI-IAT55865.2022.00040","DOIUrl":null,"url":null,"abstract":"Currently in the field of Recommender Systems for the music domain, there is active research about approaches for inferring the user context. Moreover, in the Music Information Retrieval there have been great advances in the generation of latent representations of songs including approaches such as contrastive learning as pretrain strategy or other approaches related to Natural Language Modeling like codified audio language modeling (CALM). Such advances are especially useful for Music Information Retrieval discriminative tasks such as genre classification, key detection, emotion recognition and music tagging. This last task attracts the interest of music streaming services that seek to tag their catalogs, especially with tags related to the user's context as this has a great impact on their tastes and influences the developed recommender systems. These tags are usually provided by users on social networks and are frequently found only for popular songs in the catalog. However, recently added songs to the catalog or songs belonging to the long tail do not have these tags and the need to create new systems called auto-taggers capable of tagging these songs arises. This paper proposes an auto-tagging system and presents an evaluation of different multi-label classification approaches included in it for contextual label auto-tagging. These approaches use different latent representations of songs, employing a recent published dataset with user context tags. The results obtained from the case study conducted to evaluate the proposed system show a clear improvement in the classification metrics by using new latent representations compared to the use of simpler features in traditional state-of-the-art approaches.","PeriodicalId":345445,"journal":{"name":"2022 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WI-IAT55865.2022.00040","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Currently in the field of Recommender Systems for the music domain, there is active research about approaches for inferring the user context. Moreover, in the Music Information Retrieval there have been great advances in the generation of latent representations of songs including approaches such as contrastive learning as pretrain strategy or other approaches related to Natural Language Modeling like codified audio language modeling (CALM). Such advances are especially useful for Music Information Retrieval discriminative tasks such as genre classification, key detection, emotion recognition and music tagging. This last task attracts the interest of music streaming services that seek to tag their catalogs, especially with tags related to the user's context as this has a great impact on their tastes and influences the developed recommender systems. These tags are usually provided by users on social networks and are frequently found only for popular songs in the catalog. However, recently added songs to the catalog or songs belonging to the long tail do not have these tags and the need to create new systems called auto-taggers capable of tagging these songs arises. This paper proposes an auto-tagging system and presents an evaluation of different multi-label classification approaches included in it for contextual label auto-tagging. These approaches use different latent representations of songs, employing a recent published dataset with user context tags. The results obtained from the case study conducted to evaluate the proposed system show a clear improvement in the classification metrics by using new latent representations compared to the use of simpler features in traditional state-of-the-art approaches.

查看原文本刊更多论文

基于歌曲潜在表示的自动标注系统，用于推断上下文用户信息

目前在音乐领域的推荐系统中，对推断用户上下文的方法进行了积极的研究。此外，在音乐信息检索中，歌曲的潜在表征的生成已经取得了很大的进展，包括方法，如作为预训练策略的对比学习或其他与自然语言建模相关的方法，如编码音频语言建模(CALM)。这些进步对于音乐信息检索的鉴别任务尤其有用，例如类型分类、键检测、情感识别和音乐标记。最后一项任务吸引了音乐流媒体服务的兴趣，这些服务寻求标记他们的目录，特别是与用户上下文相关的标签，因为这对他们的品味有很大的影响，并影响了已开发的推荐系统。这些标签通常是由社交网络上的用户提供的，通常只适用于目录中的流行歌曲。然而，最近添加到目录中的歌曲或属于长尾的歌曲没有这些标签，因此需要创建能够标记这些歌曲的称为自动标签的新系统。本文提出了一个自动标注系统，并对其中包含的不同多标签分类方法进行了评价。这些方法使用不同的歌曲潜在表示，使用最近发布的带有用户上下文标签的数据集。案例研究的结果表明，与传统方法中使用更简单的特征相比，使用新的潜在表征在分类指标方面有明显的改进。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2022 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT)

自引率

0.00%

发文量