Auto-tagging system based on song’s latent representations for inferring contextual user information

Á. L. Murciego, Diego M. Jiménez-Bravo, André Sales Mendes, V. F. L. Batista, M. M. García
{"title":"Auto-tagging system based on song’s latent representations for inferring contextual user information","authors":"Á. L. Murciego, Diego M. Jiménez-Bravo, André Sales Mendes, V. F. L. Batista, M. M. García","doi":"10.1109/WI-IAT55865.2022.00040","DOIUrl":null,"url":null,"abstract":"Currently in the field of Recommender Systems for the music domain, there is active research about approaches for inferring the user context. Moreover, in the Music Information Retrieval there have been great advances in the generation of latent representations of songs including approaches such as contrastive learning as pretrain strategy or other approaches related to Natural Language Modeling like codified audio language modeling (CALM). Such advances are especially useful for Music Information Retrieval discriminative tasks such as genre classification, key detection, emotion recognition and music tagging. This last task attracts the interest of music streaming services that seek to tag their catalogs, especially with tags related to the user's context as this has a great impact on their tastes and influences the developed recommender systems. These tags are usually provided by users on social networks and are frequently found only for popular songs in the catalog. However, recently added songs to the catalog or songs belonging to the long tail do not have these tags and the need to create new systems called auto-taggers capable of tagging these songs arises. This paper proposes an auto-tagging system and presents an evaluation of different multi-label classification approaches included in it for contextual label auto-tagging. These approaches use different latent representations of songs, employing a recent published dataset with user context tags. The results obtained from the case study conducted to evaluate the proposed system show a clear improvement in the classification metrics by using new latent representations compared to the use of simpler features in traditional state-of-the-art approaches.","PeriodicalId":345445,"journal":{"name":"2022 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WI-IAT55865.2022.00040","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Currently in the field of Recommender Systems for the music domain, there is active research about approaches for inferring the user context. Moreover, in the Music Information Retrieval there have been great advances in the generation of latent representations of songs including approaches such as contrastive learning as pretrain strategy or other approaches related to Natural Language Modeling like codified audio language modeling (CALM). Such advances are especially useful for Music Information Retrieval discriminative tasks such as genre classification, key detection, emotion recognition and music tagging. This last task attracts the interest of music streaming services that seek to tag their catalogs, especially with tags related to the user's context as this has a great impact on their tastes and influences the developed recommender systems. These tags are usually provided by users on social networks and are frequently found only for popular songs in the catalog. However, recently added songs to the catalog or songs belonging to the long tail do not have these tags and the need to create new systems called auto-taggers capable of tagging these songs arises. This paper proposes an auto-tagging system and presents an evaluation of different multi-label classification approaches included in it for contextual label auto-tagging. These approaches use different latent representations of songs, employing a recent published dataset with user context tags. The results obtained from the case study conducted to evaluate the proposed system show a clear improvement in the classification metrics by using new latent representations compared to the use of simpler features in traditional state-of-the-art approaches.
基于歌曲潜在表示的自动标注系统,用于推断上下文用户信息
目前在音乐领域的推荐系统中,对推断用户上下文的方法进行了积极的研究。此外,在音乐信息检索中,歌曲的潜在表征的生成已经取得了很大的进展,包括方法,如作为预训练策略的对比学习或其他与自然语言建模相关的方法,如编码音频语言建模(CALM)。这些进步对于音乐信息检索的鉴别任务尤其有用,例如类型分类、键检测、情感识别和音乐标记。最后一项任务吸引了音乐流媒体服务的兴趣,这些服务寻求标记他们的目录,特别是与用户上下文相关的标签,因为这对他们的品味有很大的影响,并影响了已开发的推荐系统。这些标签通常是由社交网络上的用户提供的,通常只适用于目录中的流行歌曲。然而,最近添加到目录中的歌曲或属于长尾的歌曲没有这些标签,因此需要创建能够标记这些歌曲的称为自动标签的新系统。本文提出了一个自动标注系统,并对其中包含的不同多标签分类方法进行了评价。这些方法使用不同的歌曲潜在表示,使用最近发布的带有用户上下文标签的数据集。案例研究的结果表明,与传统方法中使用更简单的特征相比,使用新的潜在表征在分类指标方面有明显的改进。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信