HSP-TL:利用歌词和音频特征进行热门歌曲预测的三连音损失深度度量学习模型

Petros Vavaroutsos, Pantelis Vikatos
{"title":"HSP-TL:利用歌词和音频特征进行热门歌曲预测的三连音损失深度度量学习模型","authors":"Petros Vavaroutsos,&nbsp;Pantelis Vikatos","doi":"10.1016/j.sctalk.2024.100363","DOIUrl":null,"url":null,"abstract":"<div><p>The music industry is interested in the future success of a song and its presence in popular rankings such as the Billboard charts. However, a song's popularity might be impacted by variables such as music trends and social influences, which are indifferent to audio signals. In this paper, we present HSP-TL, a deep learning model, to identify likely hit songs. Our work combines temporal information and features derived from audio and lyrics to estimate the success of a recording. We adopt the concept of the triplet loss function to minimize the distance between objects with similar popularity. Also, we use convolutional neural networks on 2-D low-level audio features, contrary to the current approach. We use pre-trained language models for text-based feature extraction. Our method is evaluated on the Hit Song Prediction Dataset, which we enrich with the lyrics of each song. Our results show that the inclusion of lyrics improves song uniqueness and reflects musical trends. The proposed model outperforms the current approach by up to 8%.</p></div>","PeriodicalId":101148,"journal":{"name":"Science Talks","volume":"10 ","pages":"Article 100363"},"PeriodicalIF":0.0000,"publicationDate":"2024-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2772569324000719/pdfft?md5=1eb70505cb6b3f3d64cd43cf36fa1199&pid=1-s2.0-S2772569324000719-main.pdf","citationCount":"0","resultStr":"{\"title\":\"HSP-TL: A deep metric learning model with triplet loss for hit song prediction using lyrics and audio features\",\"authors\":\"Petros Vavaroutsos,&nbsp;Pantelis Vikatos\",\"doi\":\"10.1016/j.sctalk.2024.100363\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>The music industry is interested in the future success of a song and its presence in popular rankings such as the Billboard charts. However, a song's popularity might be impacted by variables such as music trends and social influences, which are indifferent to audio signals. In this paper, we present HSP-TL, a deep learning model, to identify likely hit songs. Our work combines temporal information and features derived from audio and lyrics to estimate the success of a recording. We adopt the concept of the triplet loss function to minimize the distance between objects with similar popularity. Also, we use convolutional neural networks on 2-D low-level audio features, contrary to the current approach. We use pre-trained language models for text-based feature extraction. Our method is evaluated on the Hit Song Prediction Dataset, which we enrich with the lyrics of each song. Our results show that the inclusion of lyrics improves song uniqueness and reflects musical trends. The proposed model outperforms the current approach by up to 8%.</p></div>\",\"PeriodicalId\":101148,\"journal\":{\"name\":\"Science Talks\",\"volume\":\"10 \",\"pages\":\"Article 100363\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-05-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S2772569324000719/pdfft?md5=1eb70505cb6b3f3d64cd43cf36fa1199&pid=1-s2.0-S2772569324000719-main.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Science Talks\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2772569324000719\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Science Talks","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2772569324000719","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

音乐行业关注的是一首歌曲未来的成功及其在流行排行榜(如公告牌排行榜)上的排名。然而,一首歌曲的受欢迎程度可能会受到音乐趋势和社会影响等变量的影响,而这些变量对音频信号并不关心。在本文中,我们提出了一种深度学习模型 HSP-TL,用于识别可能的热门歌曲。我们的工作结合了从音频和歌词中提取的时间信息和特征,以估算唱片的成功率。我们采用了三重损失函数的概念,以最小化具有相似流行度的对象之间的距离。此外,与目前的方法相反,我们在二维低级音频特征上使用卷积神经网络。我们使用预先训练好的语言模型进行基于文本的特征提取。我们在热门歌曲预测数据集上对我们的方法进行了评估。结果表明,歌词的加入提高了歌曲的独特性并反映了音乐趋势。所提出的模型比目前的方法优越多达 8%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
HSP-TL: A deep metric learning model with triplet loss for hit song prediction using lyrics and audio features

The music industry is interested in the future success of a song and its presence in popular rankings such as the Billboard charts. However, a song's popularity might be impacted by variables such as music trends and social influences, which are indifferent to audio signals. In this paper, we present HSP-TL, a deep learning model, to identify likely hit songs. Our work combines temporal information and features derived from audio and lyrics to estimate the success of a recording. We adopt the concept of the triplet loss function to minimize the distance between objects with similar popularity. Also, we use convolutional neural networks on 2-D low-level audio features, contrary to the current approach. We use pre-trained language models for text-based feature extraction. Our method is evaluated on the Hit Song Prediction Dataset, which we enrich with the lyrics of each song. Our results show that the inclusion of lyrics improves song uniqueness and reflects musical trends. The proposed model outperforms the current approach by up to 8%.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信