时变热门歌曲偏好建模的机器学习方法

Dionisios Nikas, Dionisios N. Sotiropoulos
{"title":"时变热门歌曲偏好建模的机器学习方法","authors":"Dionisios Nikas, Dionisios N. Sotiropoulos","doi":"10.1109/IISA56318.2022.9904376","DOIUrl":null,"url":null,"abstract":"The music industry is investing each year huge amounts of money to artists and their songs with the ultimate goal of becoming a hit song. Since its creation back in 1958, the Billboard Weekly Hot 100 chart is one of the most iconic and reliable sources of hit songs. Using Spotify’s Web API and Genius API and their massive collection of songs we gathered all the high-level audio features, lyrics and some temporal features for all the songs that made it to the Hot 100 Chart in the period 1958-2020. Using these features, we will perform an analysis on the time varying preferences on what is considered a hit song using One-Class-SVM and conclude that most of the hit songs are very similar based on their high-level audio features and lyric word-embeddings. Then, to support our results and hypothesis even more, we will try to build a multi-class classifier using algorithms such as Random Forest, KNN, Logistic regression and Support Vector Machines (SVC with RBF kernel) to predict the position/popularity of a hit song on the billboard chart. Finally, we will address our thoughts on why these features may or may not be enough to build a hit song classifier and discuss future work for a better approach to this problem.","PeriodicalId":217519,"journal":{"name":"2022 13th International Conference on Information, Intelligence, Systems & Applications (IISA)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Machine Learning Approach for Modeling Time-Varying Hit Song Preferences\",\"authors\":\"Dionisios Nikas, Dionisios N. Sotiropoulos\",\"doi\":\"10.1109/IISA56318.2022.9904376\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The music industry is investing each year huge amounts of money to artists and their songs with the ultimate goal of becoming a hit song. Since its creation back in 1958, the Billboard Weekly Hot 100 chart is one of the most iconic and reliable sources of hit songs. Using Spotify’s Web API and Genius API and their massive collection of songs we gathered all the high-level audio features, lyrics and some temporal features for all the songs that made it to the Hot 100 Chart in the period 1958-2020. Using these features, we will perform an analysis on the time varying preferences on what is considered a hit song using One-Class-SVM and conclude that most of the hit songs are very similar based on their high-level audio features and lyric word-embeddings. Then, to support our results and hypothesis even more, we will try to build a multi-class classifier using algorithms such as Random Forest, KNN, Logistic regression and Support Vector Machines (SVC with RBF kernel) to predict the position/popularity of a hit song on the billboard chart. Finally, we will address our thoughts on why these features may or may not be enough to build a hit song classifier and discuss future work for a better approach to this problem.\",\"PeriodicalId\":217519,\"journal\":{\"name\":\"2022 13th International Conference on Information, Intelligence, Systems & Applications (IISA)\",\"volume\":\"5 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-07-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 13th International Conference on Information, Intelligence, Systems & Applications (IISA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IISA56318.2022.9904376\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 13th International Conference on Information, Intelligence, Systems & Applications (IISA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IISA56318.2022.9904376","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

音乐产业每年都在艺术家和他们的歌曲上投入巨额资金,最终目标是成为热门歌曲。自1958年创立以来,公告牌每周百强单曲榜一直是最具代表性和最可靠的热门歌曲来源之一。使用Spotify的Web API和Genius API及其大量的歌曲集合,我们收集了1958-2020年期间进入Hot 100排行榜的所有歌曲的所有高级音频功能,歌词和一些时间功能。使用这些特征,我们将使用一类支持向量机对被认为是热门歌曲的时间变化偏好进行分析,并得出结论,基于它们的高级音频特征和抒情词嵌入,大多数热门歌曲非常相似。然后,为了进一步支持我们的结果和假设,我们将尝试使用随机森林、KNN、逻辑回归和支持向量机(带有RBF内核的SVC)等算法构建一个多类分类器,以预测热门歌曲在公告牌排行榜上的位置/流行程度。最后,我们将阐述我们的想法,为什么这些特征可能或可能不足以构建热门歌曲分类器,并讨论未来的工作,以更好地解决这个问题。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A Machine Learning Approach for Modeling Time-Varying Hit Song Preferences
The music industry is investing each year huge amounts of money to artists and their songs with the ultimate goal of becoming a hit song. Since its creation back in 1958, the Billboard Weekly Hot 100 chart is one of the most iconic and reliable sources of hit songs. Using Spotify’s Web API and Genius API and their massive collection of songs we gathered all the high-level audio features, lyrics and some temporal features for all the songs that made it to the Hot 100 Chart in the period 1958-2020. Using these features, we will perform an analysis on the time varying preferences on what is considered a hit song using One-Class-SVM and conclude that most of the hit songs are very similar based on their high-level audio features and lyric word-embeddings. Then, to support our results and hypothesis even more, we will try to build a multi-class classifier using algorithms such as Random Forest, KNN, Logistic regression and Support Vector Machines (SVC with RBF kernel) to predict the position/popularity of a hit song on the billboard chart. Finally, we will address our thoughts on why these features may or may not be enough to build a hit song classifier and discuss future work for a better approach to this problem.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信