The impact of playlist characteristics on coherence in user-curated music playlists.

IF 2.5 2区计算机科学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS

EPJ Data Science Pub Date : 2025-01-01 Epub Date: 2025-03-19 DOI:10.1140/epjds/s13688-025-00531-3

Harald Schweiger, Emilia Parada-Cabaleiro, Markus Schedl

{"title":"The impact of playlist characteristics on coherence in user-curated music playlists.","authors":"Harald Schweiger, Emilia Parada-Cabaleiro, Markus Schedl","doi":"10.1140/epjds/s13688-025-00531-3","DOIUrl":null,"url":null,"abstract":"Music playlist creation is a crucial, yet not fully explored task in music data mining and music information retrieval. Previous studies have largely focused on investigating diversity, popularity, and serendipity of tracks in human- or machine-generated playlists. However, the concept of playlist coherence - vaguely defined as smooth transitions between tracks - remains poorly understood and even lacks a standardized definition. In this paper, we provide a formal definition for measuring playlist coherence based on the sequential ordering of tracks, offering a more interpretable measurement compared to existing literature, and allowing for comparisons between playlists with different musical styles. The presented formal framework to measure coherence is applied to analyze a substantial dataset of user-generated playlists, examining how various playlist characteristics influence coherence. We identified four key attributes: playlist length, number of edits, track popularity, and collaborative playlist curation as potential influencing factors. Using correlation and causal inference models, the impact of these attributes on coherence across ten auditory and one metadata feature are assessed. Our findings indicate that these attributes influence playlist coherence to varying extents. Longer playlists tend to exhibit higher coherence, whereas playlists dominated by popular tracks or those extensively modified by users show reduced coherence. In contrast, collaborative playlist curation yielded mixed results. The insights from this study have practical implications for enhancing recommendation tasks, such as automatic playlist generation and continuation, beyond traditional accuracy metrics. As a demonstration of these findings, we propose a simple greedy algorithm that reorganizes playlists to align coherence with observed trends.Supplementary information: The online version contains supplementary material available at 10.1140/epjds/s13688-025-00531-3.","PeriodicalId":11887,"journal":{"name":"EPJ Data Science","volume":"14 1","pages":"24"},"PeriodicalIF":2.5000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11923031/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"EPJ Data Science","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1140/epjds/s13688-025-00531-3","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/3/19 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"MATHEMATICS, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}

引用次数: 0

Abstract

Music playlist creation is a crucial, yet not fully explored task in music data mining and music information retrieval. Previous studies have largely focused on investigating diversity, popularity, and serendipity of tracks in human- or machine-generated playlists. However, the concept of playlist coherence - vaguely defined as smooth transitions between tracks - remains poorly understood and even lacks a standardized definition. In this paper, we provide a formal definition for measuring playlist coherence based on the sequential ordering of tracks, offering a more interpretable measurement compared to existing literature, and allowing for comparisons between playlists with different musical styles. The presented formal framework to measure coherence is applied to analyze a substantial dataset of user-generated playlists, examining how various playlist characteristics influence coherence. We identified four key attributes: playlist length, number of edits, track popularity, and collaborative playlist curation as potential influencing factors. Using correlation and causal inference models, the impact of these attributes on coherence across ten auditory and one metadata feature are assessed. Our findings indicate that these attributes influence playlist coherence to varying extents. Longer playlists tend to exhibit higher coherence, whereas playlists dominated by popular tracks or those extensively modified by users show reduced coherence. In contrast, collaborative playlist curation yielded mixed results. The insights from this study have practical implications for enhancing recommendation tasks, such as automatic playlist generation and continuation, beyond traditional accuracy metrics. As a demonstration of these findings, we propose a simple greedy algorithm that reorganizes playlists to align coherence with observed trends.

Supplementary information: The online version contains supplementary material available at 10.1140/epjds/s13688-025-00531-3.

Abstract Image

查看原文本刊更多论文

用户策划音乐播放列表中播放列表特征对连贯性的影响。

在音乐数据挖掘和音乐信息检索中，音乐播放列表的创建是一个非常重要但尚未得到充分研究的任务。以前的研究主要集中在调查人类或机器生成的播放列表中曲目的多样性、受欢迎程度和偶然性。然而，播放列表一致性的概念——模糊地定义为音轨之间的平滑过渡——仍然缺乏理解，甚至缺乏标准化的定义。在本文中，我们提供了一个基于音轨顺序测量播放列表一致性的正式定义，与现有文献相比，提供了一个更可解释的测量方法，并允许在不同音乐风格的播放列表之间进行比较。所提出的测量连贯性的正式框架被应用于分析用户生成的播放列表的大量数据集，检查各种播放列表特征如何影响连贯性。我们确定了四个关键属性：播放列表长度、编辑次数、曲目受欢迎程度和协作播放列表管理作为潜在的影响因素。使用相关性和因果推理模型，评估了这些属性对十个听觉特征和一个元数据特征的一致性的影响。我们的研究结果表明，这些属性在不同程度上影响了播放列表的连贯性。较长的播放列表往往表现出更高的连贯性，而由流行歌曲或用户广泛修改的播放列表则表现出较低的连贯性。相比之下，协作式播放列表管理产生了好坏参半的结果。本研究的见解对增强推荐任务具有实际意义，例如自动播放列表生成和延续，超出了传统的准确性指标。为了证明这些发现，我们提出了一个简单的贪婪算法，该算法可以重组播放列表，使其与观察到的趋势保持一致。补充信息：在线版本包含补充资料，可在10.1140/epjds/s13688-025-00531-3获得。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

EPJ Data Science MATHEMATICS, INTERDISCIPLINARY APPLICATIONS -

CiteScore

6.10

自引率

5.60%

发文量

审稿时长

13 weeks

期刊介绍： EPJ Data Science covers a broad range of research areas and applications and particularly encourages contributions from techno-socio-economic systems, where it comprises those research lines that now regard the digital “tracks” of human beings as first-order objects for scientific investigation. Topics include, but are not limited to, human behavior, social interaction (including animal societies), economic and financial systems, management and business networks, socio-technical infrastructure, health and environmental systems, the science of science, as well as general risk and crisis scenario forecasting up to and including policy advice.