Rhythmic qualities of jazz improvisation predict performer identity and style in source-separated audio recordings.

IF 2.9 3区综合性期刊 Q1 MULTIDISCIPLINARY SCIENCES

Royal Society Open Science Pub Date : 2024-11-13 eCollection Date: 2024-11-01 DOI:10.1098/rsos.240920

Huw Cheston, Joshua L Schlichting, Ian Cross, Peter M C Harrison

{"title":"Rhythmic qualities of jazz improvisation predict performer identity and style in source-separated audio recordings.","authors":"Huw Cheston, Joshua L Schlichting, Ian Cross, Peter M C Harrison","doi":"10.1098/rsos.240920","DOIUrl":null,"url":null,"abstract":"<p><p>Great musicians have a unique style and, with training, humans can learn to distinguish between these styles. What differences between performers enable us to make such judgements? We investigate this question by building a machine learning model that predicts performer identity from data extracted automatically from an audio recording. Such a model could be trained on all kinds of musical features, but here we focus specifically on rhythm, which (unlike harmony, melody and timbre) is relevant for any musical instrument. We demonstrate that a supervised learning model trained solely on rhythmic features extracted from 300 recordings of 10 jazz pianists correctly identified the performer in 59% of cases, six times better than chance. The most important features related to a performer's 'feel' (ensemble synchronization) and 'complexity' (information density). Further analysis revealed two clusters of performers, with those in the same cluster sharing similar rhythmic traits, and that the rhythmic style of each musician changed relatively little over the duration of their career. Our findings highlight the possibility that artificial intelligence can perform performer identification tasks normally reserved for experts. Links to each recording and the corresponding predictions are available on an interactive map to support future work in stylometry.</p>","PeriodicalId":21525,"journal":{"name":"Royal Society Open Science","volume":"11 11","pages":"240920"},"PeriodicalIF":2.9000,"publicationDate":"2024-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11557239/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Royal Society Open Science","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1098/rsos.240920","RegionNum":3,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/11/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}

引用次数: 0

Abstract

Great musicians have a unique style and, with training, humans can learn to distinguish between these styles. What differences between performers enable us to make such judgements? We investigate this question by building a machine learning model that predicts performer identity from data extracted automatically from an audio recording. Such a model could be trained on all kinds of musical features, but here we focus specifically on rhythm, which (unlike harmony, melody and timbre) is relevant for any musical instrument. We demonstrate that a supervised learning model trained solely on rhythmic features extracted from 300 recordings of 10 jazz pianists correctly identified the performer in 59% of cases, six times better than chance. The most important features related to a performer's 'feel' (ensemble synchronization) and 'complexity' (information density). Further analysis revealed two clusters of performers, with those in the same cluster sharing similar rhythmic traits, and that the rhythmic style of each musician changed relatively little over the duration of their career. Our findings highlight the possibility that artificial intelligence can perform performer identification tasks normally reserved for experts. Links to each recording and the corresponding predictions are available on an interactive map to support future work in stylometry.

查看原文本刊更多论文

爵士乐即兴演奏的节奏特质可预测源分离录音中演奏者的身份和风格。

伟大的音乐家都有自己独特的风格，经过训练，人类可以学会区分这些风格。表演者之间的哪些差异能让我们做出这样的判断？我们通过建立一个机器学习模型来研究这个问题，该模型可以从录音中自动提取的数据中预测表演者的身份。这种模型可以根据各种音乐特征进行训练，但在此我们特别关注节奏，因为节奏（与和声、旋律和音色不同）与任何乐器都相关。我们证明，仅根据从 10 位爵士钢琴家的 300 份录音中提取的节奏特征训练的监督学习模型，就有 59% 的情况下正确识别了演奏者，比偶然识别率高出六倍。最重要的特征与演奏者的 "感觉"（合奏同步）和 "复杂性"（信息密度）有关。进一步的分析表明，演奏者有两个集群，同一集群中的演奏者具有相似的节奏特征，而且每个音乐家的节奏风格在其职业生涯中变化相对较小。我们的研究结果凸显了人工智能能够完成通常由专家完成的表演者识别任务的可能性。在交互式地图上可以链接到每段录音和相应的预测，以支持未来的风格测量工作。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Royal Society Open Science Multidisciplinary-Multidisciplinary

CiteScore

6.00

自引率

0.00%

发文量

508

审稿时长

14 weeks

期刊介绍： Royal Society Open Science is a new open journal publishing high-quality original research across the entire range of science on the basis of objective peer-review. The journal covers the entire range of science and mathematics and will allow the Society to publish all the high-quality work it receives without the usual restrictions on scope, length or impact.