Rhythmic qualities of jazz improvisation predict performer identity and style in source-separated audio recordings.

IF 2.9 3区 综合性期刊 Q1 MULTIDISCIPLINARY SCIENCES
Royal Society Open Science Pub Date : 2024-11-13 eCollection Date: 2024-11-01 DOI:10.1098/rsos.240920
Huw Cheston, Joshua L Schlichting, Ian Cross, Peter M C Harrison
{"title":"Rhythmic qualities of jazz improvisation predict performer identity and style in source-separated audio recordings.","authors":"Huw Cheston, Joshua L Schlichting, Ian Cross, Peter M C Harrison","doi":"10.1098/rsos.240920","DOIUrl":null,"url":null,"abstract":"<p><p>Great musicians have a unique style and, with training, humans can learn to distinguish between these styles. What differences between performers enable us to make such judgements? We investigate this question by building a machine learning model that predicts performer identity from data extracted automatically from an audio recording. Such a model could be trained on all kinds of musical features, but here we focus specifically on rhythm, which (unlike harmony, melody and timbre) is relevant for any musical instrument. We demonstrate that a supervised learning model trained solely on rhythmic features extracted from 300 recordings of 10 jazz pianists correctly identified the performer in 59% of cases, six times better than chance. The most important features related to a performer's 'feel' (ensemble synchronization) and 'complexity' (information density). Further analysis revealed two clusters of performers, with those in the same cluster sharing similar rhythmic traits, and that the rhythmic style of each musician changed relatively little over the duration of their career. Our findings highlight the possibility that artificial intelligence can perform performer identification tasks normally reserved for experts. Links to each recording and the corresponding predictions are available on an interactive map to support future work in stylometry.</p>","PeriodicalId":21525,"journal":{"name":"Royal Society Open Science","volume":"11 11","pages":"240920"},"PeriodicalIF":2.9000,"publicationDate":"2024-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11557239/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Royal Society Open Science","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1098/rsos.240920","RegionNum":3,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/11/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

Great musicians have a unique style and, with training, humans can learn to distinguish between these styles. What differences between performers enable us to make such judgements? We investigate this question by building a machine learning model that predicts performer identity from data extracted automatically from an audio recording. Such a model could be trained on all kinds of musical features, but here we focus specifically on rhythm, which (unlike harmony, melody and timbre) is relevant for any musical instrument. We demonstrate that a supervised learning model trained solely on rhythmic features extracted from 300 recordings of 10 jazz pianists correctly identified the performer in 59% of cases, six times better than chance. The most important features related to a performer's 'feel' (ensemble synchronization) and 'complexity' (information density). Further analysis revealed two clusters of performers, with those in the same cluster sharing similar rhythmic traits, and that the rhythmic style of each musician changed relatively little over the duration of their career. Our findings highlight the possibility that artificial intelligence can perform performer identification tasks normally reserved for experts. Links to each recording and the corresponding predictions are available on an interactive map to support future work in stylometry.

爵士乐即兴演奏的节奏特质可预测源分离录音中演奏者的身份和风格。
伟大的音乐家都有自己独特的风格,经过训练,人类可以学会区分这些风格。表演者之间的哪些差异能让我们做出这样的判断?我们通过建立一个机器学习模型来研究这个问题,该模型可以从录音中自动提取的数据中预测表演者的身份。这种模型可以根据各种音乐特征进行训练,但在此我们特别关注节奏,因为节奏(与和声、旋律和音色不同)与任何乐器都相关。我们证明,仅根据从 10 位爵士钢琴家的 300 份录音中提取的节奏特征训练的监督学习模型,就有 59% 的情况下正确识别了演奏者,比偶然识别率高出六倍。最重要的特征与演奏者的 "感觉"(合奏同步)和 "复杂性"(信息密度)有关。进一步的分析表明,演奏者有两个集群,同一集群中的演奏者具有相似的节奏特征,而且每个音乐家的节奏风格在其职业生涯中变化相对较小。我们的研究结果凸显了人工智能能够完成通常由专家完成的表演者识别任务的可能性。在交互式地图上可以链接到每段录音和相应的预测,以支持未来的风格测量工作。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Royal Society Open Science
Royal Society Open Science Multidisciplinary-Multidisciplinary
CiteScore
6.00
自引率
0.00%
发文量
508
审稿时长
14 weeks
期刊介绍: Royal Society Open Science is a new open journal publishing high-quality original research across the entire range of science on the basis of objective peer-review. The journal covers the entire range of science and mathematics and will allow the Society to publish all the high-quality work it receives without the usual restrictions on scope, length or impact.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信