Yuancheng Wang, Yuhui Huang, Wei Wei, D. Cazau, O. Adam, Qiao Wang
{"title":"基于多弦概率潜在成分分析模型的中国传统弹拨乐器琵琶自动谱曲研究","authors":"Yuancheng Wang, Yuhui Huang, Wei Wei, D. Cazau, O. Adam, Qiao Wang","doi":"10.1049/icp.2021.1460","DOIUrl":null,"url":null,"abstract":"The Probabilistic Latent Component Analysis (PLCA) provides a flexible and highly interpretable framework to model a diversity of music features such as note event specific features (e.g. pitch, duration, amplitude, frequency shifting) and higherlevel knowledge like instrument timbre for Automatic Music Transcription (AMT). In this paper, Multi-String PLCA (MSPLCA) is proposed and allows to model the timbre of individual string characterized by different thickness and materials for polyphonic music transcription of pipa, the head of Chinese traditional plucked string instruments, which is barely studied in the Music Information Retrieval (MIR) community. A dataset, composing 9 famous pieces of Chinese folk music and the notelevel annotation, is created with help of musicians and music experts. As a result, MS-PLCA and its 2 variants adapted to the pipa acoustic features reach AMT performance similar to those found in literature for other instrument transcription. Finally, we illustrate the importance of modeled acoustic features over 2 most common playing techniques, vibrato and tremolo reflecting the periodic pitch and amplitude modulation.","PeriodicalId":431144,"journal":{"name":"11th International Conference of Pattern Recognition Systems (ICPRS 2021)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Automatic Music Transcription dedicated to Chinese Traditional Plucked String Instrument Pipa using Multi-string Probabilistic Latent Component Analysis Models\",\"authors\":\"Yuancheng Wang, Yuhui Huang, Wei Wei, D. Cazau, O. Adam, Qiao Wang\",\"doi\":\"10.1049/icp.2021.1460\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The Probabilistic Latent Component Analysis (PLCA) provides a flexible and highly interpretable framework to model a diversity of music features such as note event specific features (e.g. pitch, duration, amplitude, frequency shifting) and higherlevel knowledge like instrument timbre for Automatic Music Transcription (AMT). In this paper, Multi-String PLCA (MSPLCA) is proposed and allows to model the timbre of individual string characterized by different thickness and materials for polyphonic music transcription of pipa, the head of Chinese traditional plucked string instruments, which is barely studied in the Music Information Retrieval (MIR) community. A dataset, composing 9 famous pieces of Chinese folk music and the notelevel annotation, is created with help of musicians and music experts. As a result, MS-PLCA and its 2 variants adapted to the pipa acoustic features reach AMT performance similar to those found in literature for other instrument transcription. Finally, we illustrate the importance of modeled acoustic features over 2 most common playing techniques, vibrato and tremolo reflecting the periodic pitch and amplitude modulation.\",\"PeriodicalId\":431144,\"journal\":{\"name\":\"11th International Conference of Pattern Recognition Systems (ICPRS 2021)\",\"volume\":\"23 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1900-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"11th International Conference of Pattern Recognition Systems (ICPRS 2021)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1049/icp.2021.1460\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"11th International Conference of Pattern Recognition Systems (ICPRS 2021)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1049/icp.2021.1460","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
摘要
概率潜在成分分析(PLCA)提供了一个灵活且高度可解释的框架来模拟各种音乐特征,如音符事件特定特征(如音调,持续时间,幅度,频移)和更高层次的知识,如自动音乐转录(AMT)的乐器音色。本文提出了一种多弦PLCA (Multi-String PLCA, MSPLCA)方法,该方法可以对我国传统拨弦乐器中琵琶的复调曲谱进行不同厚度、不同材质的单弦音色建模,这一问题在音乐信息检索(music Information Retrieval, MIR)领域研究较少。在音乐家和音乐专家的帮助下,建立了一个由9首中国民乐名曲和小说注释组成的数据集。因此,MS-PLCA及其2个适应琵琶声学特征的变体达到了与文献中发现的其他乐器转录相似的AMT性能。最后,我们说明了建模声学特征在2种最常见的演奏技术上的重要性,即反映周期性音高和幅度调制的颤音和颤音。
Automatic Music Transcription dedicated to Chinese Traditional Plucked String Instrument Pipa using Multi-string Probabilistic Latent Component Analysis Models
The Probabilistic Latent Component Analysis (PLCA) provides a flexible and highly interpretable framework to model a diversity of music features such as note event specific features (e.g. pitch, duration, amplitude, frequency shifting) and higherlevel knowledge like instrument timbre for Automatic Music Transcription (AMT). In this paper, Multi-String PLCA (MSPLCA) is proposed and allows to model the timbre of individual string characterized by different thickness and materials for polyphonic music transcription of pipa, the head of Chinese traditional plucked string instruments, which is barely studied in the Music Information Retrieval (MIR) community. A dataset, composing 9 famous pieces of Chinese folk music and the notelevel annotation, is created with help of musicians and music experts. As a result, MS-PLCA and its 2 variants adapted to the pipa acoustic features reach AMT performance similar to those found in literature for other instrument transcription. Finally, we illustrate the importance of modeled acoustic features over 2 most common playing techniques, vibrato and tremolo reflecting the periodic pitch and amplitude modulation.