Automatic Music Transcription dedicated to Chinese Traditional Plucked String Instrument Pipa using Multi-string Probabilistic Latent Component Analysis Models
Yuancheng Wang, Yuhui Huang, Wei Wei, D. Cazau, O. Adam, Qiao Wang
{"title":"Automatic Music Transcription dedicated to Chinese Traditional Plucked String Instrument Pipa using Multi-string Probabilistic Latent Component Analysis Models","authors":"Yuancheng Wang, Yuhui Huang, Wei Wei, D. Cazau, O. Adam, Qiao Wang","doi":"10.1049/icp.2021.1460","DOIUrl":null,"url":null,"abstract":"The Probabilistic Latent Component Analysis (PLCA) provides a flexible and highly interpretable framework to model a diversity of music features such as note event specific features (e.g. pitch, duration, amplitude, frequency shifting) and higherlevel knowledge like instrument timbre for Automatic Music Transcription (AMT). In this paper, Multi-String PLCA (MSPLCA) is proposed and allows to model the timbre of individual string characterized by different thickness and materials for polyphonic music transcription of pipa, the head of Chinese traditional plucked string instruments, which is barely studied in the Music Information Retrieval (MIR) community. A dataset, composing 9 famous pieces of Chinese folk music and the notelevel annotation, is created with help of musicians and music experts. As a result, MS-PLCA and its 2 variants adapted to the pipa acoustic features reach AMT performance similar to those found in literature for other instrument transcription. Finally, we illustrate the importance of modeled acoustic features over 2 most common playing techniques, vibrato and tremolo reflecting the periodic pitch and amplitude modulation.","PeriodicalId":431144,"journal":{"name":"11th International Conference of Pattern Recognition Systems (ICPRS 2021)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"11th International Conference of Pattern Recognition Systems (ICPRS 2021)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1049/icp.2021.1460","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
The Probabilistic Latent Component Analysis (PLCA) provides a flexible and highly interpretable framework to model a diversity of music features such as note event specific features (e.g. pitch, duration, amplitude, frequency shifting) and higherlevel knowledge like instrument timbre for Automatic Music Transcription (AMT). In this paper, Multi-String PLCA (MSPLCA) is proposed and allows to model the timbre of individual string characterized by different thickness and materials for polyphonic music transcription of pipa, the head of Chinese traditional plucked string instruments, which is barely studied in the Music Information Retrieval (MIR) community. A dataset, composing 9 famous pieces of Chinese folk music and the notelevel annotation, is created with help of musicians and music experts. As a result, MS-PLCA and its 2 variants adapted to the pipa acoustic features reach AMT performance similar to those found in literature for other instrument transcription. Finally, we illustrate the importance of modeled acoustic features over 2 most common playing techniques, vibrato and tremolo reflecting the periodic pitch and amplitude modulation.