Experiments and detailed error-analysis of automatic square notation transcription of medieval music manuscripts using CNN/LSTM-networks and a neume dictionary
IF 1.1 4区 计算机科学Q4 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS
{"title":"Experiments and detailed error-analysis of automatic square notation transcription of medieval music manuscripts using CNN/LSTM-networks and a neume dictionary","authors":"C. Wick, F. Puppe","doi":"10.1080/09298215.2021.1873393","DOIUrl":null,"url":null,"abstract":"The automatic recognition of scanned Medieval manuscripts written in square notation still represents a challenge due to degradation, non-standard layouts, or notations. We propose to apply CNN/LSTM networks that are trained using the segmentation-free CTC-loss-function. For evaluation, we use three different manuscripts and achieve a diplomatic Symbol Accuracy Rate (dSAR) of 86.0% on the most difficult book and 92.2% on the cleanest one. A neume dictionary during decoding yields a relative improvement of about 5%. Finally, we perform a thorough error analysis to provide a deeper insight into problems of the algorithm.","PeriodicalId":16553,"journal":{"name":"Journal of New Music Research","volume":"50 1","pages":"18 - 36"},"PeriodicalIF":1.1000,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/09298215.2021.1873393","citationCount":"10","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of New Music Research","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1080/09298215.2021.1873393","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 10
Abstract
The automatic recognition of scanned Medieval manuscripts written in square notation still represents a challenge due to degradation, non-standard layouts, or notations. We propose to apply CNN/LSTM networks that are trained using the segmentation-free CTC-loss-function. For evaluation, we use three different manuscripts and achieve a diplomatic Symbol Accuracy Rate (dSAR) of 86.0% on the most difficult book and 92.2% on the cleanest one. A neume dictionary during decoding yields a relative improvement of about 5%. Finally, we perform a thorough error analysis to provide a deeper insight into problems of the algorithm.
期刊介绍:
The Journal of New Music Research (JNMR) publishes material which increases our understanding of music and musical processes by systematic, scientific and technological means. Research published in the journal is innovative, empirically grounded and often, but not exclusively, uses quantitative methods. Articles are both musically relevant and scientifically rigorous, giving full technical details. No bounds are placed on the music or musical behaviours at issue: popular music, music of diverse cultures and the canon of western classical music are all within the Journal’s scope. Articles deal with theory, analysis, composition, performance, uses of music, instruments and other music technologies. The Journal was founded in 1972 with the original title Interface to reflect its interdisciplinary nature, drawing on musicology (including music theory), computer science, psychology, acoustics, philosophy, and other disciplines.