Federico Simonetta, Rishav Mondal, Luca Andrea Ludovico, Stavros Ntalampiras
{"title":"里科迪档案馆手稿中的光学音乐识别技术","authors":"Federico Simonetta, Rishav Mondal, Luca Andrea Ludovico, Stavros Ntalampiras","doi":"arxiv-2408.10260","DOIUrl":null,"url":null,"abstract":"The Ricordi archive, a prestigious collection of significant musical\nmanuscripts from renowned opera composers such as Donizetti, Verdi and Puccini,\nhas been digitized. This process has allowed us to automatically extract\nsamples that represent various musical elements depicted on the manuscripts,\nincluding notes, staves, clefs, erasures, and composer's annotations, among\nothers. To distinguish between digitization noise and actual music elements, a\nsubset of these images was meticulously grouped and labeled by multiple\nindividuals into several classes. After assessing the consistency of the\nannotations, we trained multiple neural network-based classifiers to\ndifferentiate between the identified music elements. The primary objective of\nthis study was to evaluate the reliability of these classifiers, with the\nultimate goal of using them for the automatic categorization of the remaining\nunannotated data set. The dataset, complemented by manual annotations, models,\nand source code used in these experiments are publicly accessible for\nreplication purposes.","PeriodicalId":501285,"journal":{"name":"arXiv - CS - Digital Libraries","volume":"38 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Optical Music Recognition in Manuscripts from the Ricordi Archive\",\"authors\":\"Federico Simonetta, Rishav Mondal, Luca Andrea Ludovico, Stavros Ntalampiras\",\"doi\":\"arxiv-2408.10260\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The Ricordi archive, a prestigious collection of significant musical\\nmanuscripts from renowned opera composers such as Donizetti, Verdi and Puccini,\\nhas been digitized. This process has allowed us to automatically extract\\nsamples that represent various musical elements depicted on the manuscripts,\\nincluding notes, staves, clefs, erasures, and composer's annotations, among\\nothers. To distinguish between digitization noise and actual music elements, a\\nsubset of these images was meticulously grouped and labeled by multiple\\nindividuals into several classes. After assessing the consistency of the\\nannotations, we trained multiple neural network-based classifiers to\\ndifferentiate between the identified music elements. The primary objective of\\nthis study was to evaluate the reliability of these classifiers, with the\\nultimate goal of using them for the automatic categorization of the remaining\\nunannotated data set. The dataset, complemented by manual annotations, models,\\nand source code used in these experiments are publicly accessible for\\nreplication purposes.\",\"PeriodicalId\":501285,\"journal\":{\"name\":\"arXiv - CS - Digital Libraries\",\"volume\":\"38 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-08-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - CS - Digital Libraries\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2408.10260\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Digital Libraries","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2408.10260","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Optical Music Recognition in Manuscripts from the Ricordi Archive
The Ricordi archive, a prestigious collection of significant musical
manuscripts from renowned opera composers such as Donizetti, Verdi and Puccini,
has been digitized. This process has allowed us to automatically extract
samples that represent various musical elements depicted on the manuscripts,
including notes, staves, clefs, erasures, and composer's annotations, among
others. To distinguish between digitization noise and actual music elements, a
subset of these images was meticulously grouped and labeled by multiple
individuals into several classes. After assessing the consistency of the
annotations, we trained multiple neural network-based classifiers to
differentiate between the identified music elements. The primary objective of
this study was to evaluate the reliability of these classifiers, with the
ultimate goal of using them for the automatic categorization of the remaining
unannotated data set. The dataset, complemented by manual annotations, models,
and source code used in these experiments are publicly accessible for
replication purposes.