里科迪档案馆手稿中的光学音乐识别技术

arXiv - CS - Digital Libraries Pub Date : 2024-08-14 DOI:arxiv-2408.10260

Federico Simonetta, Rishav Mondal, Luca Andrea Ludovico, Stavros Ntalampiras

{"title":"里科迪档案馆手稿中的光学音乐识别技术","authors":"Federico Simonetta, Rishav Mondal, Luca Andrea Ludovico, Stavros Ntalampiras","doi":"arxiv-2408.10260","DOIUrl":null,"url":null,"abstract":"The Ricordi archive, a prestigious collection of significant musical\nmanuscripts from renowned opera composers such as Donizetti, Verdi and Puccini,\nhas been digitized. This process has allowed us to automatically extract\nsamples that represent various musical elements depicted on the manuscripts,\nincluding notes, staves, clefs, erasures, and composer's annotations, among\nothers. To distinguish between digitization noise and actual music elements, a\nsubset of these images was meticulously grouped and labeled by multiple\nindividuals into several classes. After assessing the consistency of the\nannotations, we trained multiple neural network-based classifiers to\ndifferentiate between the identified music elements. The primary objective of\nthis study was to evaluate the reliability of these classifiers, with the\nultimate goal of using them for the automatic categorization of the remaining\nunannotated data set. The dataset, complemented by manual annotations, models,\nand source code used in these experiments are publicly accessible for\nreplication purposes.","PeriodicalId":501285,"journal":{"name":"arXiv - CS - Digital Libraries","volume":"38 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Optical Music Recognition in Manuscripts from the Ricordi Archive\",\"authors\":\"Federico Simonetta, Rishav Mondal, Luca Andrea Ludovico, Stavros Ntalampiras\",\"doi\":\"arxiv-2408.10260\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The Ricordi archive, a prestigious collection of significant musical\\nmanuscripts from renowned opera composers such as Donizetti, Verdi and Puccini,\\nhas been digitized. This process has allowed us to automatically extract\\nsamples that represent various musical elements depicted on the manuscripts,\\nincluding notes, staves, clefs, erasures, and composer's annotations, among\\nothers. To distinguish between digitization noise and actual music elements, a\\nsubset of these images was meticulously grouped and labeled by multiple\\nindividuals into several classes. After assessing the consistency of the\\nannotations, we trained multiple neural network-based classifiers to\\ndifferentiate between the identified music elements. The primary objective of\\nthis study was to evaluate the reliability of these classifiers, with the\\nultimate goal of using them for the automatic categorization of the remaining\\nunannotated data set. The dataset, complemented by manual annotations, models,\\nand source code used in these experiments are publicly accessible for\\nreplication purposes.\",\"PeriodicalId\":501285,\"journal\":{\"name\":\"arXiv - CS - Digital Libraries\",\"volume\":\"38 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-08-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - CS - Digital Libraries\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2408.10260\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Digital Libraries","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2408.10260","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

里科尔迪档案馆是著名歌剧作曲家（如多尼采蒂、威尔第和普契尼）的重要音乐手稿的数字化收藏馆。在这一过程中，我们自动提取了代表手稿上各种音乐元素的样本，包括音符、谱表、谱号、擦除和作曲家注释等。为了区分数字化噪音和实际音乐元素，我们对这些图像的子集进行了细致的分组，并由多人将其标记为多个类别。在评估了标注的一致性后，我们训练了多个基于神经网络的分类器来区分所识别的音乐元素。这项研究的主要目的是评估这些分类器的可靠性，最终目标是使用它们对剩余的未注释数据集进行自动分类。实验中使用的数据集、人工注释、模型和源代码均可公开获取，以便复制。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Optical Music Recognition in Manuscripts from the Ricordi Archive

The Ricordi archive, a prestigious collection of significant musical manuscripts from renowned opera composers such as Donizetti, Verdi and Puccini, has been digitized. This process has allowed us to automatically extract samples that represent various musical elements depicted on the manuscripts, including notes, staves, clefs, erasures, and composer's annotations, among others. To distinguish between digitization noise and actual music elements, a subset of these images was meticulously grouped and labeled by multiple individuals into several classes. After assessing the consistency of the annotations, we trained multiple neural network-based classifiers to differentiate between the identified music elements. The primary objective of this study was to evaluate the reliability of these classifiers, with the ultimate goal of using them for the automatic categorization of the remaining unannotated data set. The dataset, complemented by manual annotations, models, and source code used in these experiments are publicly accessible for replication purposes.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

arXiv - CS - Digital Libraries

自引率

0.00%

发文量