Macarius: An HTR Model for Romanian Slavonic Early Printed Books

Q4 Arts and Humanities
Vladimir Polomac
DOI: 10.15388/slavviln.2022.68(2).1 (https://doi.org/10.15388/slavviln.2022.68(2).1)
Journal: Slavistica Vilnensis
Publication date: 2024-02-21
Publication type: Journal Article
Citations: 0

Abstract

Macarius: HTR modelis senoms slaviškoms spausdintoms knygoms iš Rumunijos
The paper describes the creation and evaluation of an HTR (Handwritten Text Recognition) model for Romanian Slavonic early printed books (first half of the 16th century, Middle Bulgarian Church Slavonic, Cyrillic script) using the Transkribus software platform, which is based on artificial intelligence, machine learning and advanced neural networks. The model was created from Romanian Slavonic early printed books printed in Târgovişte: the Liturgikon (1508) and the Tetraevangelion (1512), from the oldest printing house, managed by hieromonk Macarius, and the Apostle (1547), from the printing house managed by Dimitrije Ljubavić. The most important result is the first version of the generic HTR model Macarius (named in honour of hieromonk Makarije, the first South Slavonic and Romanian printer), which performs exceptionally well: only 2.7% of characters (including accent marks) are recognized incorrectly. The research also shows that the model can be used for automatic recognition of Romanian Slavonic early printed books published in the second half of the 16th century. The Macarius model and its Ground Truth data are available to all users of the Transkribus platform, which supports wider use and further improvement of its performance.
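The 2.7% figure is a character error rate (CER). The abstract does not say how the figure was computed, so the following is only a minimal sketch of the conventional definition, assuming CER is the Levenshtein edit distance between the recognized text and the Ground Truth divided by the Ground Truth length. The function names and the sample strings are hypothetical and are not taken from the paper or from Transkribus. Characters are counted as Unicode code points, so a combining accent mark counts as a character of its own.

```python
def edit_distance(ref: str, hyp: str) -> int:
    """Levenshtein distance between two strings, counted in code points."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def character_error_rate(ref: str, hyp: str) -> float:
    """Edit distance divided by the length of the reference text."""
    if not ref:
        raise ValueError("reference transcription must not be empty")
    return edit_distance(ref, hyp) / len(ref)


if __name__ == "__main__":
    # Hypothetical sample: one wrong character out of five.
    ground_truth = "слово"
    recognized = "слова"
    print(f"CER: {character_error_rate(ground_truth, recognized):.1%}")  # CER: 20.0%
```

Under this definition, a CER of 2.7% corresponds to roughly 27 character-level edits per 1,000 Ground Truth characters.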
Source journal
Slavistica Vilnensis (Arts and Humanities: Language and Linguistics)
CiteScore: 0.30
Self-citation rate: 0.00%
Articles published: 13
Review time: 24 weeks