Macarius: HTR Model for Romanian Slavonic Early Printed Books
Vladimir Polomac
Slavistica Vilnensis. DOI: https://doi.org/10.15388/slavviln.2022.68(2).1
Publication date listed in the record: 2024-02-21
The paper describes the creation and evaluation of an HTR (Handwritten Text Recognition) model for Romanian Slavonic early printed books (first half of the 16th century, Middle Bulgarian Church Slavonic, Cyrillic script) using Transkribus, a software platform based on artificial intelligence, machine learning and advanced neural networks. The model was trained on Romanian Slavonic early printed books printed in Târgovişte: the Liturgikon of 1508 and the Tetraevangelion of 1512, from the oldest printing house, managed by hieromonk Macarius, and the Apostle of 1547, from the printing house managed by Dimitrije Ljubavić. The most important result of the paper is the first version of the generic HTR model Macarius (named in honour of hieromonk Makarije, the first South Slavonic and Romanian printer), which misrecognizes only 2.7% of characters, accent marks included. Research has shown that the model can also be used for the automatic recognition of Romanian Slavonic early printed books published in the second half of the 16th century. The Macarius model, together with its Ground Truth data, is available to all users of the Transkribus platform, which ensures its wider use and allows its performance to be improved further.
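
The reported 2.7% is a share of incorrectly recognized characters. One common way to compute such a figure is the character error rate (CER): the edit distance between the Ground Truth transcription and the recognized text, divided by the length of the Ground Truth. The sketch below illustrates that calculation only; it is not the paper's or Transkribus's implementation. The function names `levenshtein` and `cer` are hypothetical, and the NFD normalization step is an assumption made here so that each combining accent mark counts as a character of its own.

```python
import unicodedata


def levenshtein(ref: str, hyp: str) -> int:
    """Minimum number of character insertions, deletions and substitutions
    needed to turn `hyp` into `ref` (standard dynamic programming)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def cer(reference: str, hypothesis: str) -> float:
    """Character error rate: edit distance divided by reference length.

    Assumption: both strings are decomposed with NFD first, so a base
    letter and its combining accent are counted as two characters.
    """
    ref = unicodedata.normalize("NFD", reference)
    hyp = unicodedata.normalize("NFD", hypothesis)
    if not ref:
        raise ValueError("reference transcription must not be empty")
    return levenshtein(ref, hyp) / len(ref)


if __name__ == "__main__":
    # A dropped accent mark counts as one error out of two characters.
    print(cer("а\u0301", "а"))
```

Under this definition, a CER of 0.027 would correspond to roughly 27 character-level errors per 1,000 Ground Truth characters.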