{"title":"Heuristic approach to the recognition of printed Arabic script","authors":"A. M. Obaid, T. Dobrowiecki","doi":"10.1109/INES.1997.632416","DOIUrl":null,"url":null,"abstract":"A new segmentation-free method, called N-markers, is proposed for machine recognition of the Arabic printed texts. The contribution aims at the optical character recognition of printed texts, like books and journals of good quality, usually typeset in so-called Naskhi font. The focus of attention is shifted from the recognition of multifont texts to that of single Naskhi font, taking, however, into account shape variations originated in different typesetting workshops, and the intensive presence of the ligatures in normal printed texts. The proposed method is a mixture of global and structural approaches and is related to some early ideas of the optical character recognition (OCR) of the isolated Roman characters.","PeriodicalId":161975,"journal":{"name":"Proceedings of IEEE International Conference on Intelligent Engineering Systems","volume":"18 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1997-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of IEEE International Conference on Intelligent Engineering Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/INES.1997.632416","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
A new segmentation-free method, called N-markers, is proposed for machine recognition of the Arabic printed texts. The contribution aims at the optical character recognition of printed texts, like books and journals of good quality, usually typeset in so-called Naskhi font. The focus of attention is shifted from the recognition of multifont texts to that of single Naskhi font, taking, however, into account shape variations originated in different typesetting workshops, and the intensive presence of the ligatures in normal printed texts. The proposed method is a mixture of global and structural approaches and is related to some early ideas of the optical character recognition (OCR) of the isolated Roman characters.