Heuristic approach to the recognition of printed Arabic script

A. M. Obaid, T. Dobrowiecki
{"title":"Heuristic approach to the recognition of printed Arabic script","authors":"A. M. Obaid, T. Dobrowiecki","doi":"10.1109/INES.1997.632416","DOIUrl":null,"url":null,"abstract":"A new segmentation-free method, called N-markers, is proposed for machine recognition of the Arabic printed texts. The contribution aims at the optical character recognition of printed texts, like books and journals of good quality, usually typeset in so-called Naskhi font. The focus of attention is shifted from the recognition of multifont texts to that of single Naskhi font, taking, however, into account shape variations originated in different typesetting workshops, and the intensive presence of the ligatures in normal printed texts. The proposed method is a mixture of global and structural approaches and is related to some early ideas of the optical character recognition (OCR) of the isolated Roman characters.","PeriodicalId":161975,"journal":{"name":"Proceedings of IEEE International Conference on Intelligent Engineering Systems","volume":"18 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1997-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of IEEE International Conference on Intelligent Engineering Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/INES.1997.632416","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

Abstract

A new segmentation-free method, called N-markers, is proposed for machine recognition of the Arabic printed texts. The contribution aims at the optical character recognition of printed texts, like books and journals of good quality, usually typeset in so-called Naskhi font. The focus of attention is shifted from the recognition of multifont texts to that of single Naskhi font, taking, however, into account shape variations originated in different typesetting workshops, and the intensive presence of the ligatures in normal printed texts. The proposed method is a mixture of global and structural approaches and is related to some early ideas of the optical character recognition (OCR) of the isolated Roman characters.
阿拉伯文字印刷体识别的启发式方法
提出了一种新的无分割方法,称为n标记,用于机器识别阿拉伯语印刷文本。该贡献旨在光学字符识别印刷文本,如高质量的书籍和期刊,通常用所谓的纳斯克字体排版。注意的焦点从多字体文本的识别转移到单一纳斯克字体的识别,然而,考虑到不同的排版车间产生的形状变化,以及在正常印刷文本中大量存在的连体。所提出的方法是全局和结构方法的混合,并与早期孤立罗马字符光学字符识别(OCR)的一些思想有关。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信