Heuristic approach to the recognition of printed Arabic script

Proceedings of IEEE International Conference on Intelligent Engineering Systems. Pub Date: 1997-09-15. DOI: 10.1109/INES.1997.632416

A. M. Obaid, T. Dobrowiecki

Citations: 4

Abstract

A new segmentation-free method, called N-markers, is proposed for machine recognition of printed Arabic text. The work targets optical character recognition (OCR) of good-quality printed material, such as books and journals, which is usually typeset in the so-called Naskhi font. The focus shifts from recognizing multifont text to recognizing the single Naskhi font, while accounting for shape variations that originate in different typesetting workshops and for the heavy presence of ligatures in normal printed text. The proposed method combines global and structural approaches and is related to some early ideas in the OCR of isolated Roman characters.
