A hypothesis testing approach to word recognition using an A* search algorithm

Proceedings of 3rd International Conference on Document Analysis and Recognition. Pub Date: 1995-08-14. DOI: 10.1109/ICDAR.1995.599013

Chi Fang, J. Hull

Citations: 2

Abstract

A hypothesis testing approach for recognizing machine-printed words is presented in this paper. Based on knowledge of the document font and candidates for the identity of a word, this approach searches a tree of word decisions to generate and test hypotheses for character recognition and segmentation. The search starts at each sequential character position from both ends of a word image and proceeds inward. Using an A* search algorithm, the accumulated cost of reaching a given partial recognition decision is combined with an estimate of the remaining cost to reach a goal state. The proposed algorithm compensates for local degradations by relying on global characteristics of a word image. Tests of the algorithm show a recognition rate of 98.93% on degraded scanned document images with touching characters.
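The cost combination described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the `position_costs` table (a stand-in for font-based character matching costs) and the `default_cost` parameter are all hypothetical, and segmentation is assumed to be already fixed, which the paper does not assume. Each search state records how many characters of a candidate word have been decided from the left and from the right; the priority is the accumulated cost g plus an optimistic estimate h of the cost of the undecided middle positions.

```python
import heapq


def char_cost(position_costs, pos, ch, default_cost):
    """Cost of hypothesizing character `ch` at position `pos` (hypothetical model)."""
    return position_costs[pos].get(ch, default_cost)


def remaining_estimate(position_costs, left, right, default_cost):
    """Admissible h: the cheapest possible cost of every still-undecided position."""
    n = len(position_costs)
    return sum(
        min(list(position_costs[i].values()) + [default_cost])
        for i in range(left, n - right)
    )


def recognize_word(position_costs, candidates, default_cost=1.0):
    """A* over (word, left, right) states, working inward from both word ends.

    Returns (best_word, total_cost), or None if no candidate has a matching length.
    """
    n = len(position_costs)
    frontier = []
    for word in candidates:
        if len(word) == n:  # simplification: segmentation is assumed known
            h = remaining_estimate(position_costs, 0, 0, default_cost)
            heapq.heappush(frontier, (h, 0.0, word, 0, 0))
    closed = set()
    while frontier:
        _, g, word, left, right = heapq.heappop(frontier)
        if left + right == n:  # goal: every character position decided
            return word, g
        if (word, left, right) in closed:
            continue
        closed.add((word, left, right))
        # Extend the partial decision by one character from the left or the right.
        for pos, new_left, new_right in (
            (left, left + 1, right),
            (n - right - 1, left, right + 1),
        ):
            new_g = g + char_cost(position_costs, pos, word[pos], default_cost)
            h = remaining_estimate(position_costs, new_left, new_right, default_cost)
            heapq.heappush(frontier, (new_g + h, new_g, word, new_left, new_right))
    return None


# Toy data: the middle character is degraded, so "o" looks locally cheaper than "a".
example_costs = [
    {"c": 0.1, "e": 0.6},
    {"a": 0.5, "o": 0.4},
    {"t": 0.1, "l": 0.7},
]
best = recognize_word(example_costs, ["cat", "cut", "eat"])
```

In this toy table the middle position slightly prefers "o", but the candidate list constrains the decision and "cat" has the lowest total cost. This loosely mirrors how global word-level evidence can compensate for a locally degraded character.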


