Recurrent HMMs and Cursive Handwriting Recognition Graphs

2009 10th International Conference on Document Analysis and Recognition Pub Date : 2009-07-26 DOI:10.1109/ICDAR.2009.217

M. Schambach

引用次数: 11

Abstract

Standard cursive handwriting recognition is based on a language model, mostly a lexicon of possible word hypotheses or character n-grams. The result is a list of word alternatives ranked by confidence. Present-day applications use very large language models, leading to high computational costs and reduced accuracy. For a standard HMM-based word recognition system, a new recurrent HMM approach for very fast lexicon-free recognition will be presented. The evaluation of this model creates a "recognition graph", a compact representation of result alternatives of lexicon-free recognition. This structure is formally identical to results of single character segmentation and recognition. Thus it can be directly evaluated by interpretation algorithms following this process, and can even be merged with these results. In addition, the recognition graph is a basis for further evaluation in terms of word recognition. It allows fast evaluation of word hypotheses, easy integration of various language models like n-grams, and the efficient extraction of lexicon-free n-best result alternatives.

查看原文本刊更多论文

反复出现的hmm和草书手写识别图

标准的草书手写识别基于语言模型，主要是可能的单词假设或字符n-图的词典。结果是一个按信心排序的备选词列表。目前的应用程序使用非常大的语言模型，导致高计算成本和降低的准确性。针对一个标准的基于HMM的词识别系统，提出了一种新的递归HMM方法来实现快速的无词典识别。该模型的评估创建了一个“识别图”，即无词典识别的结果替代的紧凑表示。这种结构在形式上与单字符分割和识别的结果相同。因此，它可以直接评估解释算法遵循这一过程，甚至可以与这些结果合并。此外，识别图是进一步评价单词识别能力的基础。它允许快速评估单词假设，轻松集成各种语言模型，如n-grams，以及有效地提取无词典的n-最佳结果替代方案。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2009 10th International Conference on Document Analysis and Recognition

自引率

0.00%

发文量