Can we build language-independent OCR using LSTM networks?

MOCR '13 Pub Date : 2013-08-24 DOI:10.1145/2505377.2505394

A. Ul-Hasan, T. Breuel

引用次数: 39

Abstract

Language models or recognition dictionaries are usually considered an essential step in OCR. However, using a language model complicates training of OCR systems, and it also narrows the range of texts that an OCR system can be used with. Recent results have shown that Long Short-Term Memory (LSTM) based OCR yields low error rates even without language modeling. In this paper, we explore the question to what extent LSTM models can be used for multilingual OCR without the use of language models. To do this, we measure cross-language performance of LSTM models trained on different languages. LSTM models show good promise to be used for language-independent OCR. The recognition errors are very low (around 1%) without using any language model or dictionary correction.

查看原文本刊更多论文

我们可以使用LSTM网络构建语言无关的OCR吗?

语言模型或识别字典通常被认为是OCR的重要步骤。然而，使用语言模型使OCR系统的训练变得复杂，并且它也缩小了OCR系统可以使用的文本范围。最近的研究结果表明，即使没有语言建模，基于长短期记忆(LSTM)的OCR也能产生较低的错误率。在本文中，我们探讨了在不使用语言模型的情况下LSTM模型在多大程度上可以用于多语言OCR的问题。为此，我们测量了在不同语言上训练的LSTM模型的跨语言性能。LSTM模型在用于语言无关OCR方面表现出良好的前景。在不使用任何语言模型或字典校正的情况下，识别误差非常低(约1%)。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

MOCR '13

自引率

0.00%

发文量