在飞行中学习:困难OCR问题的无字体方法

2009 10th International Conference on Document Analysis and Recognition Pub Date : 2009-07-26 DOI:10.1109/ICDAR.2009.260

Andrew Kae, E. Learned-Miller

{"title":"在飞行中学习:困难OCR问题的无字体方法","authors":"Andrew Kae, E. Learned-Miller","doi":"10.1109/ICDAR.2009.260","DOIUrl":null,"url":null,"abstract":"Despite ubiquitous claims that optical character recognition (OCR) is a \"solved problem,'' many categories of documents continue to break modern OCR software such as documents with moderate degradation or unusual fonts. Many approaches rely on pre-computed or stored character models, but these are vulnerable to cases when the font of a particular document was not part of the training set, or when there is so much noise in a document that the font model becomes weak. To address these difficult cases, we present a form of iterative contextual modeling that learns character models directly from the document it is trying to recognize. We use these learned models both to segment the characters and to recognize them in an incremental, iterative process. We present results comparable to those of a commercial OCR system on a subset of characters from a difficult test document.","PeriodicalId":433762,"journal":{"name":"2009 10th International Conference on Document Analysis and Recognition","volume":"35 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"34","resultStr":"{\"title\":\"Learning on the Fly: Font-Free Approaches to Difficult OCR Problems\",\"authors\":\"Andrew Kae, E. Learned-Miller\",\"doi\":\"10.1109/ICDAR.2009.260\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Despite ubiquitous claims that optical character recognition (OCR) is a \\\"solved problem,'' many categories of documents continue to break modern OCR software such as documents with moderate degradation or unusual fonts. Many approaches rely on pre-computed or stored character models, but these are vulnerable to cases when the font of a particular document was not part of the training set, or when there is so much noise in a document that the font model becomes weak. To address these difficult cases, we present a form of iterative contextual modeling that learns character models directly from the document it is trying to recognize. We use these learned models both to segment the characters and to recognize them in an incremental, iterative process. We present results comparable to those of a commercial OCR system on a subset of characters from a difficult test document.\",\"PeriodicalId\":433762,\"journal\":{\"name\":\"2009 10th International Conference on Document Analysis and Recognition\",\"volume\":\"35 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2009-07-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"34\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2009 10th International Conference on Document Analysis and Recognition\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICDAR.2009.260\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 10th International Conference on Document Analysis and Recognition","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDAR.2009.260","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 34

摘要

尽管普遍声称光学字符识别(OCR)是一个“已解决的问题”，但许多类别的文档继续破坏现代OCR软件，例如具有中度退化或不寻常字体的文档。许多方法依赖于预先计算或存储的字符模型，但是当特定文档的字体不是训练集的一部分时，或者当文档中有太多噪声导致字体模型变弱时，这些方法很容易受到影响。为了解决这些困难的情况，我们提出了一种迭代上下文建模的形式，它直接从它试图识别的文档中学习角色模型。我们使用这些学习模型来分割字符，并在增量迭代过程中识别它们。我们给出的结果与商业OCR系统对来自困难测试文档的字符子集的结果相当。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Learning on the Fly: Font-Free Approaches to Difficult OCR Problems

Despite ubiquitous claims that optical character recognition (OCR) is a "solved problem,'' many categories of documents continue to break modern OCR software such as documents with moderate degradation or unusual fonts. Many approaches rely on pre-computed or stored character models, but these are vulnerable to cases when the font of a particular document was not part of the training set, or when there is so much noise in a document that the font model becomes weak. To address these difficult cases, we present a form of iterative contextual modeling that learns character models directly from the document it is trying to recognize. We use these learned models both to segment the characters and to recognize them in an incremental, iterative process. We present results comparable to those of a commercial OCR system on a subset of characters from a difficult test document.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2009 10th International Conference on Document Analysis and Recognition

自引率

0.00%

发文量