Alpha-Numerical Sequences Extraction in Handwritten Documents

2010 12th International Conference on Frontiers in Handwriting Recognition Pub Date : 2010-11-16 DOI:10.1109/ICFHR.2010.44

Simon Thomas, Clément Chatelain, L. Heutte, T. Paquet

引用次数: 12

Abstract

In this paper, we introduce an alpha-numerical sequences extraction system (keywords, numerical fields or alpha-numerical sequences) in unconstrained handwritten documents. Contrary to most of the approaches presented in the literature, our system relies on a global handwriting line model describing two kinds of information : i) the relevant information and ii) the irrelevant information represented by a shallow parsing model. The shallow parsing of isolated text lines allows quick information extraction in any document while rejecting at the same time irrelevant information. Results on a public french incoming mails database show the efficiency of the approach.

查看原文本刊更多论文

手写文档中的alpha -数字序列提取

本文介绍了一种无约束手写体文档的字母数字序列提取系统(关键字、数字字段或字母数字序列)。与文献中提出的大多数方法相反，我们的系统依赖于描述两种信息的全局手写线模型:i)相关信息和ii)由浅解析模型表示的不相关信息。对孤立文本行的浅层解析允许在任何文档中快速提取信息，同时拒绝不相关的信息。在一个公共法语邮件数据库上的结果表明了该方法的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2010 12th International Conference on Frontiers in Handwriting Recognition

自引率

0.00%

发文量