Segmentation and validation of commercial documents logical structure

Proceedings International Conference on Information Technology: Coding and Computing (Cat. No.PR00540) Pub Date : 2000-03-27 DOI:10.1109/ITCC.2000.844221

Miguel Diogenes Matrakas, Flávio Bortolozzi

引用次数: 3

Abstract

The main objective of the work is to present an approach to extract and validate the logical structure from the images that compose a commercial document. The nearest neighbor rule algorithm was used for labeling the elements, and the Run Length Smoothing Algorithm (RLSA) was used to segment the image of a commercial document of the type letter, official letter or memo. The most common classes considered are: date, logotype, text body, signature, addressee, invocation and greeting. The labeling of the elements is accomplished using the nearest neighbor rule algorithm with a vector comprising 28 characteristics. The accomplished study presented a good result for the classification of elements on commercial documents. It was created and used a base composed of 283 images of commercial documents in 256 gray levels for document element classification.

查看原文本刊更多论文

商业文档逻辑结构的分割和验证

这项工作的主要目标是提出一种从组成商业文档的图像中提取和验证逻辑结构的方法。使用最近邻规则算法对元素进行标记，并使用运行长度平滑算法(RLSA)对信件、公文或备忘录类型的商业文档图像进行分割。考虑的最常见的类是:日期、标识、文本主体、签名、收件人、调用和问候。使用包含28个特征的向量的最近邻规则算法完成元素的标记。所完成的研究为商业文献的要素分类提供了良好的结果。它创建并使用了一个由283张256级灰度的商业文档图像组成的库来进行文档元素分类。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings International Conference on Information Technology: Coding and Computing (Cat. No.PR00540)

自引率

0.00%

发文量