Document Image Registration for Imposed Layer Extraction

ICTACT Journal on Image and Video Processing Pub Date : 2017-02-01 DOI:10.21917/IJIVP.2017.0205

S. Narayan, S. D. Gowda

{"title":"Document Image Registration for Imposed Layer Extraction","authors":"S. Narayan, S. D. Gowda","doi":"10.21917/IJIVP.2017.0205","DOIUrl":null,"url":null,"abstract":"Extraction of filled-in information from document images in the presence of template poses challenges due to geometrical distortion. Filled-in document image consists of null background, general information foreground and vital information imposed layer. Template document image consists of null background and general information foreground layer. In this paper a novel document image registration technique has been proposed to extract imposed layer from input document image. A convex polygon is constructed around the content of the input and the template image using convex hull. The vertices of the convex polygons of input and template are paired based on minimum Euclidean distance. Each vertex of the input convex polygon is subjected to transformation for the permutable combinations of rotation and scaling. Translation is handled by tight crop. For every transformation of the input vertices, Minimum Hausdorff distance (MHD) is computed. Minimum Hausdorff distance identifies the rotation and scaling values by which the input image should be transformed to align it to the template. Since transformation is an estimation process, the components in the input image do not overlay exactly on the components in the template, therefore connected component technique is applied to extract contour boxes at word level to identify partially overlapping components. Geometrical features such as density, area and degree of overlapping are extracted and compared between partially overlapping components to identify and eliminate components common to input image and template image. The residue constitutes imposed layer. Experimental results indicate the efficacy of the proposed model with computational complexity. Experiment has been conducted on variety of filled-in forms, applications and bank cheques. Data sets have been generated as test sets for comparative analysis.","PeriodicalId":30615,"journal":{"name":"ICTACT Journal on Image and Video Processing","volume":"07 1","pages":"1415-1423"},"PeriodicalIF":0.0000,"publicationDate":"2017-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ICTACT Journal on Image and Video Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.21917/IJIVP.2017.0205","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

Abstract

Extraction of filled-in information from document images in the presence of template poses challenges due to geometrical distortion. Filled-in document image consists of null background, general information foreground and vital information imposed layer. Template document image consists of null background and general information foreground layer. In this paper a novel document image registration technique has been proposed to extract imposed layer from input document image. A convex polygon is constructed around the content of the input and the template image using convex hull. The vertices of the convex polygons of input and template are paired based on minimum Euclidean distance. Each vertex of the input convex polygon is subjected to transformation for the permutable combinations of rotation and scaling. Translation is handled by tight crop. For every transformation of the input vertices, Minimum Hausdorff distance (MHD) is computed. Minimum Hausdorff distance identifies the rotation and scaling values by which the input image should be transformed to align it to the template. Since transformation is an estimation process, the components in the input image do not overlay exactly on the components in the template, therefore connected component technique is applied to extract contour boxes at word level to identify partially overlapping components. Geometrical features such as density, area and degree of overlapping are extracted and compared between partially overlapping components to identify and eliminate components common to input image and template image. The residue constitutes imposed layer. Experimental results indicate the efficacy of the proposed model with computational complexity. Experiment has been conducted on variety of filled-in forms, applications and bank cheques. Data sets have been generated as test sets for comparative analysis.

查看原文本刊更多论文

用于强制层提取的文档图像配准

由于几何失真，在存在模板的情况下从文档图像中提取填充信息带来了挑战。填充文档图像由空背景、一般信息前景和重要信息附加层组成。模板文档图像由空背景层和一般信息前台层组成。本文提出了一种新的文档图像配准技术，从输入的文档图像中提取叠加层。使用凸包围绕输入的内容和模板图像构造凸多边形。基于最小欧氏距离对输入和模板的凸多边形的顶点进行配对。对于旋转和缩放的可变组合，对输入凸多边形的每个顶点进行变换。翻译是由紧缩处理的。对于每个输入顶点的变换，计算最小豪斯多夫距离（MHD）。最小豪斯多夫距离标识了旋转和缩放值，输入图像应通过该值进行转换以将其与模板对齐。由于变换是一个估计过程，输入图像中的分量并不完全覆盖在模板中的分量上，因此应用连接分量技术来提取单词级别的轮廓框，以识别部分重叠的分量。提取并比较部分重叠分量之间的几何特征，如密度、面积和重叠程度，以识别和消除输入图像和模板图像共同的分量。残留物构成了加铺层。实验结果表明，在计算复杂度较高的情况下，所提出的模型是有效的。对各种填写的表格、申请表和银行支票进行了实验。已生成数据集作为用于比较分析的测试集。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

ICTACT Journal on Image and Video Processing

自引率

0.00%

发文量

审稿时长

8 weeks