{"title":"Layout and Perspective Distortion Independent Recognition of Captured Chinese Document Image","authors":"Yanwei Wang, Yuefang Sun, Changsong Liu","doi":"10.1109/ICDAR.2017.102","DOIUrl":null,"url":null,"abstract":"This paper introduced a layout and perspective distortion independent recognition framework for captured Chinese document image. Under the framework, 1) Conditional random field (CRF) is employed for text line extraction from a global point of view. As the text line extraction is layout independent it could be widely used in different type of document images 2) A text line image based perspective distortion correction method is detailed and used in three different ways. 3) The text line extraction and perspective distortion correction are combined with character recognition to construct a recognition system. On three captured document image datasets, the proposed framework improves the accuracies from 94.03% to 95.20%, 13.01% to 93.71% and 10.63% to 92.68% respectively for different distortion degrees. The experimental results demonstrate that the introduced recognition framework is promising for solving layout and perspective distortion problems in captured document image recognition.","PeriodicalId":433676,"journal":{"name":"2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDAR.2017.102","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
This paper introduced a layout and perspective distortion independent recognition framework for captured Chinese document image. Under the framework, 1) Conditional random field (CRF) is employed for text line extraction from a global point of view. As the text line extraction is layout independent it could be widely used in different type of document images 2) A text line image based perspective distortion correction method is detailed and used in three different ways. 3) The text line extraction and perspective distortion correction are combined with character recognition to construct a recognition system. On three captured document image datasets, the proposed framework improves the accuracies from 94.03% to 95.20%, 13.01% to 93.71% and 10.63% to 92.68% respectively for different distortion degrees. The experimental results demonstrate that the introduced recognition framework is promising for solving layout and perspective distortion problems in captured document image recognition.
介绍了一种不依赖版面和视角畸变的中文文档图像识别框架。在该框架下,1)从全局角度出发,采用条件随机场(Conditional random field, CRF)进行文本行提取。由于文本线提取与版面无关,可以广泛应用于不同类型的文档图像。2)详细介绍了一种基于文本线图像的透视畸变校正方法,并给出了三种不同的方法。3)将文本行提取和视角畸变校正与字符识别相结合,构建识别系统。在三个捕获的文档图像数据集上,该框架在不同失真程度下将准确率分别从94.03%提高到95.20%、13.01%提高到93.71%和10.63%提高到92.68%。实验结果表明,所引入的识别框架能够很好地解决捕获文档图像识别中的布局和视角失真问题。