2014 11th IAPR International Workshop on Document Analysis Systems: Latest Papers, Page 3

Combining Focus Measure Operators to Predict OCR Accuracy in Mobile-Captured Document Images

2014 11th IAPR International Workshop on Document Analysis Systems Pub Date : 2014-04-07 DOI: 10.1109/DAS.2014.11

Marçal Rusiñol, J. Chazalon, J. Ogier

Abstract: Mobile document image acquisition is a new trend raising serious issues in business document processing workflows. Such a digitization procedure is unreliable and introduces many distortions, which must be detected as soon as possible, on the mobile device, to avoid paying data transmission fees and losing information because a temporarily available document cannot be re-captured later. In this context, out-of-focus blur is a major issue: users have no direct control over it, and it seriously degrades OCR recognition. In this paper, we concentrate on the estimation of focus quality, to ensure sufficient legibility of a document image for OCR processing. We propose two contributions to improve OCR accuracy prediction for mobile-captured document images. First, we present 24 focus measures, never tested on document images, which are fast to compute and require no training. Second, we show that a combination of those measures enables state-of-the-art performance regarding the correlation with OCR accuracy. The resulting approach is fast, robust, and easy to implement on a mobile device. Experiments are performed on a public dataset, and precise details about image processing are given.

Citations: 36
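The abstract of the focus-measure entry above does not name its 24 measures or say how they are combined. As a rough illustration of what a training-free focus measure can look like, the sketch below computes the variance of the Laplacian, a measure commonly used in autofocus work; its use here is an assumption, not a claim about the paper's method. The helper names `laplacian_variance` and `box_blur`, and the synthetic stripe image, are hypothetical.

```python
def laplacian_variance(image):
    """Return the variance of the 4-neighbour Laplacian over interior pixels.

    `image` is a list of equal-length rows of grayscale values.
    Higher values suggest sharper edges; the scale is image-dependent.
    """
    height, width = len(image), len(image[0])
    responses = [
        image[y - 1][x] + image[y + 1][x] + image[y][x - 1] + image[y][x + 1]
        - 4 * image[y][x]
        for y in range(1, height - 1)
        for x in range(1, width - 1)
    ]
    if not responses:
        return 0.0
    mean = sum(responses) / len(responses)
    return sum((r - mean) ** 2 for r in responses) / len(responses)


def box_blur(image):
    """3x3 mean filter with edge clamping, used here only to simulate defocus."""
    height, width = len(image), len(image[0])
    blurred = []
    for y in range(height):
        row = []
        for x in range(width):
            total = 0
            for dy in (-1, 0, 1):
                for dx in (-1, 0, 1):
                    yy = min(max(y + dy, 0), height - 1)
                    xx = min(max(x + dx, 0), width - 1)
                    total += image[yy][xx]
            row.append(total / 9)
        blurred.append(row)
    return blurred


# Synthetic stand-in for a text image: vertical stripes two pixels wide.
sharp = [[255 if (x // 2) % 2 else 0 for x in range(16)] for _ in range(16)]
blurred = box_blur(sharp)
flat = [[128] * 16 for _ in range(16)]

print(laplacian_variance(sharp) > laplacian_variance(blurred))
```

The score is only meaningful relative to other captures of similar content; turning it into an OCR accuracy prediction would need the kind of correlation study the abstract describes.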

A Novel Learning-Free Word Spotting Approach Based on Graph Representation

2014 11th IAPR International Workshop on Document Analysis Systems Pub Date : 2014-04-07 DOI: 10.1109/DAS.2014.46

P. Wang, V. Eglin, Christophe Garcia, C. Largeron, J. Lladós, A. Fornés

Citations: 39

A Seed-Based Segmentation Method for Scene Text Extraction

2014 11th IAPR International Workshop on Document Analysis Systems Pub Date : 2014-04-07 DOI: 10.1109/DAS.2014.34

Bo Bai, Fei Yin, Cheng-Lin Liu

Citations: 22

Historical Chinese Character Recognition Method Based on Style Transfer Mapping

2014 11th IAPR International Workshop on Document Analysis Systems Pub Date : 2014-04-07 DOI: 10.1109/DAS.2014.33

Bohan Li, Liangrui Peng, Jingning Ji

Citations: 17

Ground-Truth Production in the Transcriptorium Project

2014 11th IAPR International Workshop on Document Analysis Systems Pub Date : 2014-04-07 DOI: 10.1109/DAS.2014.23

B. Gatos, G. Louloudis, T. Causer, Kris Grint, Verónica Romero, Joan Andreu Sánchez, A. Toselli, E. Vidal

Citations: 39

The A2iA Arabic Handwritten Text Recognition System at the Open HaRT2013 Evaluation

2014 11th IAPR International Workshop on Document Analysis Systems Pub Date : 2014-04-07 DOI: 10.1109/DAS.2014.40

Théodore Bluche, J. Louradour, Maxime Knibbe, Bastien Moysset, Mohamed Benzeghiba, Christopher Kermorvant

Citations: 45

Color Descriptor for Content-Based Drawing Retrieval

2014 11th IAPR International Workshop on Document Analysis Systems Pub Date : 2014-04-07 DOI: 10.1109/DAS.2014.70

Christophe Rigaud, Dimosthenis Karatzas, J. Burie, J. Ogier

Citations: 14

A New Laplacian Method for Arbitrarily-Oriented Word Segmentation in Video

2014 11th IAPR International Workshop on Document Analysis Systems Pub Date : 2014-04-07 DOI: 10.1109/DAS.2014.21

P. Shivakumara, M. Suhil, D. S. Guru, C. Tan

Abstract: Word segmentation from video text lines is challenging because video poses several difficulties, such as complex background, low resolution, and arbitrary orientation. Besides, word segmentation is essential for improving text recognition accuracy. Therefore, we propose a novel method for segmenting words by exploring zero crossing points for each sliding window over the text line. The candidate zero crossing points are defined based on characteristics of positive and negative Laplacian values at text and non-text regions. The percentage of candidate zero crossing points is calculated for each sliding window and is used for identifying the seed window that represents the space between words. For the seed window, we propose a novel idea of horizontal and vertical sampling based on the percentage values to estimate the width and the height of the word spacing. Then the width and the height of the word spacing are used to validate the actual word spacing. Experimental results comparing with an existing method show that the proposed method is better than the existing method in terms of recall, precision, and F-measure on curved, horizontal, non-horizontal, and Hua's video data, as well as ICDAR data. We also test it on our own data containing multiscript text lines to show the robustness of the proposed method.

Citations: 4
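The sliding-window idea in the abstract of the Laplacian word-segmentation entry above can be illustrated with a heavily simplified 1-D sketch. The abstract does not give the paper's definition of candidate zero crossing points or its rule for choosing the seed window, so everything below is an assumption: a crossing is taken to be a strict sign change between adjacent Laplacian values, and the window with the lowest crossing fraction is treated as a gap candidate. The function names and the synthetic intensity profile are hypothetical, and the paper's horizontal and vertical sampling step is not modelled.

```python
def laplacian_1d(profile):
    """Discrete second difference of a 1-D intensity profile (interior samples)."""
    return [
        profile[i - 1] - 2 * profile[i] + profile[i + 1]
        for i in range(1, len(profile) - 1)
    ]


def zero_crossing_fraction(values):
    """Fraction of adjacent pairs whose signs are strictly opposite."""
    pairs = len(values) - 1
    if pairs <= 0:
        return 0.0
    crossings = sum(1 for a, b in zip(values, values[1:]) if a * b < 0)
    return crossings / pairs


def window_scores(profile, width, step=1):
    """Score each sliding window of Laplacian values by its crossing fraction."""
    lap = laplacian_1d(profile)
    return [
        (start, zero_crossing_fraction(lap[start:start + width]))
        for start in range(0, len(lap) - width + 1, step)
    ]


def seed_window(profile, width):
    """Assumed rule: the window with the fewest crossings is a gap candidate."""
    return min(window_scores(profile, width), key=lambda item: item[1])


# Two stroke-like "words" (alternating dark and bright) around a flat gap.
word = [0, 255] * 5
gap = [255] * 10
profile = word + gap + word

start, score = seed_window(profile, width=6)
print(start, score)
```

In this toy profile the stroke regions change Laplacian sign at every step while the flat gap does not, which is why the lowest-scoring window falls inside the gap; real video frames would need the 2-D treatment and validation steps the abstract mentions.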

Evaluation of Texture Features for Offline Arabic Writer Identification

2014 11th IAPR International Workshop on Document Analysis Systems Pub Date : 2014-04-07 DOI: 10.1109/DAS.2014.76

Chawki Djeddi, L. Souici-Meslati, I. Siddiqi, A. Ennaji, H. E. Abed, A. Gattal

Citations: 17

A Context Based Text Summarization System

2014 11th IAPR International Workshop on Document Analysis Systems Pub Date : 2014-04-07 DOI: 10.1109/DAS.2014.19

Rafael Ferreira, F. Freitas, L. Cabral, R. Lins, Rinaldo Lima, G. Silva, S. Simske, Luciano Favaro

Citations: 61