2014 11th IAPR International Workshop on Document Analysis Systems最新文献_第7页

Forgery Detection Based on Intrinsic Document Contents 基于文档内在内容的伪造检测

2014 11th IAPR International Workshop on Document Analysis Systems Pub Date : 2014-04-07 DOI: 10.1109/DAS.2014.26

Amr Gamal Hamed Ahmed, F. Shafait

引用次数: 28

Adapting Tesseract for Complex Scripts: An Example for Urdu Nastalique 为复杂的脚本改编Tesseract:以乌尔都语Nastalique为例

2014 11th IAPR International Workshop on Document Analysis Systems Pub Date : 2014-04-07 DOI: 10.1109/DAS.2014.45

Q. Akram, S. Hussain, A. Niazi, Umair Anjum, Faheem Irfan

引用次数: 32

A Combined System for Text Line Extraction and Handwriting Recognition in Historical Documents 历史文献文本行提取与手写识别的组合系统

2014 11th IAPR International Workshop on Document Analysis Systems Pub Date : 2014-04-07 DOI: 10.1109/DAS.2014.51

Andreas Fischer, M. Baechler, A. Garz, M. Liwicki, R. Ingold

引用次数: 24

Word-Graph Based Handwriting Key-Word Spotting: Impact of Word-Graph Size on Performance 基于词图的手写关键词识别:词图大小对性能的影响

2014 11th IAPR International Workshop on Document Analysis Systems Pub Date : 2014-04-07 DOI: 10.1109/DAS.2014.65

A. Rossi, E. Vidal

引用次数: 9

Business Forms Classification Using Earth Mover's Distance 利用推土机的距离对业务形式进行分类

2014 11th IAPR International Workshop on Document Analysis Systems Pub Date : 2014-04-07 DOI: 10.1109/DAS.2014.59

S. S. Bukhari, Markus Ebbecke, M. Gillmann

引用次数: 3

Text Line Segmentation Based on Matched Filtering and Top-Down Grouping for Handwritten Documents 基于匹配过滤和自顶向下分组的手写文档文本行分割

2014 11th IAPR International Workshop on Document Analysis Systems Pub Date : 2014-04-07 DOI: 10.1109/DAS.2014.14

Youbao Tang, Xiangqian Wu, Wei Bu

引用次数: 7

Efficient Example-Based Super-Resolution of Single Text Images Based on Selective Patch Processing 基于选择性Patch处理的高效基于样例的单幅文本图像超分辨率

2014 11th IAPR International Workshop on Document Analysis Systems Pub Date : 2014-04-07 DOI: 10.1109/DAS.2014.25

Nibal Nayef, J. Chazalon, Petra Gomez-Krämer, J. Ogier

{"title":"Efficient Example-Based Super-Resolution of Single Text Images Based on Selective Patch Processing","authors":"Nibal Nayef, J. Chazalon, Petra Gomez-Krämer, J. Ogier","doi":"10.1109/DAS.2014.25","DOIUrl":"https://doi.org/10.1109/DAS.2014.25","url":null,"abstract":"Example-based super-resolution (SR) methods learn the correspondences between low resolution (LR) and high-resolution (HR) image patches, where the patches are extracted from a training database. To reconstruct a single LR image into a HR one, each LR image patch is processed by the previously trained model to recover its corresponding HR patch. For this reason, they are computationally inefficient. We propose the use of a selective patch processing technique to carry out the super-resolution step more efficiently, while maintaining the output quality. In this technique, only patches of high variance are processed by the costly reconstruction steps, while the rest of the patches are processed by fast bicubic interpolation. We have applied the proposed improvement on representative example-based SR methods to super-resolve text images. The results show a significant speed up for text SR without a drop in theocrat accuracy. In order to carry out an extensive and solid performance evaluation, we also present a public database of text images for training and testing example-based SR methods.","PeriodicalId":220495,"journal":{"name":"2014 11th IAPR International Workshop on Document Analysis Systems","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-04-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125985178","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 16

Text Classification via iVector Based Feature Representation 基于向量特征表示的文本分类

2014 11th IAPR International Workshop on Document Analysis Systems Pub Date : 2014-04-01 DOI: 10.1109/DAS.2014.10

Shengxin Zha, Xujun Peng, Huaigu Cao, Xiaodan Zhuang, P. Natarajan, P. Natarajan

{"title":"Text Classification via iVector Based Feature Representation","authors":"Shengxin Zha, Xujun Peng, Huaigu Cao, Xiaodan Zhuang, P. Natarajan, P. Natarajan","doi":"10.1109/DAS.2014.10","DOIUrl":"https://doi.org/10.1109/DAS.2014.10","url":null,"abstract":"In this paper, we address the problem of text classification: classifying modern machine-printed text, handwritten text and historical typewritten text from degraded noisy documents. We propose a novel text classification approach based on iVector, a newly developed concept in speaker verification. To a given text line, the iVector is a fixed-length feature vector representation, transformed from a high-dimensional super vector based on means of Gaussian mixture model (GMM), where the text dependent component is separated from a universal background model (UBM) and can be represented by a low dimensional set of factors. We classify the text lines with a discriminative classifier - support vector machine (SVM) in iVector space. A baseline approach of text classification using GMM in feature space is also presented for evaluation purpose. Experimental results on an Arabic document database show accuracy of 92.04% for text line classification using the proposed method. Furthermore, the relative word error rate (WER) of 9.6% is decreased in optical character recognition (OCR) when coupled with the proposed iVector-SVM classifier. The proposed iVector-SVM approach is language independent, thus, can be applied to other scripts as well.","PeriodicalId":220495,"journal":{"name":"2014 11th IAPR International Workshop on Document Analysis Systems","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121247170","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 4

A Cache Language Model for Whole Document Handwriting Recognition 一种全文档手写识别的缓存语言模型

2014 11th IAPR International Workshop on Document Analysis Systems Pub Date : 2014-04-01 DOI: 10.1109/DAS.2014.56

Volkmar Frinken, Dimosthenis Karatzas, Andreas Fischer

引用次数: 1

A System for Recognizing Online Handwritten Mathematical Expressions and Improvement of Structure Analysis 一个在线手写数学表达式识别系统及结构分析的改进

2014 11th IAPR International Workshop on Document Analysis Systems Pub Date : 2014-04-01 DOI: 10.1109/DAS.2014.52

A. D. Le, T. V. Phan, M. Nakagawa

引用次数: 26