2014 11th IAPR International Workshop on Document Analysis Systems最新文献_第4页

Performance Improvement in Local Feature Based Camera-Captured Character Recognition 基于局部特征的相机捕捉字符识别性能改进

2014 11th IAPR International Workshop on Document Analysis Systems Pub Date : 2014-04-07 DOI: 10.1109/DAS.2014.78

Takahiro Matsuda, M. Iwamura, K. Kise

引用次数: 5

Flexible Noisy Text Correction 灵活的噪声文本校正

2014 11th IAPR International Workshop on Document Analysis Systems Pub Date : 2014-04-07 DOI: 10.1109/DAS.2014.12

Andrey C. Sariev, Vladislav Nenchev, Stefan Gerdjikov, Petar Mitankin, Hristo Ganchev, S. Mihov, Tinko Tinchev

引用次数: 6

Spotting Symbol Using Sparsity over Learned Dictionary of Local Descriptors 利用局部描述符学习字典上的稀疏性来定位符号

2014 11th IAPR International Workshop on Document Analysis Systems Pub Date : 2014-04-07 DOI: 10.1109/DAS.2014.62

T. Do, S. Tabbone, O. R. Terrades

引用次数: 3

A Hierarchical Framework for Accent Based Writer Identification 基于口音的作者识别层次框架

2014 11th IAPR International Workshop on Document Analysis Systems Pub Date : 2014-04-07 DOI: 10.1109/DAS.2014.69

Chetan Ramaiah, V. Govindaraju

引用次数: 2

A Study to Achieve Manga Character Retrieval Method for Manga Images 基于漫画图像的漫画字符检索方法研究

2014 11th IAPR International Workshop on Document Analysis Systems Pub Date : 2014-04-07 DOI: 10.1109/DAS.2014.60

M. Iwata, Atsushi Ito, K. Kise

引用次数: 11

Multilingual Off-Line Handwriting Recognition in Real-World Images 真实世界图像中的多语言离线手写识别

2014 11th IAPR International Workshop on Document Analysis Systems Pub Date : 2014-04-07 DOI: 10.1109/DAS.2014.8

M. Kozielski, P. Doetsch, M. Hamdani, H. Ney

引用次数: 14

Context-Dependent Confusions Rules for Building Error Model Using Weighted Finite State Transducers for OCR Post-Processing 基于上下文的模糊规则建立基于加权有限状态传感器的OCR后处理误差模型

2014 11th IAPR International Workshop on Document Analysis Systems Pub Date : 2014-04-07 DOI: 10.1109/DAS.2014.75

M. A. Azawi, T. Breuel

{"title":"Context-Dependent Confusions Rules for Building Error Model Using Weighted Finite State Transducers for OCR Post-Processing","authors":"M. A. Azawi, T. Breuel","doi":"10.1109/DAS.2014.75","DOIUrl":"https://doi.org/10.1109/DAS.2014.75","url":null,"abstract":"In this paper, we propose a new technique to correct the OCR errors by means of weighted finite state transducers(WFST) with context-dependent confusion rules. We translate the OCR confusions which appear in the recognition outputs into edit operations, e.g. insertions, deletions and substitutions using Levenshtein edit distance algorithm. The edit operations are extracted in a form of rules with respect to the context of the incorrect string to build an error model using weighted finite state transducers. The context-dependent rules help to fit the rule in the appropriate strings. Our new error model avoids the calculations that occur in searching the language model and it also makes the language model eligible to correct incorrect words by using context-dependent confusion rules. Our approach is language independent. It designed to deal with different number of errors. It has no limited words size. In the set of experiments conducted on the ocred pages from the UWIII dataset, our new proposed error model outperforms. The evaluation shows the error rate of our model on the UWIII testset is 0.68%, while the baseline is 1.14% and the error rate of the existing state-of-the-art single character rules-based approach is 1.0%.","PeriodicalId":220495,"journal":{"name":"2014 11th IAPR International Workshop on Document Analysis Systems","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-04-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124702184","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 7

A Complete Logo Detection/Recognition System for Document Images 一个完整的标识检测/识别系统的文件图像

2014 11th IAPR International Workshop on Document Analysis Systems Pub Date : 2014-04-07 DOI: 10.1109/DAS.2014.79

Alireza Alaei, Mathieu Delalandre

{"title":"A Complete Logo Detection/Recognition System for Document Images","authors":"Alireza Alaei, Mathieu Delalandre","doi":"10.1109/DAS.2014.79","DOIUrl":"https://doi.org/10.1109/DAS.2014.79","url":null,"abstract":"In this paper, a complete logo detection/ recognition system for document images is proposed. In the proposed system, first, a logo detection method is employed to detect a few regions of interest (logo-patches), which likely contain the logo(s), in a document image. The detection method is based on the piece-wise painting algorithm (PPA) and some probability features along with a decision tree. For the logo recognition, a template based recognition approach is proposed to recognize the logo which may present in every detected logo-patch. The proposed logo recognition strategy uses a search space reduction technique to decrease the number of template logo-models needed for the recognition of a logo in a detected logo-patch. The features used for search space reduction are based on the geometric properties of a detected logo-patch. Based on our experimentations on 1290 document images of Tobacco800 dataset, 99.31% of the logos were detected as logo-patches. Among the detected logo-patches 97.90% of logos were fairly recognized. Considering both logo detection and recognition results, 97.22% of the logos in the document images could truly be detected/recognized as the overall performance of the proposed system.","PeriodicalId":220495,"journal":{"name":"2014 11th IAPR International Workshop on Document Analysis Systems","volume":"69 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-04-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125118781","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 28

The Influence of Language Orthographic Characteristics on Digital Word Recognition 语言正字法特征对数字词识别的影响

2014 11th IAPR International Workshop on Document Analysis Systems Pub Date : 2014-04-07 DOI: 10.1093/llc/fqu051

Ofer Biller, Jihad El-Sana, K. Kedem

引用次数: 6

Towards a Robust OCR System for Indic Scripts 面向印度文字的健壮OCR系统

2014 11th IAPR International Workshop on Document Analysis Systems Pub Date : 2014-04-07 DOI: 10.1109/DAS.2014.74

Praveen Krishnan, Naveen Sankaran, A. Singh, C. V. Jawahar

{"title":"Towards a Robust OCR System for Indic Scripts","authors":"Praveen Krishnan, Naveen Sankaran, A. Singh, C. V. Jawahar","doi":"10.1109/DAS.2014.74","DOIUrl":"https://doi.org/10.1109/DAS.2014.74","url":null,"abstract":"The current Optical Character Recognition OCR systems for Indic scripts are not robust enough for recognizing arbitrary collection of printed documents. Reasons for this limitation includes the lack of resources (e.g. not enough examples with natural variations, lack of documentation available about the possible font/style variations) and the architecture which necessitates hard segmentation of word images followed by an isolated symbol recognition. Variations among scripts, latent symbol to UNICODE conversion rules, non-standard fonts/styles and large degradations are some of the major reasons for the unavailability of robust solutions. In this paper, we propose a web based OCR system which (i) follows a unified architecture for seven Indian languages, (ii) is robust against popular degradations, (iii) follows a segmentation free approach, (iv) addresses the UNICODE re-ordering issues, and (v) can enable continuous learning with user inputs and feedbacks. Our system is designed to aid the continuous learning while being usable i.e., we capture the user inputs (say example images) for further improving the OCRs. We use the popular BLSTM based transcription scheme to achieve our target. This also enables incremental training and refinement in a seamless manner. We report superior accuracy rates in comparison with the available OCRs for the seven Indian languages.","PeriodicalId":220495,"journal":{"name":"2014 11th IAPR International Workshop on Document Analysis Systems","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-04-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132877344","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 28