{"title":"A PHOC Decoder for Lexicon-Free Handwritten Word Recognition","authors":"Giorgos Sfikas, George Retsinas, B. Gatos","doi":"10.1109/ICDAR.2017.90","DOIUrl":"https://doi.org/10.1109/ICDAR.2017.90","url":null,"abstract":"In this paper, we propose a novel probabilistic model for lexicon-free handwriting recognition. Model inputs are word images encoded as Pyramidal Histogram Of Character (PHOC) vectors. PHOC vectors have been used as efficient attribute-based, multi-resolution representations of either text strings or word image contents. The proposed model formulates PHOC decoding as the problem of finding the most probable sequence of characters corresponding to the given PHOC. We model PHOC layers as Beta-distributed observations, linked to hidden states that correspond to character estimates. Characters are in turn linked to one another along a Markov chain, encoding language model information. The sequence of characters is estimated using the max-sum algorithm in a process that is akin to Viterbi decoding. Numerical experiments on the well-known George Washington database show competitive recognition results.","PeriodicalId":433676,"journal":{"name":"2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117163989","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Graph-Based Deep Learning for Graphics Classification","authors":"Pau Riba, Anjan Dutta, J. Lladós, A. Fornés","doi":"10.1109/ICDAR.2017.262","DOIUrl":"https://doi.org/10.1109/ICDAR.2017.262","url":null,"abstract":"Graph-based representations are a common way to deal with graphics recognition problems. However, previous works were mainly focused on developing learning-free techniques. The success of deep learning frameworks has proved that learning is a powerful tool to solve many problems; however, it is not straightforward to extend these methodologies to non-Euclidean data such as graphs. On the other hand, graphs are a good representational structure for graphical entities. In this work, we present some deep learning techniques that have been proposed in the literature for graph-based representations, and we show how they can be used in graphics recognition problems.","PeriodicalId":433676,"journal":{"name":"2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125760230","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Semantic Text Detection in Born-Digital Images via Fully Convolutional Networks","authors":"Nibal Nayef, J. Ogier","doi":"10.1109/ICDAR.2017.145","DOIUrl":"https://doi.org/10.1109/ICDAR.2017.145","url":null,"abstract":"Traditional layout analysis methods cannot be easily adapted to born-digital images, which carry properties from both regular document images and natural scene images. One layout approach for analyzing born-digital images is to separate the text layer from the graphics layer before further analyzing either of them. In this paper, we propose a method for detecting text regions in such images by casting the detection problem as a semantic object segmentation problem. The text classification is done in a holistic approach using fully convolutional networks, where the full image is fed as input to the network and the output is a pixel heat map of the same size as the input image. This solves the problem of low-resolution images and the variability of text scale within one image. It also eliminates the need for finding interest points, candidate text locations or low-level components. The experimental evaluation of our method on the ICDAR 2013 dataset shows that our method outperforms state-of-the-art methods. The detected text regions also allow flexibility to later apply methods for finding text components at character, word or textline levels in different orientations.","PeriodicalId":433676,"journal":{"name":"2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124715059","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Detection and Recognition of Arabic Text in Video Frames","authors":"W. Ohyama, Seiya Iwata, T. Wakabayashi, F. Kimura","doi":"10.1109/ICDAR.2017.360","DOIUrl":"https://doi.org/10.1109/ICDAR.2017.360","url":null,"abstract":"The authors have developed an end-to-end system for Arabic text recognition in video frames. The end-to-end system consists of the steps of text-line detection, word segmentation and word recognition. In order to achieve high text recognition accuracy, we propose a new integrated text detection-recognition scheme, where the true text-lines are detected with as high a recall rate as possible and the false words in the false lines are rejected in the subsequent word recognition step. We previously reported a recognition-based transition frame detection of Arabic news captions in single-channel video images. In this paper, the recognition system is integrated with an n-gram language model and extended to text detection/recognition of multi-channel video images. The multi-channel, multi-font performance of the system is experimentally evaluated using the AcTiV-D and AcTiV-R datasets. The multi-channel text detection performance for three channels, France24, Russia Today and TunisiaNat1, is 91.29% in F-measure. The multi-channel, multi-font character recognition performance for these channels is 94.84% in F-measure.","PeriodicalId":433676,"journal":{"name":"2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR)","volume":"177 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124746405","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Local Enlacement Histograms for Historical Drop Caps Style Recognition","authors":"Michaël Clément, Mickaël Coustaty, Camille Kurtz, L. Wendling","doi":"10.1109/ICDAR.2017.57","DOIUrl":"https://doi.org/10.1109/ICDAR.2017.57","url":null,"abstract":"This article focuses on the specific issue of drop caps image recognition in the context of cultural heritage preservation. Due to their heterogeneity and their weakly structured properties, these historical images represent challenging data. An important aspect in the recognition process of drop caps is their background styles, which can be considered as discriminative features to identify both the printer and the period. Most existing methods for style recognition are based on low-level features such as color or texture properties. In this article, we present a novel framework for the recognition of drop caps style based on higher-level features. We propose to capture the spatial structure carried by these images using relative position descriptors modeling the enlacement between local cells of pixel layers obtained from a document segmentation step. Such descriptors are then exploited in an efficient bag-of-features learning procedure. Experimental results obtained on a dataset of historical drop caps images highlight the interest of this approach, and in particular the benefit of considering spatial information.","PeriodicalId":433676,"journal":{"name":"2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR)","volume":"77 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128735497","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Nonlinear Manifold Embedding on Keyword Spotting Using t-SNE","authors":"George Retsinas, N. Stamatopoulos, G. Louloudis, Giorgos Sfikas, B. Gatos","doi":"10.1109/ICDAR.2017.86","DOIUrl":"https://doi.org/10.1109/ICDAR.2017.86","url":null,"abstract":"Nonlinear manifold embedding has attracted considerable attention due to its highly-desired property of efficiently encoding local structure, i.e. intrinsic space properties, into a low-dimensional space. The benefit of such an approach is twofold: it leads to compact representations while addressing the often-encountered curse of dimensionality. The latter plays an important role in retrieval applications, such as keyword spotting, where a sorted list of retrieved objects with respect to a distance metric is required. In this work, we explore the efficiency of the popular manifold embedding method t-distributed Stochastic Neighbor Embedding (t-SNE) on the Query-by-Example keyword spotting task. The main contribution of this work is the extension of t-SNE in order to support out-of-sample (OOS) embedding which is essential for mapping query images to the embedding space. The experimental results demonstrate a significant increase in keyword spotting performance when the word similarity is calculated on the embedding space.","PeriodicalId":433676,"journal":{"name":"2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128645402","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A GRU-Based Encoder-Decoder Approach with Attention for Online Handwritten Mathematical Expression Recognition","authors":"Jianshu Zhang, Jun Du, Lirong Dai","doi":"10.1109/ICDAR.2017.152","DOIUrl":"https://doi.org/10.1109/ICDAR.2017.152","url":null,"abstract":"In this study, we present a novel end-to-end approach based on the encoder-decoder framework with the attention mechanism for online handwritten mathematical expression recognition (OHMER). First, the input two-dimensional ink trajectory information of handwritten expression is encoded via the gated recurrent unit based recurrent neural network (GRU-RNN). Then the decoder is also implemented by the GRU-RNN with a coverage-based attention model. The proposed approach can simultaneously accomplish the symbol recognition and structural analysis to output a character sequence in LaTeX format. Validated on the CROHME 2014 competition task, our approach significantly outperforms the state-of-the-art with an expression recognition accuracy of 52.43% by only using the official training dataset. Furthermore, the alignments between the input trajectories of handwritten expressions and the output LaTeX sequences are visualized by the attention mechanism to show the effectiveness of the proposed method.","PeriodicalId":433676,"journal":{"name":"2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131027896","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Lexicographical-Based Order for Post-OCR Correction of Named Entities","authors":"Axel Jean-Caurant, Nouredine Tamani, V. Courboulay, J. Burie","doi":"10.1109/ICDAR.2017.197","DOIUrl":"https://doi.org/10.1109/ICDAR.2017.197","url":null,"abstract":"We are in the era of information access, in which a huge amount of text is extracted from scanned documents and made available digitally to be used in search processes. However, old or poorly scanned documents suffer from bad recognition, which leads not only to imperfect Optical Character Recognition (OCR), but also to poor indexing and inaccessible information. To cope with the aforementioned issues, we introduce in this paper a lexicographical-based approach for post-OCR correction applied to named entities. By lexicographically combining a contextual similarity and an edit distance, the approach builds a graph connecting similar named entities, in order to automatically correct the corresponding OCR-processed text. We evaluated our approach on a generated dataset. The first results obtained showed that, despite the high level of degradation of the text, the approach succeeded in correcting more than a third of the named entities without the need for any external knowledge.","PeriodicalId":433676,"journal":{"name":"2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR)","volume":"197 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123730181","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Evaluation of Texture Descriptors for Validation of Counterfeit Documents","authors":"Albert Berenguel Centeno, O. R. Terrades, Josep Lladós Canet, Cristina Cañero Morales","doi":"10.1109/ICDAR.2017.204","DOIUrl":"https://doi.org/10.1109/ICDAR.2017.204","url":null,"abstract":"This paper describes an exhaustive comparative analysis and evaluation of different existing texture descriptor algorithms to differentiate between genuine and counterfeit documents. We include in our experiments different categories of algorithms and compare them in different scenarios with several counterfeit datasets, comprising banknotes and identity documents. Computational time in the extraction of each descriptor is important because the final objective is to use it in a real industrial scenario. HoG- and CNN-based descriptors stand out statistically over the rest in terms of the F1-score/time ratio performance.","PeriodicalId":433676,"journal":{"name":"2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130891934","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Into the Colorful World of Webtoons: Through the Lens of Neural Networks","authors":"Ceyda Cinarel, Byoung-Tak Zhang","doi":"10.1109/ICDAR.2017.289","DOIUrl":"https://doi.org/10.1109/ICDAR.2017.289","url":null,"abstract":"The task of colorizing black-and-white images has previously been explored for natural images. In this paper we look at the task of colorization on a different domain: webtoons. To our knowledge this type of dataset has not been used before. Webtoons are usually produced in color, so they make a good dataset for analyzing different colorization models. Comics like webtoons also present some additional challenges over natural images, such as occlusion by speech bubbles and text. First we look at the performance of some previously introduced models on this task and suggest modifications to address their problems. We propose a new model composed of two networks: one network generates sparse color information, and a second network uses this generated color information as input to apply color to the whole image. These two networks are trained end-to-end. Our proposed model solves some of the problems observed with other architectures, resulting in better colorizations.","PeriodicalId":433676,"journal":{"name":"2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR)","volume":"79 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123206492","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}