2014 11th IAPR International Workshop on Document Analysis Systems最新文献_第5页

Recognition of Handwritten Mathematical Characters on Whiteboards Using Colour Images 利用彩色图像识别白板上手写的数学字符

2014 11th IAPR International Workshop on Document Analysis Systems Pub Date : 2014-04-07 DOI: 10.1109/DAS.2014.66

Behrang Sabeghi Saroui, V. Sorge

{"title":"Recognition of Handwritten Mathematical Characters on Whiteboards Using Colour Images","authors":"Behrang Sabeghi Saroui, V. Sorge","doi":"10.1109/DAS.2014.66","DOIUrl":"https://doi.org/10.1109/DAS.2014.66","url":null,"abstract":"Automatic handwriting recognition has enjoyed significant improvements in the past decades. In particular, online recognition of mathematical formulas has seen a number of important advancements both for pen input devices as well as for smart boards. However, in reality most mathematics is still taught and developed on regular whiteboards and that the offline recognition still remains a challenging task. In this paper we are therefore concerned with the offline recognition of handwritten notes on whiteboards, presenting a novel way of transforming offline data via image analysis into equivalent online data. We use trajectory recovery techniques and statistical classification on high quality colour images to extract information on the strokes composing a character, such as start or end points and stroke direction. This data is then appropriately prepared and passed to an online character recogniser specialising on mathematical characters for the actual recognition task. We demonstrate the effectiveness of our new technique with experiments on a collection of 1000 whiteboard images of different mathematical symbols, Latin and Greek characters that have been obtained from a variety of writers using different types of pens.","PeriodicalId":220495,"journal":{"name":"2014 11th IAPR International Workshop on Document Analysis Systems","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-04-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133549291","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3

A Two Level Algorithm for Text Detection in Natural Scene Images 自然场景图像文本检测的两级算法

2014 11th IAPR International Workshop on Document Analysis Systems Pub Date : 2014-04-07 DOI: 10.1109/DAS.2014.41

Li Rong, Suyu Wang, Zhixin Shi

{"title":"A Two Level Algorithm for Text Detection in Natural Scene Images","authors":"Li Rong, Suyu Wang, Zhixin Shi","doi":"10.1109/DAS.2014.41","DOIUrl":"https://doi.org/10.1109/DAS.2014.41","url":null,"abstract":"In this paper we present a two-level method to detect text in natural scene images. In the first level, connected components (referred as CCs) are got from the images. Then candidate text lines are extracted and groups of connected components that align in horizontal or vertical direction are got. We think CCs in these groups have high probability are texts. To validate which CC is text, a SVM is trained to make an initial decision. The output of SVM is calibrated to posterior probability. Then we use the information of posterior probability of SVM and information of whether the connected component is in a group to divide the connected components into four classes: texts, non-texts, probable texts and undetermined CCs. In the second level, a conditional random field model is used to make final decision. Relationship between CCs is modeled by a network G(V, E), Vertices of the graph correspond to CCs. The determination in the first level will influence the second levels determination by giving different parameters of data term for the four classes of CCs. By this way, we not only use information of a single CCs feature, but also use the information of whether a CC is in a group to make final decision of whether the CC is text or non-text. Experiments show that the method is effective.","PeriodicalId":220495,"journal":{"name":"2014 11th IAPR International Workshop on Document Analysis Systems","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-04-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130424321","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 17

Separation of Graphics (Superimposed) and Scene Text in Video Frames 视频帧中图形(叠加)和场景文本的分离

2014 11th IAPR International Workshop on Document Analysis Systems Pub Date : 2014-04-07 DOI: 10.1109/DAS.2014.20

P. Shivakumara, N. V. Kumar, D. S. Guru, C. Tan

{"title":"Separation of Graphics (Superimposed) and Scene Text in Video Frames","authors":"P. Shivakumara, N. V. Kumar, D. S. Guru, C. Tan","doi":"10.1109/DAS.2014.20","DOIUrl":"https://doi.org/10.1109/DAS.2014.20","url":null,"abstract":"The presence of both graphics and scene text in video frames makes text detection and recognition problem more challenging because the nature of the two texts differs significantly. This paper aims to propose a novel method for separation of graphics and scene text to achieve good recognition rate based on the fact that Canny and Sobel edge pattern share common property for text. We propose to use Ring Radius Transform to identify the radius that represents the medial axis in the edge image. We study the intra relationship between bins of the histograms over respective radius values, resulting in intra line graphs. In this way, the method finds intra line graphs for both Canny and Sobel edge images of the input text lines. To identify the unique distribution for separation of graphics and scene texts, we explore the inter relationship between intra line graphs of Canny and Sobel edge image with respective medial axes values. This results in Gaussian distribution for graphics and non-Gaussian for scene text. Experimental results on horizontal, non-horizontal, different scripts etc. show that the proposed method is effective for classification and the results of baseline recognition methods show that recognition rate is significantly improved after classification.","PeriodicalId":220495,"journal":{"name":"2014 11th IAPR International Workshop on Document Analysis Systems","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-04-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133891397","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 15

The Maurdor Project: Improving Automatic Processing of Digital Documents 毛铎项目:改进数字文档的自动处理

2014 11th IAPR International Workshop on Document Analysis Systems Pub Date : 2014-04-07 DOI: 10.1109/DAS.2014.58

S. Brunessaux, Patrick Giroux, B. Grilhères, M. Manta, Maylis Bodin, K. Choukri, Olivier Galibert, Juliette Kahn

引用次数: 47

Local Binary Patterns for Arabic Optical Font Recognition 阿拉伯文光学字体识别的局部二进制模式

2014 11th IAPR International Workshop on Document Analysis Systems Pub Date : 2014-04-07 DOI: 10.1109/DAS.2014.71

Anguelos Nicolaou, Fouad Slimane, V. Märgner, M. Liwicki

{"title":"Local Binary Patterns for Arabic Optical Font Recognition","authors":"Anguelos Nicolaou, Fouad Slimane, V. Märgner, M. Liwicki","doi":"10.1109/DAS.2014.71","DOIUrl":"https://doi.org/10.1109/DAS.2014.71","url":null,"abstract":"Optical Font Recognition (OFR) has been proven to increase Optical Character Recognition (OCR) accuracy, but it can also help in harvesting semantic information from documents. It therefore becomes a part of many Document Image Analysis (DIA) pipelines. Our work is based on the hypothesis that Local Binary Patterns (LBP), as a generic texture classification method, can address several distinct DIA problems at the same time such as OFR, script detection, writer identification, etc. In this paper we strip down the Redundant Oriented LBP (RO-LBP) method, previously used in writer identification, and apply it for OFR with the goal of introducing a generic method that classifies text as oriented texture. We focus on Arabic OFR and try to perform a thorough comparison of our method and the leading Gaussian Mixture Model method that is developed specifically for the task. Depending on the nature of proposed OFR method, each method's performance is usually evaluated on different data and with different evaluation protocols. The proposed experimental procedure addresses this problem and allows us to compare OFR methods that are fundamentally different by adapting them to a common measurement protocol. In performed experiments LBP method achieves perfect results on large text blocks generated from the APTI database, while preserving its very broad generic attributes as proven by secondary experiments.","PeriodicalId":220495,"journal":{"name":"2014 11th IAPR International Workshop on Document Analysis Systems","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-04-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129767521","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 14

Plane Geometry Figure Retrieval with Bag of Shapes 用形状袋检索平面几何图形

2014 11th IAPR International Workshop on Document Analysis Systems Pub Date : 2014-04-07 DOI: 10.1109/DAS.2014.53

Lu Liu, Xiaoqing Lu, Keqiang Li, J. Qu, Liangcai Gao, Zhi Tang

{"title":"Plane Geometry Figure Retrieval with Bag of Shapes","authors":"Lu Liu, Xiaoqing Lu, Keqiang Li, J. Qu, Liangcai Gao, Zhi Tang","doi":"10.1109/DAS.2014.53","DOIUrl":"https://doi.org/10.1109/DAS.2014.53","url":null,"abstract":"Digital education is serving an increasingly important function in most educational institutions, thus resulting in the production of a large number of digital documents online for education purposes. However, convenient ways to retrieve mathematic geometry questions are lacking because current retrieval systems largely rely on keywords instead of geometry figure images. This study focuses on plane geometry figure (PGF) image retrieval with the aim of retrieving relevant geometry images that contain more structural information than a question text stem. To fully use geometrical properties, a Bag-of-shapes (BoS) method is proposed to build the feature descriptor of an image. The BoS method contains either basic geometric primitives or dual-primitive structures along with several specific geometrical features for shape description. Based on the BoS feature descriptor, we apply cosine similarity with group feature weight as vector similarity measure for ranking to achieve high efficiency. For a PGF image query, the retrieval results are provided in an appropriate ranking order, which has high visual similarity with respect to human perception. Retrieval experiments and evaluation results show the effectiveness and efficiency of the proposed BoS shape descriptor.","PeriodicalId":220495,"journal":{"name":"2014 11th IAPR International Workshop on Document Analysis Systems","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-04-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125364099","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 9

Automatic Training Set Generation for Better Historic Document Transcription and Compression 自动训练集生成更好的历史文件转录和压缩

2014 11th IAPR International Workshop on Document Analysis Systems Pub Date : 2014-04-07 DOI: 10.1109/DAS.2014.30

G. Silva, R. Lins, Cesar Gomes

引用次数: 2

Feasibility Study of Visualizing Diversity of Japanese Hiragana Handwritings by Multidimensional Scaling of Earth Mover's Distance toward Assisting Forensic Experts in Writer Verification 利用推土机距离多维尺度可视化日本平假名笔迹多样性，协助法医鉴定笔迹的可行性研究

2014 11th IAPR International Workshop on Document Analysis Systems Pub Date : 2014-04-07 DOI: 10.1109/DAS.2014.13

Yoshinori Akao, Atsushi Yamamoto, Yoshiyasu Higashikawa

引用次数: 2

The RWTH Large Vocabulary Arabic Handwriting Recognition System RWTH大词汇阿拉伯手写识别系统

2014 11th IAPR International Workshop on Document Analysis Systems Pub Date : 2014-04-07 DOI: 10.1109/DAS.2014.61

M. Hamdani, P. Doetsch, M. Kozielski, A. Mousa, H. Ney

引用次数: 32

End-to-End Text Recognition Using Local Ternary Patterns, MSER and Deep Convolutional Nets 使用局部三元模式、MSER和深度卷积网络的端到端文本识别

2014 11th IAPR International Workshop on Document Analysis Systems Pub Date : 2014-04-07 DOI: 10.1109/DAS.2014.29

M. Opitz, Markus Diem, Stefan Fiel, Florian Kleber, Robert Sablatnig

引用次数: 35