{"title":"Online Handwritten Kanji Recognition Based on Inter-stroke Grammar","authors":"Ikumi Ota, Ryo Yamamoto, Shinji Sako, S. Sagayama","doi":"10.1109/ICDAR.2007.202","DOIUrl":"https://doi.org/10.1109/ICDAR.2007.202","url":null,"abstract":"This paper presents a new approach to online recognition of handwritten Kanji characters focusing on their hierarchical structure. Stochastic context-free grammar (SCFG) is introduced to represent the Kanji character generating process in combination with Hidden Markov Models (HMM) representing Kanji substrokes and to improve the recognition accuracy of important and frequently used Kanji characters in which inter-stroke relative positions play important roles. Combining the stroke likelihood and the relative-position likelihood between character-parts in the parsing process is expected to compensate their ambiguities. By modeling relative positions and share the models across distinct Kanji categories, a small training data can yield effective results and enables us to recognize Kanji simply by defining the SCFG rules to represent their structures without training data. Experimental results on an online handwritten Kanji database from JAIST (Japan Advanced Institute of Science and Technology) showed significant improvements in the recognition rates of some important Kanji with relatively fewer strokes and also showed little difference between the trained- and the non-trained Kanji in recognition rates.","PeriodicalId":279268,"journal":{"name":"Ninth International Conference on Document Analysis and Recognition (ICDAR 2007)","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127849247","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Robust Binarization for Video Text Recognition","authors":"Z. Saidane, Christophe Garcia","doi":"10.1109/ICDAR.2007.222","DOIUrl":"https://doi.org/10.1109/ICDAR.2007.222","url":null,"abstract":"This paper presents an automatic binarization method for color text areas in images or videos, which is robust to complex background, low resolution or video coding artefacts. Based on a specific architecture of convolutional neural networks, the proposed system automatically learns how to perform binarization, from a training set of synthesized text images and their corresponding desired binary images, without making any assumptions or using tunable parameters. The proposed method is compared to state-of-the-art binarization techniques, with respect to Gaussian noise and contrast variations, demonstrating the robustness and the efficiency of our method. Text recognition experiments on a database of images extracted from video frames and web pages, with two classical OCRs applied on the obtained binary images show a strong enhancement of the recognition rate by more than 40%.","PeriodicalId":279268,"journal":{"name":"Ninth International Conference on Document Analysis and Recognition (ICDAR 2007)","volume":"143 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127941872","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Transform-Invariance in Local Averaging Classifier for Handwritten Digit Pattern Recognition","authors":"Seiji Hotta","doi":"10.1109/ICDAR.2007.253","DOIUrl":"https://doi.org/10.1109/ICDAR.2007.253","url":null,"abstract":"In this paper, a classification method designed by combining a local averaging classifier and a tangent distance is proposed for handwritten digit pattern recognition. In practice, first the k-nearest neighbors of an input sample are selected in each class by using a two-sided tangent distance. Next, the mean vectors of the selected transformed-neighbor samples are computed in individual classes. Finally, the input sample is classified to the class that minimizes the one sided tangent distance between the input sample and the mean one. The superior performance of the proposed method is verified with the experiments on benchmark datasets MNIST and USPS.","PeriodicalId":279268,"journal":{"name":"Ninth International Conference on Document Analysis and Recognition (ICDAR 2007)","volume":"51 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121307726","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
H. Srinivasan, Shrivardhan Kabra, Chen Huang, S. Srihari
{"title":"On Computing Strength of Evidence for Writer Verification","authors":"H. Srinivasan, Shrivardhan Kabra, Chen Huang, S. Srihari","doi":"10.1109/ICDAR.2007.193","DOIUrl":"https://doi.org/10.1109/ICDAR.2007.193","url":null,"abstract":"The problem of writer verification is to make a decision of whether or not two handwritten documents are written by the same person. Providing a strength of evidence for any such decision is an integral part of the writer verification problem. The strength of evidence should incorporate (i) The amount of information compared in each of the two documents (line/half page/full page etc.), (ii) The nature of content present in the document (same/different content), (iii) Features used for comparison and (iv) The error rate of the model used for making the decision. This paper describes the statistical model used for writer verification and also introduces a mathematical formulation to include the above four mentioned parameters, for calculating strength of evidence of same/different writer. The statistical model uses Gamma and Gaussian densities to parametrically model the distance space distribution arising from comparing ensemble of pairs of documents. The strength of evidence is mapped to a 9-point qualitative scale for the decision; one that is often used by questioned document examiners. Experiments and results show that with increase in information content from just a single word to a full page of document, the verification accuracy of the model increases.","PeriodicalId":279268,"journal":{"name":"Ninth International Conference on Document Analysis and Recognition (ICDAR 2007)","volume":"85 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129168963","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Human-Centric Off-Line Signature Verification System","authors":"H. Coetzer, R. Sabourin","doi":"10.1109/ICDAR.2007.13","DOIUrl":"https://doi.org/10.1109/ICDAR.2007.13","url":null,"abstract":"The manual signature-based authentication of a large number of documents is a laborious and time-consuming task. Consequently many off-line signature verification systems were recently developed. In this paper we propose a human-centric system, which exploits the synergy between human and machine capabilities, and show that this combined system can perform better (than humans or a machine) for almost all operating costs. The combination strategy is based on techniques in receiver operating characteristics (ROC) analysis. We conduct an experiment on a data set that contains 765 test signatures from 51 writers, and record the performance of 23 human classifiers, and that of a hidden Markov model-based (HMM-based) classifier, in ROC space. We propose that a manager (human or machine) specifies acceptable operating costs (Neyman- Pearson criterion), after which our human-centric system makes an optimal decision by utilizing the maximum attainable combined classifier.","PeriodicalId":279268,"journal":{"name":"Ninth International Conference on Document Analysis and Recognition (ICDAR 2007)","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116939981","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Recognition of Fragmented Characters Using Multiple Feature-Subset Classifiers","authors":"Chien-Hsing Chou, Chien-Yang Guo, F. Chang","doi":"10.1109/ICDAR.2007.214","DOIUrl":"https://doi.org/10.1109/ICDAR.2007.214","url":null,"abstract":"This paper addresses the problem of recognizing fragmented characters in printed documents of poor printing quality, which often causes characters to break up. To enhance the recognition accuracy of such characters, most existing approaches attempt to improve the quality of character images by means of some mending techniques. We propose an alternative approach that adopts a bagging-predictor method to build classifiers, using only intact characters as training samples. The resultant classifiers can classify both intact and fragmented characters with a high degree of accuracy. Applying this approach to characters in archived Chinese newspapers, we extract two types of features from character images and form bagging predictors, each of which takes a subset of features as input. As a result, we are able to achieve drastic improvements in the recognition of fragmented characters.","PeriodicalId":279268,"journal":{"name":"Ninth International Conference on Document Analysis and Recognition (ICDAR 2007)","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117269110","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A View on the Past and Future of Character and Document Recognition","authors":"H. Fujisawa","doi":"10.1109/ICDAR.2007.39","DOIUrl":"https://doi.org/10.1109/ICDAR.2007.39","url":null,"abstract":"The paper first gives an overview on the technical advances in the field of character and document recognition, decade by decade. Then, it highlights key technical developments especially for Kanji (Chinese character) recognition in Japan. Technical issues around post address recognition are then discussed, which have promoted advanced techniques including information integration. Robustness design principles are introduced. Finally, future prospects are discussed.","PeriodicalId":279268,"journal":{"name":"Ninth International Conference on Document Analysis and Recognition (ICDAR 2007)","volume":"116 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115535701","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Document Processing via Trained Morphological Operators","authors":"N. Hirata","doi":"10.1109/ICDAR.2007.104","DOIUrl":"https://doi.org/10.1109/ICDAR.2007.104","url":null,"abstract":"Morphological operators have proven to be useful for many image processing tasks. However, the design of an adequate operator for a given task is not simple in general. A possible approach to deal with this difficulty is to design operators using training based methods. This work shows the application of trained morphological operators for several document processing tasks including character recognition, text segmentation and graphics processing.","PeriodicalId":279268,"journal":{"name":"Ninth International Conference on Document Analysis and Recognition (ICDAR 2007)","volume":"103 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115623763","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Recognition of Broken Characters from Historical Printed Books Using Dynamic Bayesian Networks","authors":"Laurence Likforman-Sulem, M. Sigelle","doi":"10.1109/ICDAR.2007.213","DOIUrl":"https://doi.org/10.1109/ICDAR.2007.213","url":null,"abstract":"This paper investigates the application of dynamic Bayesian networks (DBNs) to the recognition of degraded characters from historical printed books. This framework allows us to capture the 2D nature of character images by the coupling of two HMMs (Hidden Markov Models). The vertical HMM observes image columns while the horizontal HMM observes image rows respectively. Two coupled DBN architectures are proposed to model interactions between these two streams. We present experiments on real degraded characters extracted from an ancient printed book (17th century). These experiments demonstrate that coupled architectures significantly better cope with broken characters than non coupled ones and than discriminative methods such as SVMs.","PeriodicalId":279268,"journal":{"name":"Ninth International Conference on Document Analysis and Recognition (ICDAR 2007)","volume":"111 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116259725","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"An Indexing Method for Graphical Documents","authors":"S. Tabbone, Daniel Zuwala","doi":"10.1109/ICDAR.2007.55","DOIUrl":"https://doi.org/10.1109/ICDAR.2007.55","url":null,"abstract":"In this paper, a method to browse symbols into graphical documents is presented. More precisely, we propose a combined filtering and indexing mechanism that retrieves in an efficient way the most similar symbols to a given input query. For a database of 200000 symbols the retrieval time has been divided by a factor of 4, 5 compared to a linear search.","PeriodicalId":279268,"journal":{"name":"Ninth International Conference on Document Analysis and Recognition (ICDAR 2007)","volume":"2014 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127415800","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}