Proceedings of Sixth International Conference on Document Analysis and Recognition: Latest Papers, Page 8

How conditional independence assumption affects handwritten character segmentation

Proceedings of Sixth International Conference on Document Analysis and Recognition Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953792

M. Maragoudakis, E. Kavallieratou, N. Fakotakis, G. Kokkinakis

Citations: 3

Applying the T-Recs table recognition system to the business letter domain

Proceedings of Sixth International Conference on Document Analysis and Recognition Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953843

T. Kieninger, A. Dengel

Abstract: This paper summarizes the core idea of the T-Recs table recognition system, an integrated system covering block segmentation, table location and a model-free structural analysis of tables. T-Recs works on the output of commercial OCR systems that provide the word bounding box geometry together with the text itself (e.g. Xerox ScanWorX). While T-Recs performs well on a number of document categories, business letters have remained a challenging domain because the T-Recs location heuristics are misled by their headers or footers, resulting in low recognition precision. Business letters such as invoices are a very interesting domain for industrial applications due to the large number of documents to be analyzed and the importance of the data carried within their tables. Hence, we developed a more restrictive approach, which is implemented in the T-Recs++ prototype. This paper describes the ideas of the T-Recs++ location approach and also proposes a quality evaluation measure that reflects the bottom-up strategy of either T-Recs or T-Recs++. Finally, some results comparing both systems on a collection of business letters are given.

Citations: 80

Substroke approach to HMM-based on-line Kanji handwriting recognition

Proceedings of Sixth International Conference on Document Analysis and Recognition Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953838

M. Nakai, N. Akira, H. Shimodaira, S. Sagayama

Abstract: A new method is proposed for online handwriting recognition of Kanji characters. The method employs substroke HMMs as the minimum units that constitute Japanese Kanji characters and utilizes the direction of pen motion. The main motivation is to fully utilize the continuous speech recognition algorithm by relating sentence speech to Kanji characters, phonemes to substrokes, and grammar to Kanji structure. The proposed system consists of input feature analysis, substroke HMMs, a character structure dictionary and a decoder. The present approach has the following advantages over conventional methods that employ whole-character HMMs. 1) Much smaller memory requirement for the dictionary and models. 2) Fast recognition by employing an efficient substroke network search. 3) Capability of recognizing characters not included in the training data if they are defined as a sequence of substrokes in the dictionary. 4) Capability of recognizing characters written in different stroke orders, with multiple definitions per character in the dictionary. 5) Ease of HMM adaptation to the user with a few sample characters.

Citations: 105

Measuring HMM similarity with the Bayes probability of error and its application to online handwriting recognition

Proceedings of Sixth International Conference on Document Analysis and Recognition Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953822

Claus Bahlmann, H. Burkhardt

Citations: 69

Character pre-classification based on fuzzy typographical analysis

Proceedings of Sixth International Conference on Document Analysis and Recognition Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953758

Lu Da, Pu Wei, B. McCane

Citations: 1

An improved learning scheme for the moving window classifier

Proceedings of Sixth International Conference on Document Analysis and Recognition Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953861

Sanaul Hoque, M. Fairhurst

Citations: 3

AIDAS: incremental logical structure discovery in PDF documents

Proceedings of Sixth International Conference on Document Analysis and Recognition Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953816

A. Anjewierden

Citations: 55

Handwritten country name identification using vector quantisation and hidden Markov model

Proceedings of Sixth International Conference on Document Analysis and Recognition Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953877

G. Leedham, W. Tan, Weng Lee Yap

Citations: 3

Web sites thematic classification using hidden Markov models

Proceedings of Sixth International Conference on Document Analysis and Recognition Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953955

Lyonel Serradura, M. Slimane, N. Vincent

Citations: 1

Newspaper page decomposition using a split and merge approach

Proceedings of Sixth International Conference on Document Analysis and Recognition Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953972

K. Hadjar, O. Hitz, R. Ingold

Citations: 41