Proceedings of Sixth International Conference on Document Analysis and Recognition最新文献_第2页

Handwritten numeral recognition using flexible matching based on learning of stroke statistics 基于笔画统计学习的灵活匹配手写数字识别

Proceedings of Sixth International Conference on Document Analysis and Recognition Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953862

Takashi Kobayashi, Kaori Nakamura, Hirokazu Muramatsu, Takahiro Sugiyama, K. Abe

引用次数: 2

Adaptive N-best-list handwritten word recognition 自适应n最佳列表手写单词识别

Proceedings of Sixth International Conference on Document Analysis and Recognition Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953777

T. Kwok, M. Perrone

引用次数: 7

Text extraction from color documents-clustering approaches in three and four dimensions 彩色文档的文本提取——三维和四维聚类方法

Proceedings of Sixth International Conference on Document Analysis and Recognition Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953923

T. Perroud, K. Sobottka, H. Bunke, L. Hall

引用次数: 30

Robust feature extraction based on run-length compensation for degraded handwritten character recognition 基于游程补偿的退化手写字符识别鲁棒特征提取

Proceedings of Sixth International Conference on Document Analysis and Recognition Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953870

M. Mori, M. Sawaki, N. Hagita, H. Murase, N. Mukawa

引用次数: 9

Text line segmentation and word recognition in a system for general writer independent handwriting recognition 文本行分割和词识别系统中一般写作者独立的手写识别

Proceedings of Sixth International Conference on Document Analysis and Recognition Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953775

Urs-Viktor Marti, H. Bunke

{"title":"Text line segmentation and word recognition in a system for general writer independent handwriting recognition","authors":"Urs-Viktor Marti, H. Bunke","doi":"10.1109/ICDAR.2001.953775","DOIUrl":"https://doi.org/10.1109/ICDAR.2001.953775","url":null,"abstract":"We present a system for recognizing unconstrained English handwritten text based on a large vocabulary. We describe the three main components of the system, which are preprocessing, feature extraction and recognition. In the preprocessing phase the handwritten texts are first segmented into lines. Then each line of text is normalized with respect to of skew, slant, vertical position and width. After these steps, text lines are segmented into single words. For this purpose distances between connected components are measured. Using a threshold, the distances are divided into distances within a word and distances between different words. A line of text is segmented at positions where the distances are larger than the chosen threshold. From each image representing a single word, a sequence of features is extracted. These features are input to a recognition procedure which is based on hidden Markov models. To investigate the stability of the segmentation algorithm the threshold that separates intra- and inter-word distances from each other is varied. If the threshold is small many errors are caused by over-segmentation, while for large thresholds under-segmentation errors occur. The best segmentation performance is 95.56% correctly segmented words, tested on 541 text lines containing 3899 words. Given a correct segmentation rate of 95.56%, a recognition rate of 73.45% on the word level is achieved.","PeriodicalId":277816,"journal":{"name":"Proceedings of Sixth International Conference on Document Analysis and Recognition","volume":"66 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132367340","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 95

An OCR system for Telugu 泰卢固语的OCR系统

Proceedings of Sixth International Conference on Document Analysis and Recognition Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953958

A. Negi, C. Bhagvati, B. Krishna

引用次数: 143

Discrimination of Oriental and Euramerican scripts using fractal feature 基于分形特征的东西方文字辨析

Proceedings of Sixth International Conference on Document Analysis and Recognition Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953959

Yu Tao, Y. Tang

引用次数: 13

Creating generic text summaries 创建通用文本摘要

Proceedings of Sixth International Conference on Document Analysis and Recognition Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953917

Yihong Gong, Xin Liu

引用次数: 22

A class-modularity for character recognition 用于字符识别的类模块化

Proceedings of Sixth International Conference on Document Analysis and Recognition Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953756

Il-Seok Oh, Jin-Seon Lee, C. Suen

引用次数: 2

Word discrimination based on bigram co-occurrences 基于重字共现的词辨别

Proceedings of Sixth International Conference on Document Analysis and Recognition Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953773

A. El-Nasan, S. Veeramachaneni, G. Nagy

引用次数: 14