Proceedings of Sixth International Conference on Document Analysis and Recognition最新文献_第4页

A scanning n-tuple classifier for online recognition of handwritten digits 用于手写数字在线识别的扫描n元组分类器

Proceedings of Sixth International Conference on Document Analysis and Recognition Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953747

E. Ratzlaff

引用次数: 15

Multi-branch and two-pass HMM modeling approaches for off-line cursive handwriting recognition 离线草书手写识别的多分支和两次HMM建模方法

Proceedings of Sixth International Conference on Document Analysis and Recognition Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953789

Wenwei Wang, A. Brakensiek, A. Kosmala, G. Rigoll

引用次数: 15

Character extraction and recognition in natural scene images 自然场景图像中的字符提取与识别

Proceedings of Sixth International Conference on Document Analysis and Recognition Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953953

Xuewen Wang, Xiaoqing Ding, Changsong Liu

引用次数: 11

PenCalc: a novel application of on-line mathematical expression recognition technology 铅笔:在线数学表达式识别技术的一种新应用

Proceedings of Sixth International Conference on Document Analysis and Recognition Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953893

Kam-Fai Chan, D. Yeung

引用次数: 37

A model guided document image analysis scheme 一个模型引导文档图像分析方案

Proceedings of Sixth International Conference on Document Analysis and Recognition Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953963

Gaurav Harit, S. Chaudhury, Puneet Gupta, Neeti Vohra, S. Joshi

引用次数: 16

Why table ground-truthing is hard 为什么台面真相很难

Proceedings of Sixth International Conference on Document Analysis and Recognition Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953768

Jianying Hu, R. Kashi, D. Lopresti, G. Wilfong, G. Nagy

引用次数: 98

Constructing Web-based legacy index card archives-architectural design issues and initial data acquisition 构建基于web的遗留索引卡存档——体系结构设计问题和初始数据获取

Proceedings of Sixth International Conference on Document Analysis and Recognition Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953908

A. Downton, A. Tams, G. Wells, A. C. Holmes, S. Lucas, G. Beccaloni, M. Scoble, G. S. Robinson

{"title":"Constructing Web-based legacy index card archives-architectural design issues and initial data acquisition","authors":"A. Downton, A. Tams, G. Wells, A. C. Holmes, S. Lucas, G. Beccaloni, M. Scoble, G. S. Robinson","doi":"10.1109/ICDAR.2001.953908","DOIUrl":"https://doi.org/10.1109/ICDAR.2001.953908","url":null,"abstract":"Presents a progress report (after 1 year of a 3 year project) on the overall design for a flexible archive conversion system, intended eventually for widespread use as a tool to convert legacy typescript and handwritten archive card indexes into Internet-accessible and searchable databases. The VIADOCS system is being developed and evaluated on a demonstrator archive of 30,000 pyraloid moth cards at the UK Natural History Museum, and has already demonstrated a successful and efficient mechanism for image acquisition using a modified bank cheque scanner. Document image processing and analysis techniques, defined by an XML validating document type definition (DTD), are being used to correct defects in the acquired images and parse card sequences to match the hierarchical taxonomy of pyraloid moth species. Parsed data is processed by offline OCR engines augmented by field-specific subject dictionaries to produce a 'draft' online archive. This archive will then be validated interactively via a Web browser as it is used. It is hoped eventually to provide an efficient and configurable legacy archive document conversion system not only for the Natural History Museum, but also for all museums, libraries and archives where there is a need to interrogate legacy documents via computer.","PeriodicalId":277816,"journal":{"name":"Proceedings of Sixth International Conference on Document Analysis and Recognition","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133935798","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 9

An investigation on MPEG audio segmentation by evolutionary algorithms 基于进化算法的MPEG音频分割研究

Proceedings of Sixth International Conference on Document Analysis and Recognition Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953926

C. Stefano, A. D. Cioppa, A. Marcelli

引用次数: 2

Character-like region verification for extracting text in scene images 在场景图像中提取文本的类字符区域验证

Proceedings of Sixth International Conference on Document Analysis and Recognition Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953927

Hao Wang, J. Kangas

引用次数: 18

On the influence of vocabulary size and language models in unconstrained handwritten text recognition 词汇量和语言模型对无约束手写体文本识别的影响

Proceedings of Sixth International Conference on Document Analysis and Recognition Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953795

Urs-Viktor Marti, H. Bunke

{"title":"On the influence of vocabulary size and language models in unconstrained handwritten text recognition","authors":"Urs-Viktor Marti, H. Bunke","doi":"10.1109/ICDAR.2001.953795","DOIUrl":"https://doi.org/10.1109/ICDAR.2001.953795","url":null,"abstract":"In this paper we present a system for unconstrained handwritten text recognition. The system consists of three components: preprocessing, feature extraction and recognition. In the preprocessing phase, a page of handwritten text is divided into its lines and the writing is normalized by means of skew and slant correction, positioning and scaling. From a normalized text line image, features are extracted using a sliding window technique. From each position of the window nine geometrical features are computed. The core of the system, the recognizes is based on hidden Markov models. For each individual character, a model is provided. The character models are concatenated to words using a vocabulary. Moreover, the word models are concatenated to models that represent full lines of text. Thus the difficult problem of segmenting a line of text into its individual words can be overcome. To enhance the recognition capabilities of the system, a statistical language model is integrated into the hidden Markov model framework. To preselect useful language models and compare them, perplexity is used. Both perplexity as originally proposed and normalized perplexity are considered.","PeriodicalId":277816,"journal":{"name":"Proceedings of Sixth International Conference on Document Analysis and Recognition","volume":"38 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114013725","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 48