Proceedings of the Fourth International Conference on Document Analysis and Recognition最新文献_第2页

An interactive system to extract structured text from a geometrical representation 从几何表示中提取结构化文本的交互式系统

Proceedings of the Fourth International Conference on Document Analysis and Recognition Pub Date : 1997-08-18 DOI: 10.1109/ICDAR.1997.619868

Benoit Poirier, M. Dagenais

{"title":"An interactive system to extract structured text from a geometrical representation","authors":"Benoit Poirier, M. Dagenais","doi":"10.1109/ICDAR.1997.619868","DOIUrl":"https://doi.org/10.1109/ICDAR.1997.619868","url":null,"abstract":"The proliferation of electronic document formats impedes the dissemination and management of documents. Indeed, a common format with structural information is required to obtain document indexing and navigation. While in some formats it is easy to decode and preserve the document structure information, often the only easily obtainable representation is Postscript, where only the geometrical information remains. Even if an organization is willing to convert all its document producing activities to a structure preserving format such as HTML, the existing documents need to be converted. The paper addresses the difficult problem of extracting the structure of a document from a geometrical representation. An interactive tool to extract the document content and structure from a geometric representation (Postscript) has been developed. It successfully analyzes several documents produced with different tools, and produces structural information using the HyperText Markup Language (HTML). The end user, when presented with the extracted document structure, can interactively modify it, if needed. The tool is easily extended to recognize new constructs and is aimed at organizations needing to convert numerous documents for searching and browsing on intranets or on the Internet.","PeriodicalId":435320,"journal":{"name":"Proceedings of the Fourth International Conference on Document Analysis and Recognition","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126402025","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 4

Combining multiple representations and classifiers for pen-based handwritten digit recognition 结合多个表示和分类器的手写数字识别

Proceedings of the Fourth International Conference on Document Analysis and Recognition Pub Date : 1997-08-18 DOI: 10.1109/ICDAR.1997.620583

F. Alimoglu, Ethem Alpaydin

{"title":"Combining multiple representations and classifiers for pen-based handwritten digit recognition","authors":"F. Alimoglu, Ethem Alpaydin","doi":"10.1109/ICDAR.1997.620583","DOIUrl":"https://doi.org/10.1109/ICDAR.1997.620583","url":null,"abstract":"We investigate techniques to combine multiple representations of a handwritten digit to increase classification accuracy without significantly increasing system complexity or recognition time. We compare multiexpert and multistage combination techniques and discuss in detail in a comparative manner methods for combining multiple learners: voting, mixture of experts, stacking, boosting and cascading. In pen based handwritten character recognition, the input is the dynamic movement of the pentip over the pressure sensitive tablet. There is also the image formed as a result of this movement. On a real world database, we notice that the two multi layer perceptron (MLP) neural network based classifiers using these representations separately make errors on different patterns, implying that a suitable combination of the two would lead to higher accuracy. Thus we implement and compare voting, mixture of experts, stacking and cascading. Combined classifiers have an error percentage less than individual ones. The final combined system of two MLPs has less complexity and memory requirement than a single k nearest neighbor using one of the representations.","PeriodicalId":435320,"journal":{"name":"Proceedings of the Fourth International Conference on Document Analysis and Recognition","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128079718","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 118

Performance comparison of several feature selection methods based on node pruning in handwritten character recognition 基于节点修剪的几种特征选择方法在手写字符识别中的性能比较

Proceedings of the Fourth International Conference on Document Analysis and Recognition Pub Date : 1997-08-18 DOI: 10.1109/ICDAR.1997.619805

Kyusik Chung, Jongmin Yoon

引用次数: 14

Surfing an ODBMS (maintaining WWW documents with O/sub 2/) 浏览ODBMS(使用O/ sub2 /维护WWW文档)

Proceedings of the Fourth International Conference on Document Analysis and Recognition Pub Date : 1997-08-18 DOI: 10.1109/ICDAR.1997.620627

F. Buddrus, Marco Bellavia

引用次数: 4

HMM word recognition engine HMM词识别引擎

Proceedings of the Fourth International Conference on Document Analysis and Recognition Pub Date : 1997-08-18 DOI: 10.1109/ICDAR.1997.620559

D. Guillevic, C. Suen

引用次数: 53

Recovery of temporal information of cursively handwritten words for on-line recognition 用于在线识别的草书手写文字时间信息的恢复

Proceedings of the Fourth International Conference on Document Analysis and Recognition Pub Date : 1997-08-18 DOI: 10.1109/ICDAR.1997.620647

H. Bunke, R. Ammann, Guido Kaufmann, T. M. Ha, M. Schenkel, R. Seiler, F. Eggimann

引用次数: 32

An object-oriented form description language and approach to handwritten form processing 一个面向对象的表单描述语言和手写表单处理方法

Proceedings of the Fourth International Conference on Document Analysis and Recognition Pub Date : 1997-08-18 DOI: 10.1109/ICDAR.1997.619837

C. Cracknell, A. Downton, L. Du

引用次数: 4

Recognizing on-line handwritten Chinese character via FARG matching 基于FARG匹配的在线手写汉字识别

Proceedings of the Fourth International Conference on Document Analysis and Recognition Pub Date : 1997-08-18 DOI: 10.1109/ICDAR.1997.620578

Jing Zheng, Xiaoqing Ding, Youshou Wu

引用次数: 13

Measuring the effects of OCR errors on similarity linking 测量OCR误差对相似链接的影响

Proceedings of the Fourth International Conference on Document Analysis and Recognition Pub Date : 1997-08-18 DOI: 10.1109/ICDAR.1997.620654

A. Myka, Ulrich Güntzer

引用次数: 8

A study of moment functions and its use in Chinese character recognition 矩函数及其在汉字识别中的应用研究

Proceedings of the Fourth International Conference on Document Analysis and Recognition Pub Date : 1997-08-18 DOI: 10.1109/ICDAR.1997.620566

S. Liao, Q. Lu

引用次数: 19