Proceedings of the Fourth International Conference on Document Analysis and Recognition: Latest Papers

An interactive system to extract structured text from a geometrical representation
Benoit Poirier, M. Dagenais
{"title":"An interactive system to extract structured text from a geometrical representation","authors":"Benoit Poirier, M. Dagenais","doi":"10.1109/ICDAR.1997.619868","DOIUrl":"https://doi.org/10.1109/ICDAR.1997.619868","url":null,"abstract":"The proliferation of electronic document formats impedes the dissemination and management of documents. Indeed, a common format with structural information is required to obtain document indexing and navigation. While in some formats it is easy to decode and preserve the document structure information, often the only easily obtainable representation is Postscript, where only the geometrical information remains. Even if an organization is willing to convert all its document producing activities to a structure preserving format such as HTML, the existing documents need to be converted. The paper addresses the difficult problem of extracting the structure of a document from a geometrical representation. An interactive tool to extract the document content and structure from a geometric representation (Postscript) has been developed. It successfully analyzes several documents produced with different tools, and produces structural information using the HyperText Markup Language (HTML). The end user, when presented with the extracted document structure, can interactively modify it, if needed. The tool is easily extended to recognize new constructs and is aimed at organizations needing to convert numerous documents for searching and browsing on intranets or on the Internet.","PeriodicalId":435320,"journal":{"name":"Proceedings of the Fourth International Conference on Document Analysis and Recognition","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126402025","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 4
Combining multiple representations and classifiers for pen-based handwritten digit recognition
F. Alimoglu, Ethem Alpaydin
{"title":"Combining multiple representations and classifiers for pen-based handwritten digit recognition","authors":"F. Alimoglu, Ethem Alpaydin","doi":"10.1109/ICDAR.1997.620583","DOIUrl":"https://doi.org/10.1109/ICDAR.1997.620583","url":null,"abstract":"We investigate techniques to combine multiple representations of a handwritten digit to increase classification accuracy without significantly increasing system complexity or recognition time. We compare multiexpert and multistage combination techniques and discuss in detail in a comparative manner methods for combining multiple learners: voting, mixture of experts, stacking, boosting and cascading. In pen based handwritten character recognition, the input is the dynamic movement of the pentip over the pressure sensitive tablet. There is also the image formed as a result of this movement. On a real world database, we notice that the two multi layer perceptron (MLP) neural network based classifiers using these representations separately make errors on different patterns, implying that a suitable combination of the two would lead to higher accuracy. Thus we implement and compare voting, mixture of experts, stacking and cascading. Combined classifiers have an error percentage less than individual ones. The final combined system of two MLPs has less complexity and memory requirement than a single k nearest neighbor using one of the representations.","PeriodicalId":435320,"journal":{"name":"Proceedings of the Fourth International Conference on Document Analysis and Recognition","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128079718","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 118
Performance comparison of several feature selection methods based on node pruning in handwritten character recognition
Kyusik Chung, Jongmin Yoon
{"title":"Performance comparison of several feature selection methods based on node pruning in handwritten character recognition","authors":"Kyusik Chung, Jongmin Yoon","doi":"10.1109/ICDAR.1997.619805","DOIUrl":"https://doi.org/10.1109/ICDAR.1997.619805","url":null,"abstract":"The paper presents a performance comparison of several feature selection methods based on neural network node pruning. Assuming the features are extracted and presented as the inputs of a 3 layered perceptron classifier, we apply the five feature selection methods before/during/after neural network training in order to prune only input nodes of the neural network. Four of them are node pruning methods such as node saliency method, node sensitivity method, and two interactive pruning methods using different contribution measures. The last one is a statistical method based on principle component analysis (PCA). The first two of them prune input nodes during training whereas the last three do before/after network training. For gradient and upper down, left right hole concavity features, we perform several experiments of handwritten English alphabet and digit recognition with/without pruning using the five feature selection algorithms, respectively. The experimental results show that node saliency method outperforms the others.","PeriodicalId":435320,"journal":{"name":"Proceedings of the Fourth International Conference on Document Analysis and Recognition","volume":"215 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134431965","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 14
Surfing an ODBMS (maintaining WWW documents with O2)
F. Buddrus, Marco Bellavia
{"title":"Surfing an ODBMS (maintaining WWW documents with O/sub 2/)","authors":"F. Buddrus, Marco Bellavia","doi":"10.1109/ICDAR.1997.620627","DOIUrl":"https://doi.org/10.1109/ICDAR.1997.620627","url":null,"abstract":"Deals with the idea of an object-oriented DBMS (O/sub 2/) working as an Internet server. The approach introduced offers fine-grained storage and retrieval of World Wide Web (WWW) documents, providing sophisticated document maintenance, supporting referential integrity, document integration, integration with application objects, access control and versioning. In contrast to comparable systems, it acts transparently to the WWW users, i.e. without using proprietary query languages or protocols. While achieving the full benefits of a modern document repository, it keeps the ease of the traditional file system it replaces.","PeriodicalId":435320,"journal":{"name":"Proceedings of the Fourth International Conference on Document Analysis and Recognition","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131059574","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 4
HMM word recognition engine
D. Guillevic, C. Suen
{"title":"HMM word recognition engine","authors":"D. Guillevic, C. Suen","doi":"10.1109/ICDAR.1997.620559","DOIUrl":"https://doi.org/10.1109/ICDAR.1997.620559","url":null,"abstract":"We describe a hidden Markov model (HMM) based word recognition engine being developed to be integrated with the CENPARMI bank cheque processing system. The various modules are described in detail, and preliminary results are compared with our previous global feature recognition scheme. The engine is tested on words from a database of over 4,500 cheques of 1,400 writers.","PeriodicalId":435320,"journal":{"name":"Proceedings of the Fourth International Conference on Document Analysis and Recognition","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133695958","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 53
Recovery of temporal information of cursively handwritten words for on-line recognition
H. Bunke, R. Ammann, Guido Kaufmann, T. M. Ha, M. Schenkel, R. Seiler, F. Eggimann
{"title":"Recovery of temporal information of cursively handwritten words for on-line recognition","authors":"H. Bunke, R. Ammann, Guido Kaufmann, T. M. Ha, M. Schenkel, R. Seiler, F. Eggimann","doi":"10.1109/ICDAR.1997.620647","DOIUrl":"https://doi.org/10.1109/ICDAR.1997.620647","url":null,"abstract":"On-line recognition differs from off-line recognition in that additional information about the drawing order of the strokes is available. This temporal information makes it easier to recognize handwritten texts with an on-line recognition system. In this paper we present a method for the recovery of the stroke order from static handwritten images. The algorithm was tested by classifying the words of an off-line database with a state-of-the-art on-line recognition system. On this database with 150 different words, written by four cooperative writers, a recognition rate of 97.4% was obtained.","PeriodicalId":435320,"journal":{"name":"Proceedings of the Fourth International Conference on Document Analysis and Recognition","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128834482","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 32
An object-oriented form description language and approach to handwritten form processing
C. Cracknell, A. Downton, L. Du
{"title":"An object-oriented form description language and approach to handwritten form processing","authors":"C. Cracknell, A. Downton, L. Du","doi":"10.1109/ICDAR.1997.619837","DOIUrl":"https://doi.org/10.1109/ICDAR.1997.619837","url":null,"abstract":"Virtually all form processing systems rely upon a form description file which holds some form design specific information. Such files are written in a form description language. This paper presents a novel form description language and a new approach to form processing which makes full use of the language. The form description language which is presented is novel in two ways: firstly, it allows the comprehensive utilisation of contextual information; secondly, it enables the storage of form models which economically describe the variations in the form design which are allowed for a particular class of forms.","PeriodicalId":435320,"journal":{"name":"Proceedings of the Fourth International Conference on Document Analysis and Recognition","volume":"75 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127387349","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 4
Recognizing on-line handwritten Chinese character via FARG matching
Jing Zheng, Xiaoqing Ding, Youshou Wu
{"title":"Recognizing on-line handwritten Chinese character via FARG matching","authors":"Jing Zheng, Xiaoqing Ding, Youshou Wu","doi":"10.1109/ICDAR.1997.620578","DOIUrl":"https://doi.org/10.1109/ICDAR.1997.620578","url":null,"abstract":"The paper presents a novel method for online handwritten Chinese character recognition. In our method, each category of character is described by a fuzzy attributed relational graph (FARG). A relaxation algorithm is developed to match the input pattern with every FARG. For decision making, a similarity measure is established via statistical technique to calculate the matching degree between the input pattern and referenced FARG, according to which the recognition result is determined. The principle of our method makes it very robust against stroke connection and stroke order variation as well as stroke shape deformation. A database of 22530 samples collected from 6 subjects is used to test our recognition system which can recognize 3755 categories of Chinese characters. The result shows that our method is very effective: a top 1 recognition rate of 98.8% and a top 10 of 99.7% are reached.","PeriodicalId":435320,"journal":{"name":"Proceedings of the Fourth International Conference on Document Analysis and Recognition","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114827209","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 13
Measuring the effects of OCR errors on similarity linking
A. Myka, Ulrich Güntzer
{"title":"Measuring the effects of OCR errors on similarity linking","authors":"A. Myka, Ulrich Güntzer","doi":"10.1109/ICDAR.1997.620654","DOIUrl":"https://doi.org/10.1109/ICDAR.1997.620654","url":null,"abstract":"The vector-space model offers an easy and robust model for Information Retrieval. Thereby, the similarities between queries and documents as well as the similarities between documents themselves are of importance. Document similarities may be used in order to generate links between documents that lead users from one document to related ones. Studies have shown that the vector-space model is robust in the context of OCR-processing if manually constructed queries are used. However it is not clear whether this model, if used for hypertext construction, is robust with regard to data corruption as caused by OCR engines. In this paper, we describe the performance of automatic hypertext construction, based on the vector-space model, with regard to three different measures: the number of overtakings within the used rankings, the accumulated distance of a document's position within the rankings and a comparison based on recall-precision graphs.","PeriodicalId":435320,"journal":{"name":"Proceedings of the Fourth International Conference on Document Analysis and Recognition","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116898667","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 8
A study of moment functions and its use in Chinese character recognition
S. Liao, Q. Lu
{"title":"A study of moment functions and its use in Chinese character recognition","authors":"S. Liao, Q. Lu","doi":"10.1109/ICDAR.1997.620566","DOIUrl":"https://doi.org/10.1109/ICDAR.1997.620566","url":null,"abstract":"Moment descriptors have been developed as features in pattern recognition since the moment method was first introduced. In this paper, new moment features for Chinese character recognition are proposed. These provide significant improvements in terms of Chinese character recognition, especially for those characters that are very close in shapes.","PeriodicalId":435320,"journal":{"name":"Proceedings of the Fourth International Conference on Document Analysis and Recognition","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116986164","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 19