Proceedings of the Fourth International Conference on Document Analysis and Recognition最新文献_第4页

Image and text coupling for creating electronic books from manuscripts 从手稿中创建电子图书的图像和文本耦合

Proceedings of the Fourth International Conference on Document Analysis and Recognition Pub Date : 1997-08-18 DOI: 10.1109/ICDAR.1997.620626

Laurent Robert, Laurence Likforman-Sulem, É. Lecolinet

引用次数: 7

Table image segmentation 表格图像分割

Proceedings of the Fourth International Conference on Document Analysis and Recognition Pub Date : 1997-08-18 DOI: 10.1109/ICDAR.1997.620599

Konstantin Zuyev

引用次数: 48

An Image Consulting Framework for document analysis of Internet graphics 一个用于网络图形文档分析的图像咨询框架

Proceedings of the Fourth International Conference on Document Analysis and Recognition Pub Date : 1997-08-18 DOI: 10.1109/ICDAR.1997.620625

M. Köppen, L. Lohmann, B. Nickolay

{"title":"An Image Consulting Framework for document analysis of Internet graphics","authors":"M. Köppen, L. Lohmann, B. Nickolay","doi":"10.1109/ICDAR.1997.620625","DOIUrl":"https://doi.org/10.1109/ICDAR.1997.620625","url":null,"abstract":"A new system approach for image understanding, called the Image Consulting Framework, is proposed. It allows for the validation of image properties. The kinds of image properties considered are textual, textural, hierarchical, color and symbolic. Its main application field is information filtering from images used in World Wide Web documents. The Image Consulting Framework consists of four stages: the color separation stage, the information granulation-verification modules (GVMs), the task stage and the recognition stage. At the base of the framework are the GVMs, which are designed to solve very special tasks. They consists of three parts: a method maintainer, a parameter chooser and a tester (verifier). The parameter chooser uses a given set of parameter settings for different runs of the maintained method on the input images of the GVM. The resulting images are tested for the occurrence of the property for which the GVM is designed. All successful images are put into a queue. The task stage calls new GVMs due to the filling of the queue, and it also assigns input images to the GVMs. All fully-treated images are passed to the recognition stage, where the information extraction is performed.","PeriodicalId":435320,"journal":{"name":"Proceedings of the Fourth International Conference on Document Analysis and Recognition","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125233012","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 4

Retrieval methods for English-text with missrecognized OCR characters 英文文本OCR字符识别错误的检索方法

Proceedings of the Fourth International Conference on Document Analysis and Recognition Pub Date : 1997-08-18 DOI: 10.1109/ICDAR.1997.620651

Manabu Ohta, A. Takasu, J. Adachi

{"title":"Retrieval methods for English-text with missrecognized OCR characters","authors":"Manabu Ohta, A. Takasu, J. Adachi","doi":"10.1109/ICDAR.1997.620651","DOIUrl":"https://doi.org/10.1109/ICDAR.1997.620651","url":null,"abstract":"This paper presents three probabilistic text retrieval methods designed to carry out a full-text search of English documents containing OCR errors. By searching for any query term on the premise that there are errors in the recognized text, the methods presented can tolerate such errors, and therefore costly manual post-editing is not required after OCR recognition. In the applied approach, confusion matrices are used to store characters which are likely to be interchanged when a particular character is missrecognized, and the respective probability of each occurrence. Moreover, a 2-gram matrix is used to store probabilities of character connection, i.e., which letter is likely to come after another. Multiple search terms are generated for an input query term by making reference to confusion matrices, after which a full-text search is run for each search term. The validity of retrieved terms is determined based on error-occurrence and character connection probabilities. The performance of these methods is experimentally evaluated by determining retrieval effectiveness, i.e., by calculating recall and precision rates. Results indicate marked improvement in comparison with exact matching.","PeriodicalId":435320,"journal":{"name":"Proceedings of the Fourth International Conference on Document Analysis and Recognition","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121878919","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 38

Generalized contextual recognition of hand-printed documents using semantic trees with lazy evaluation 基于延迟求值的语义树手印文档的广义上下文识别

Proceedings of the Fourth International Conference on Document Analysis and Recognition Pub Date : 1997-08-18 DOI: 10.1109/ICDAR.1997.619848

L. Du, A. Downton, S. Lucas, Badr Al-Badr

引用次数: 8

Information capture and semantic indexing of digital libraries through machine learning techniques 基于机器学习技术的数字图书馆信息捕获和语义索引

Proceedings of the Fourth International Conference on Document Analysis and Recognition Pub Date : 1997-08-18 DOI: 10.1109/ICDAR.1997.620603

F. Esposito, D. Malerba, G. Semeraro, Cesare Daniele Antifora, G. D. Gennaro

引用次数: 9

Variations on the analysis of architectural drawings 建筑图纸分析的变化

Proceedings of the Fourth International Conference on Document Analysis and Recognition Pub Date : 1997-08-18 DOI: 10.1109/ICDAR.1997.619869

Christian Ah-Soon, K. Tombre

引用次数: 47

New features for Chinese character recognition 新增汉字识别功能

Proceedings of the Fourth International Conference on Document Analysis and Recognition Pub Date : 1997-08-18 DOI: 10.1109/ICDAR.1997.620571

T. Caesar

引用次数: 0

Confidence computation improvement in an optical field reading system 光学场读数系统置信度计算改进

Proceedings of the Fourth International Conference on Document Analysis and Recognition Pub Date : 1997-08-18 DOI: 10.1109/ICDAR.1997.620629

A. Benedetti, Z. Kovács-Vajna

引用次数: 1

Handwritten ZIP code recognition 手写邮政编码识别

Proceedings of the Fourth International Conference on Document Analysis and Recognition Pub Date : 1997-08-18 DOI: 10.1109/ICDAR.1997.620613

Gregory I. Dzuba, Alexander Filatov, A. Volgunin

引用次数: 30