Proceedings of the Fourth International Conference on Document Analysis and Recognition最新文献

筛选
英文 中文
Evaluating OCR and non-OCR text representations for learning document classifiers 评估OCR和非OCR文本表示用于学习文档分类器
Markus Junker, R. Hoch
{"title":"Evaluating OCR and non-OCR text representations for learning document classifiers","authors":"Markus Junker, R. Hoch","doi":"10.1109/ICDAR.1997.620671","DOIUrl":"https://doi.org/10.1109/ICDAR.1997.620671","url":null,"abstract":"In the literature, many feature types and learning algorithms have been proposed for document classification. However, an extensive and systematic evaluation of the various approaches has not been done yet. In order to investigate different text representations for document classification, we have developed a tool which transforms documents into feature-value representations that are suitable for standard learning algorithms. In this paper, we investigate seven document representations for German texts based on n-grams and single words. We compare their effectiveness in classifying OCR texts and the corresponding correct ASCII texts in two domains: business letters and abstracts of technical reports. Our results indicate that the use of n-grams is an attractive technique which can even compare to techniques relying on a morphological analysis. This holds for OCR texts as well as for correct ASCII texts.","PeriodicalId":435320,"journal":{"name":"Proceedings of the Fourth International Conference on Document Analysis and Recognition","volume":"14 4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115071972","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 18
An evolutionary neuro-fuzzy approach to recognize on-line Arabic handwriting 一种进化神经模糊方法识别在线阿拉伯笔迹
A. Alimi
{"title":"An evolutionary neuro-fuzzy approach to recognize on-line Arabic handwriting","authors":"A. Alimi","doi":"10.1109/ICDAR.1997.619875","DOIUrl":"https://doi.org/10.1109/ICDAR.1997.619875","url":null,"abstract":"The author describes a system that recognizes on-line Arabic cursive handwriting. In this system, a genetic algorithm is used to select the best combination of characters recognized by a fuzzy neural network. The handwritten words used in this system are modelled by a theory of movement generation. Based on this motor theory, the features extracted from each character are the neuro-physiological and biomechanical parameters of the equation describing the curvilinear velocity of the script. The evolutionary approach proposed permits the recognition of cursive handwriting with a segmentation procedure allowing overlapped strokes having neuro-physiological meaning.","PeriodicalId":435320,"journal":{"name":"Proceedings of the Fourth International Conference on Document Analysis and Recognition","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114658407","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 98
Layout and language: preliminary investigations in recognizing the structure of tables 布局与语言:认识表格结构的初步调查
Matthew F. Hurst, Shona Douglas
{"title":"Layout and language: preliminary investigations in recognizing the structure of tables","authors":"Matthew F. Hurst, Shona Douglas","doi":"10.1109/ICDAR.1997.620668","DOIUrl":"https://doi.org/10.1109/ICDAR.1997.620668","url":null,"abstract":"Describes a prototype system for assigning table cells to their proper place in the logical structure of the table, based on a simple model of table structure combined with a number of measures of cohesion between cells. A framework is presented for examining the effect of particular variables on the performance of the system, and preliminary results are presented showing the effect of cohesion measures based on the simplest domain-independent analyses, with the aim allowing future comparison with more knowledge-intensive analyses based on natural language processing. These baseline results suggest that very simple string-based cohesion measures are not sufficient to support the extraction of tuples as we require. Future work will pursue the aim of more adequate approximations to a notional subtype/supertype definition of the relationship between value cells and label cells.","PeriodicalId":435320,"journal":{"name":"Proceedings of the Fourth International Conference on Document Analysis and Recognition","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128239473","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 42
High accuracy handwritten Chinese character recognition by improved feature matching method 基于改进特征匹配方法的高精度手写体汉字识别
Cheng-Lin Liu, In-Jung Kim, J. H. Kim
{"title":"High accuracy handwritten Chinese character recognition by improved feature matching method","authors":"Cheng-Lin Liu, In-Jung Kim, J. H. Kim","doi":"10.1109/ICDAR.1997.620666","DOIUrl":"https://doi.org/10.1109/ICDAR.1997.620666","url":null,"abstract":"Proposes some strategies to improve the recognition performance of a feature matching method for handwritten Chinese character recognition (HCCR). Favorable modifications are given to all stages throughout the recognition. In pre-processing, we devised a modified nonlinear normalization algorithm and a connectivity-preserving smoothing algorithm. For feature extraction, an efficient directional decomposition algorithm and a systematic approach to design a blurring mask are presented. Finally, a modified LVQ3 algorithm is applied to optimize the reference vectors for classification. The integrated effect of these strategies significantly improves the recognition performance. Recognition results on the large-vocabulary databases ETL8B2 and ETL9B are promising.","PeriodicalId":435320,"journal":{"name":"Proceedings of the Fourth International Conference on Document Analysis and Recognition","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128627510","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 31
Revealing the hidden Markov recognizer 揭示隐藏的马尔可夫识别器
Claus Aufmuth
{"title":"Revealing the hidden Markov recognizer","authors":"Claus Aufmuth","doi":"10.1109/ICDAR.1997.620563","DOIUrl":"https://doi.org/10.1109/ICDAR.1997.620563","url":null,"abstract":"The article describes a tool for visualizing hidden Markov recognizers (HMR) which allows the developer to get a detailed view of the recognition process. Improvements are suggested for a hidden Markov recognizer using an appropriate processing and visualization tool.","PeriodicalId":435320,"journal":{"name":"Proceedings of the Fourth International Conference on Document Analysis and Recognition","volume":"216 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134187712","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Moby Dick meets GEOCR: lexical considerations in word recognition 《白鲸记》符合GEOCR:单词识别中的词汇考虑
A. Spitz
{"title":"Moby Dick meets GEOCR: lexical considerations in word recognition","authors":"A. Spitz","doi":"10.1109/ICDAR.1997.619845","DOIUrl":"https://doi.org/10.1109/ICDAR.1997.619845","url":null,"abstract":"The author has previously (Proc. Int. Conf. on Doc. Anal. and Recognition, Montreal, pp. 723-728, 1995) described a high-speed, lexically driven OCR called GEOCR (Good Enough Optical Character Recognition). This paper expands on that work by describing the effects of lexical content, structure and processing on the performance of GEOCR as a word recognition engine, describing the recognition of a particular text, Moby Dick. Word recognition performance is shown to be enhanced by the application of an appropriate lexicon. Recognition speed is essentially independent of the details of lexical content, provided that the intersection of the occurrences of words in the document and the lexicon is high. Word recognition accuracy is dependent on both the intersection and specificity of the lexicon.","PeriodicalId":435320,"journal":{"name":"Proceedings of the Fourth International Conference on Document Analysis and Recognition","volume":"19 4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134404616","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Scalable image coding by spline approximation for a gray-scale image 灰度图像的样条近似可扩展图像编码
R. Haruki, T. Horiuchi
{"title":"Scalable image coding by spline approximation for a gray-scale image","authors":"R. Haruki, T. Horiuchi","doi":"10.1109/ICDAR.1997.619879","DOIUrl":"https://doi.org/10.1109/ICDAR.1997.619879","url":null,"abstract":"The proposed method expresses a gray-scale image by parametric spline functions for edge components and by two-variable spline functions for low frequency components. It can reconstruct the image keeping its quality for the basic shape transformation. If a binary image is input as a special case, the proposed method can make a scalable vector font automatically. The performance of the proposed method is verified by some experiments.","PeriodicalId":435320,"journal":{"name":"Proceedings of the Fourth International Conference on Document Analysis and Recognition","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134016161","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Recognition of facsimile documents using a database of robust features 利用鲁棒特征数据库识别传真文档
G. Raza, A. Hennig, N. Sherkat, R. Whitrow
{"title":"Recognition of facsimile documents using a database of robust features","authors":"G. Raza, A. Hennig, N. Sherkat, R. Whitrow","doi":"10.1109/ICDAR.1997.619886","DOIUrl":"https://doi.org/10.1109/ICDAR.1997.619886","url":null,"abstract":"A method for the recognition of poor quality documents containing touching characters is presented. The method is based on extraction of independent and robust features of each object of a sample word, where objects consist of single letters or of several touching ones. Thus avoiding letter segmentation the method eliminates errors frequently introduced in segmentation based approaches. Features are attributed by their position and extent in order to facilitate discrimination between different classes of objects. A method for automatic construction of a comprehensive database is presented. From a given dictionary every possible letter combination is obtained and the images of the artificially touching letters created. These images are subjected to noise and their features extracted. For recognition, alternatives for each object are found based on the database. Object alternatives are then combined into valid word alternatives using lexicon lookup. It has been observed that the developed method is effective for the recognition of poor quality documents.","PeriodicalId":435320,"journal":{"name":"Proceedings of the Fourth International Conference on Document Analysis and Recognition","volume":"117 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132996863","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Construction of retrieval system for pictorial book of flora 植物图画书检索系统的构建
Yasuhiko Watanabe, M. Nagao
{"title":"Construction of retrieval system for pictorial book of flora","authors":"Yasuhiko Watanabe, M. Nagao","doi":"10.1109/ICDAR.1997.620653","DOIUrl":"https://doi.org/10.1109/ICDAR.1997.620653","url":null,"abstract":"Pattern information and natural language information used together can complement and reinforce each other to enable more effective communication than can either medium alone. A good example is a pictorial book of flora (PBF). In the PBF, readable explanations combine texts and pictures. However, it is difficult to retrieve explanation text and pictures from the PBF when we don't know the names of flowers. To solve this problem, we propose a retrieval method for the PBF using the color feature of each flower and fruit, and construct an experimental retrieval system for the PBF. For obtaining the color feature of each flower and fruit, we analysed the PBF pictures and found several problems as follows: Pictures of the PBF contain many kinds of objects. In addition to flowers and fruits, there are leaves, stems, skies, soils, and sometimes humans in the PBF pictures. The position, size, and direction of flowers and fruits vary quite widely in each picture. Each flower and fruit has its unique shape, color, and texture which are commonly different from those of the others. Because of these problems, it is difficult to build the general and precise model for analyzing the PBF pictures in advance. We propose a method for image analysis using natural language information. Our method works as follows. First, we analyse the PBF explanation texts for extracting the color information on each flower and fruit. Then, we analyse the PBF pictures by using the results of the natural language processing, and finally obtain the color feature of each flower and fruit.","PeriodicalId":435320,"journal":{"name":"Proceedings of the Fourth International Conference on Document Analysis and Recognition","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133154770","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Skew and slant correction for document images using gradient direction 使用梯度方向对文档图像进行倾斜和倾斜校正
Changming Sun, Deyi Si
{"title":"Skew and slant correction for document images using gradient direction","authors":"Changming Sun, Deyi Si","doi":"10.1109/ICDAR.1997.619830","DOIUrl":"https://doi.org/10.1109/ICDAR.1997.619830","url":null,"abstract":"A fast algorithm is presented for skew and slant correction in printed document images. The algorithm employs only the gradient information. The skew angle is obtained by searching for a peak in the histogram of the gradient orientation of the input grey-level image. The skewness of the document is corrected by a rotation at such an angle. The slant of characters can also be detected using the same technique, and can be corrected by a shear operation. A second method for character slant correction by fitting parallelograms to the connected components is also described. Document images with different contents (tables, figures, and photos) have been tested for skew correction and the algorithm gives accurate results on all the test images, and the algorithm is very easy to implement.","PeriodicalId":435320,"journal":{"name":"Proceedings of the Fourth International Conference on Document Analysis and Recognition","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132390563","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 93
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信