表单单元识别的递归分析

Hiroshi Shinjo, Eiichi Hadano, K. Marukawa, Y. Shima, H. Sako
{"title":"表单单元识别的递归分析","authors":"Hiroshi Shinjo, Eiichi Hadano, K. Marukawa, Y. Shima, H. Sako","doi":"10.1109/ICDAR.2001.953879","DOIUrl":null,"url":null,"abstract":"It is very difficult to analyze form structures because of breaks in lines and additional noises in the form image. This paper focuses on cell recognition in low quality form images. The recognition method has two features to achieve robustness in cell recognition. One is grid representation using several types of intersection and the terminal points of the frame lines. The other is the recursive modification of the representation. A new representation is created according to the determination of the breaks in the line and the hypothesized location of the missed intersections by using the previous representation. The modification is processed recursively until the representation has perfect consistency and all form cells are detected. In an experiment using 1565 form samples, all cells in 1538 samples (98.3% of 1565 samples) were recognized correctly by this method.","PeriodicalId":277816,"journal":{"name":"Proceedings of Sixth International Conference on Document Analysis and Recognition","volume":"31 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2001-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"19","resultStr":"{\"title\":\"A recursive analysis for form cell recognition\",\"authors\":\"Hiroshi Shinjo, Eiichi Hadano, K. Marukawa, Y. Shima, H. Sako\",\"doi\":\"10.1109/ICDAR.2001.953879\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"It is very difficult to analyze form structures because of breaks in lines and additional noises in the form image. This paper focuses on cell recognition in low quality form images. The recognition method has two features to achieve robustness in cell recognition. One is grid representation using several types of intersection and the terminal points of the frame lines. The other is the recursive modification of the representation. A new representation is created according to the determination of the breaks in the line and the hypothesized location of the missed intersections by using the previous representation. The modification is processed recursively until the representation has perfect consistency and all form cells are detected. In an experiment using 1565 form samples, all cells in 1538 samples (98.3% of 1565 samples) were recognized correctly by this method.\",\"PeriodicalId\":277816,\"journal\":{\"name\":\"Proceedings of Sixth International Conference on Document Analysis and Recognition\",\"volume\":\"31 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2001-09-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"19\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of Sixth International Conference on Document Analysis and Recognition\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICDAR.2001.953879\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of Sixth International Conference on Document Analysis and Recognition","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDAR.2001.953879","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 19

摘要

由于表单图像中存在断行和附加噪声,使得表单结构分析变得非常困难。本文主要研究低质量表单图像中的细胞识别问题。该识别方法具有两个特征,以实现细胞识别的鲁棒性。一种是使用几种类型的交点和框架线的端点的网格表示。另一种是表示的递归修改。通过使用先前的表示,根据线中断点的确定和错过交叉点的假设位置,创建新的表示。递归地处理修改,直到表示具有完美的一致性并检测到所有表单单元格。在使用1565个表单样本的实验中,该方法对1538个样本中的所有细胞(占1565个样本的98.3%)进行了正确识别。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A recursive analysis for form cell recognition
It is very difficult to analyze form structures because of breaks in lines and additional noises in the form image. This paper focuses on cell recognition in low quality form images. The recognition method has two features to achieve robustness in cell recognition. One is grid representation using several types of intersection and the terminal points of the frame lines. The other is the recursive modification of the representation. A new representation is created according to the determination of the breaks in the line and the hypothesized location of the missed intersections by using the previous representation. The modification is processed recursively until the representation has perfect consistency and all form cells are detected. In an experiment using 1565 form samples, all cells in 1538 samples (98.3% of 1565 samples) were recognized correctly by this method.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信