一种阿拉伯和印度数字邮政编码的混合OCR系统

Y. Alginahi, A. A. Siddiqi
{"title":"一种阿拉伯和印度数字邮政编码的混合OCR系统","authors":"Y. Alginahi, A. A. Siddiqi","doi":"10.1109/ICCTD.2009.162","DOIUrl":null,"url":null,"abstract":"Arabic text recognition offers unique technical challenges and has been addressed more recently in the document analysis research field than other languages. Automatic Arabic/Indian numeral Optical Character Recognition (OCR) system for postal services are used in many countries, but still there are problems in such systems where machines still provide errors in reading the crucial information needed to distribute the mail efficiently. The need to investigate fast and efficient recognition methods is important so as to correctly read the postal codes from mail envelopes. The significance of this study is to recognize essential information, e.g., postal codes from the mail envelopes, by applying the OCR methods. The proposed system is a hybrid of three different feature extraction methods and classification methods. The proposed system, systematically compares the performance of each and every method, and makes sure that the numeral is recognized or rejected. The results provide a recognition rate of 99.4%.","PeriodicalId":269403,"journal":{"name":"2009 International Conference on Computer Technology and Development","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2009-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"A Proposed Hybrid OCR System for Arabic and Indian Numerical Postal Codes\",\"authors\":\"Y. Alginahi, A. A. Siddiqi\",\"doi\":\"10.1109/ICCTD.2009.162\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Arabic text recognition offers unique technical challenges and has been addressed more recently in the document analysis research field than other languages. Automatic Arabic/Indian numeral Optical Character Recognition (OCR) system for postal services are used in many countries, but still there are problems in such systems where machines still provide errors in reading the crucial information needed to distribute the mail efficiently. The need to investigate fast and efficient recognition methods is important so as to correctly read the postal codes from mail envelopes. The significance of this study is to recognize essential information, e.g., postal codes from the mail envelopes, by applying the OCR methods. The proposed system is a hybrid of three different feature extraction methods and classification methods. The proposed system, systematically compares the performance of each and every method, and makes sure that the numeral is recognized or rejected. The results provide a recognition rate of 99.4%.\",\"PeriodicalId\":269403,\"journal\":{\"name\":\"2009 International Conference on Computer Technology and Development\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2009-11-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2009 International Conference on Computer Technology and Development\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCTD.2009.162\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 International Conference on Computer Technology and Development","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCTD.2009.162","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

摘要

阿拉伯语文本识别提供了独特的技术挑战,并且比其他语言最近在文档分析研究领域得到了解决。许多国家都在使用用于邮政服务的自动阿拉伯/印度数字光学字符识别(OCR)系统,但这种系统仍然存在问题,即机器在读取有效分发邮件所需的关键信息时仍然会出现错误。为了正确地从信封中读取邮政编码,研究快速有效的识别方法是很重要的。本研究的意义在于通过OCR方法从邮件信封中识别出必要的信息,例如邮政编码。该系统混合了三种不同的特征提取方法和分类方法。该系统系统地比较了每种方法的性能,并确保数字被识别或拒绝。结果表明,识别率为99.4%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A Proposed Hybrid OCR System for Arabic and Indian Numerical Postal Codes
Arabic text recognition offers unique technical challenges and has been addressed more recently in the document analysis research field than other languages. Automatic Arabic/Indian numeral Optical Character Recognition (OCR) system for postal services are used in many countries, but still there are problems in such systems where machines still provide errors in reading the crucial information needed to distribute the mail efficiently. The need to investigate fast and efficient recognition methods is important so as to correctly read the postal codes from mail envelopes. The significance of this study is to recognize essential information, e.g., postal codes from the mail envelopes, by applying the OCR methods. The proposed system is a hybrid of three different feature extraction methods and classification methods. The proposed system, systematically compares the performance of each and every method, and makes sure that the numeral is recognized or rejected. The results provide a recognition rate of 99.4%.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信