A fast recognition system for isolated arabic characters

J. Cowell, F. Hussain
{"title":"A fast recognition system for isolated arabic characters","authors":"J. Cowell, F. Hussain","doi":"10.1109/IV.2002.1028844","DOIUrl":null,"url":null,"abstract":"This paper presents a very fast multi-stage algorithm for the recognition of non-Latin script. Although the examples use Arabic script, the system could be adapted in minutes to deal with any character set, in particular non-Latin characters where no commercial OCR systems are available. The approach used normalises isolated characters for size and extracts an image signature based on the number of black pixels in the rows and columns of the character and compares these values to a set of signatures for typical characters of the set. This technique identifies not only the closet match but gives the closeness of match to all other characters in the set, which is expressed in a triangular confusion matrix.","PeriodicalId":308951,"journal":{"name":"Proceedings Sixth International Conference on Information Visualisation","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2002-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"37","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings Sixth International Conference on Information Visualisation","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IV.2002.1028844","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 37

Abstract

This paper presents a very fast multi-stage algorithm for the recognition of non-Latin script. Although the examples use Arabic script, the system could be adapted in minutes to deal with any character set, in particular non-Latin characters where no commercial OCR systems are available. The approach used normalises isolated characters for size and extracts an image signature based on the number of black pixels in the rows and columns of the character and compares these values to a set of signatures for typical characters of the set. This technique identifies not only the closet match but gives the closeness of match to all other characters in the set, which is expressed in a triangular confusion matrix.
孤立阿拉伯字符的快速识别系统
本文提出了一种快速的多阶段非拉丁文字识别算法。虽然示例使用阿拉伯文字,但该系统可以在几分钟内适应处理任何字符集,特别是没有商用OCR系统的非拉丁字符。所使用的方法对孤立字符的大小进行规范化,并根据字符的行和列中的黑色像素数提取图像签名,并将这些值与该集合中典型字符的一组签名进行比较。该技术不仅确定了壁橱匹配,还给出了与集合中所有其他字符的匹配度,这用三角形混淆矩阵表示。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信