基于KNN分类器的离线手写阿拉伯词识别新方法

J. AlKhateeb, F. Khelifi, Jianmin Jiang, S. Ipson
{"title":"基于KNN分类器的离线手写阿拉伯词识别新方法","authors":"J. AlKhateeb, F. Khelifi, Jianmin Jiang, S. Ipson","doi":"10.1109/ICSIPA.2009.5478620","DOIUrl":null,"url":null,"abstract":"Due to similarities between Arabic letters, and the various writing styles employed, recognition of Arabic handwritten text can be difficult. In this paper, an off-line Arabic handwritten word recognition system is proposed, in which technical details are presented in terms of three stages, i.e. preprocessing, feature extraction and classification. Firstly, words are segmented from input scripts and also normalized in size. Secondly, each segmented word is divided into overlapping blocks. Absolute mean values computed for each block of segmented words constitutes a feature vector. Finally, the resulting feature vectors are used to classify the words using the K nearest Neighbour classifier (KNN). The proposed system has been successfully tested on the IFN/ENIT database consisting of 32492 Arabic handwritten words which are written by more than 1000 different writers. Experimental results show a good recognition rate when compared with other methods.","PeriodicalId":400165,"journal":{"name":"2009 IEEE International Conference on Signal and Image Processing Applications","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"28","resultStr":"{\"title\":\"A new approach for off-line handwritten Arabic word recognition using KNN classifier\",\"authors\":\"J. AlKhateeb, F. Khelifi, Jianmin Jiang, S. Ipson\",\"doi\":\"10.1109/ICSIPA.2009.5478620\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Due to similarities between Arabic letters, and the various writing styles employed, recognition of Arabic handwritten text can be difficult. In this paper, an off-line Arabic handwritten word recognition system is proposed, in which technical details are presented in terms of three stages, i.e. preprocessing, feature extraction and classification. Firstly, words are segmented from input scripts and also normalized in size. Secondly, each segmented word is divided into overlapping blocks. Absolute mean values computed for each block of segmented words constitutes a feature vector. Finally, the resulting feature vectors are used to classify the words using the K nearest Neighbour classifier (KNN). The proposed system has been successfully tested on the IFN/ENIT database consisting of 32492 Arabic handwritten words which are written by more than 1000 different writers. Experimental results show a good recognition rate when compared with other methods.\",\"PeriodicalId\":400165,\"journal\":{\"name\":\"2009 IEEE International Conference on Signal and Image Processing Applications\",\"volume\":\"10 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2009-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"28\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2009 IEEE International Conference on Signal and Image Processing Applications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICSIPA.2009.5478620\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 IEEE International Conference on Signal and Image Processing Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSIPA.2009.5478620","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 28

摘要

由于阿拉伯字母之间的相似性,以及所采用的各种写作风格,识别阿拉伯手写文本可能很困难。本文提出了一种离线阿拉伯文手写体词识别系统,从预处理、特征提取和分类三个阶段详细介绍了该系统的技术细节。首先,从输入脚本中分割单词,并在大小上进行规范化。其次,将每个分词分成重叠的块。对每个分词块计算的绝对平均值构成一个特征向量。最后,使用K近邻分类器(KNN)将得到的特征向量用于对单词进行分类。该系统已在IFN/ENIT数据库上成功测试,该数据库包含32492个阿拉伯语手写单词,这些单词由1000多个不同的作者撰写。实验结果表明,该方法具有较好的识别率。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A new approach for off-line handwritten Arabic word recognition using KNN classifier
Due to similarities between Arabic letters, and the various writing styles employed, recognition of Arabic handwritten text can be difficult. In this paper, an off-line Arabic handwritten word recognition system is proposed, in which technical details are presented in terms of three stages, i.e. preprocessing, feature extraction and classification. Firstly, words are segmented from input scripts and also normalized in size. Secondly, each segmented word is divided into overlapping blocks. Absolute mean values computed for each block of segmented words constitutes a feature vector. Finally, the resulting feature vectors are used to classify the words using the K nearest Neighbour classifier (KNN). The proposed system has been successfully tested on the IFN/ENIT database consisting of 32492 Arabic handwritten words which are written by more than 1000 different writers. Experimental results show a good recognition rate when compared with other methods.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信