Bag-of-keypoints approach for Tamil handwritten character recognition using SVMs

A. Subashini, N. Kodikara
{"title":"Bag-of-keypoints approach for Tamil handwritten character recognition using SVMs","authors":"A. Subashini, N. Kodikara","doi":"10.1109/ICTER.2011.6075033","DOIUrl":null,"url":null,"abstract":"In this paper, the bag-of-keypoints approach for the off-line recognition of Tamil handwritten characters is investigated. Various pre-processing operations are performed on the digitised image to enhance the quality of an image. In the proposed method each pre-processed character image is represented by a set of local-invariant SIFT feature vectors. From a set of reference vectors, the key idea is to create a codebook for each character using K-means clustering algorithm. Then, the bag-of-keypoints are computed for the total number of character images. These features are used to train a linear support vector machine. A target character is predicted to exactly one of the twenty character classes. An average recognition rate of 81.62% on the character level has been achieved in experiments using six thousand training and two thousand testing images of twenty selected character classes. These results clearly demonstrate that the method produces good recognition accuracy on the handwritten Tamil character database and can be extended with more characters and more samples being recognised.","PeriodicalId":325730,"journal":{"name":"2011 International Conference on Advances in ICT for Emerging Regions (ICTer)","volume":"153 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 International Conference on Advances in ICT for Emerging Regions (ICTer)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICTER.2011.6075033","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

Abstract

In this paper, the bag-of-keypoints approach for the off-line recognition of Tamil handwritten characters is investigated. Various pre-processing operations are performed on the digitised image to enhance the quality of an image. In the proposed method each pre-processed character image is represented by a set of local-invariant SIFT feature vectors. From a set of reference vectors, the key idea is to create a codebook for each character using K-means clustering algorithm. Then, the bag-of-keypoints are computed for the total number of character images. These features are used to train a linear support vector machine. A target character is predicted to exactly one of the twenty character classes. An average recognition rate of 81.62% on the character level has been achieved in experiments using six thousand training and two thousand testing images of twenty selected character classes. These results clearly demonstrate that the method produces good recognition accuracy on the handwritten Tamil character database and can be extended with more characters and more samples being recognised.
基于支持向量机的泰米尔语手写字符识别的关键点袋方法
本文研究了泰米尔语手写体字符离线识别的关键点袋方法。对数字化图像进行各种预处理操作,以提高图像的质量。在该方法中,每个预处理的字符图像由一组局部不变的SIFT特征向量表示。从一组参考向量中,关键思想是使用K-means聚类算法为每个字符创建一个码本。然后,计算字符图像总数的关键点袋。这些特征被用来训练线性支持向量机。目标角色被预测为20个角色类别中的一个。在选取20个字符类的6000张训练图像和2000张测试图像进行实验,在字符水平上的平均识别率达到81.62%。结果表明,该方法在泰米尔语手写体字符库上具有较好的识别精度,并且可以扩展到更多的字符和更多的样本。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信