{"title":"基于支持向量机的泰米尔语手写字符识别的关键点袋方法","authors":"A. Subashini, N. Kodikara","doi":"10.1109/ICTER.2011.6075033","DOIUrl":null,"url":null,"abstract":"In this paper, the bag-of-keypoints approach for the off-line recognition of Tamil handwritten characters is investigated. Various pre-processing operations are performed on the digitised image to enhance the quality of an image. In the proposed method each pre-processed character image is represented by a set of local-invariant SIFT feature vectors. From a set of reference vectors, the key idea is to create a codebook for each character using K-means clustering algorithm. Then, the bag-of-keypoints are computed for the total number of character images. These features are used to train a linear support vector machine. A target character is predicted to exactly one of the twenty character classes. An average recognition rate of 81.62% on the character level has been achieved in experiments using six thousand training and two thousand testing images of twenty selected character classes. These results clearly demonstrate that the method produces good recognition accuracy on the handwritten Tamil character database and can be extended with more characters and more samples being recognised.","PeriodicalId":325730,"journal":{"name":"2011 International Conference on Advances in ICT for Emerging Regions (ICTer)","volume":"153 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Bag-of-keypoints approach for Tamil handwritten character recognition using SVMs\",\"authors\":\"A. Subashini, N. Kodikara\",\"doi\":\"10.1109/ICTER.2011.6075033\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, the bag-of-keypoints approach for the off-line recognition of Tamil handwritten characters is investigated. Various pre-processing operations are performed on the digitised image to enhance the quality of an image. In the proposed method each pre-processed character image is represented by a set of local-invariant SIFT feature vectors. From a set of reference vectors, the key idea is to create a codebook for each character using K-means clustering algorithm. Then, the bag-of-keypoints are computed for the total number of character images. These features are used to train a linear support vector machine. A target character is predicted to exactly one of the twenty character classes. An average recognition rate of 81.62% on the character level has been achieved in experiments using six thousand training and two thousand testing images of twenty selected character classes. These results clearly demonstrate that the method produces good recognition accuracy on the handwritten Tamil character database and can be extended with more characters and more samples being recognised.\",\"PeriodicalId\":325730,\"journal\":{\"name\":\"2011 International Conference on Advances in ICT for Emerging Regions (ICTer)\",\"volume\":\"153 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2011-11-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2011 International Conference on Advances in ICT for Emerging Regions (ICTer)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICTER.2011.6075033\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 International Conference on Advances in ICT for Emerging Regions (ICTer)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICTER.2011.6075033","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Bag-of-keypoints approach for Tamil handwritten character recognition using SVMs
In this paper, the bag-of-keypoints approach for the off-line recognition of Tamil handwritten characters is investigated. Various pre-processing operations are performed on the digitised image to enhance the quality of an image. In the proposed method each pre-processed character image is represented by a set of local-invariant SIFT feature vectors. From a set of reference vectors, the key idea is to create a codebook for each character using K-means clustering algorithm. Then, the bag-of-keypoints are computed for the total number of character images. These features are used to train a linear support vector machine. A target character is predicted to exactly one of the twenty character classes. An average recognition rate of 81.62% on the character level has been achieved in experiments using six thousand training and two thousand testing images of twenty selected character classes. These results clearly demonstrate that the method produces good recognition accuracy on the handwritten Tamil character database and can be extended with more characters and more samples being recognised.