F-ratio Based Weighted Feature Extraction for Similar Shape Character Recognition

T. Wakabayashi, U. Pal, F. Kimura, Y. Miyake
{"title":"F-ratio Based Weighted Feature Extraction for Similar Shape Character Recognition","authors":"T. Wakabayashi, U. Pal, F. Kimura, Y. Miyake","doi":"10.1109/ICDAR.2009.197","DOIUrl":null,"url":null,"abstract":"Recognition of handwritten similar shaped character is a difficult problem and in character recognition system most of the errors occur from similar shaped characters. In this paper we proposed a novel feature extraction technique to improve the recognition results of two similar shaped characters. The technique is based on F-ratio (Fisher Ratio), a statistical measure defined by the ratio to the between-class variance and within-class variance. F-ratio modifies the feature vector of two similar shape characters by weighting the feature elements. This weighting scheme enhances the feature elements that belongs to the distinguishable portions of the similar shaped characters and reduces the feature elements of the common portion of the characters, so that similar shaped characters can be identified easily. We considered pair of handwritten similar shape characters of different scripts like Arabic/Persian, Devnagari English, Bangla, Oriya, Tamil, Kannada, Telugu etc. and we noted that f-ratio based feature weighting shows better recognition results.","PeriodicalId":433762,"journal":{"name":"2009 10th International Conference on Document Analysis and Recognition","volume":"46 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"43","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 10th International Conference on Document Analysis and Recognition","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDAR.2009.197","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 43

Abstract

Recognition of handwritten similar shaped character is a difficult problem and in character recognition system most of the errors occur from similar shaped characters. In this paper we proposed a novel feature extraction technique to improve the recognition results of two similar shaped characters. The technique is based on F-ratio (Fisher Ratio), a statistical measure defined by the ratio to the between-class variance and within-class variance. F-ratio modifies the feature vector of two similar shape characters by weighting the feature elements. This weighting scheme enhances the feature elements that belongs to the distinguishable portions of the similar shaped characters and reduces the feature elements of the common portion of the characters, so that similar shaped characters can be identified easily. We considered pair of handwritten similar shape characters of different scripts like Arabic/Persian, Devnagari English, Bangla, Oriya, Tamil, Kannada, Telugu etc. and we noted that f-ratio based feature weighting shows better recognition results.
基于F-ratio的相似形状字符加权特征提取
手写体仿形字符的识别是一个难题,在字符识别系统中,大部分错误都是由仿形字符产生的。本文提出了一种新的特征提取技术,以提高两个形状相似字符的识别效果。该技术基于F-ratio (Fisher Ratio),这是一种由类间方差和类内方差之比定义的统计度量。F-ratio通过加权特征元素来修改两个相似形状字符的特征向量。该加权方案增强了属于相似形字符可区分部分的特征元素,减少了属于相似形字符共同部分的特征元素,从而使相似形字符易于识别。我们考虑了阿拉伯语/波斯语、德文加里英语、孟加拉语、奥里亚语、泰米尔语、卡纳达语、泰卢固语等不同文字的手写相似形状的字符,我们注意到基于f比率的特征加权显示出更好的识别结果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信