基于特征提取的离线中文手写字符识别

Yucheng Luo, Rui Xia, M. Abdulghafour
{"title":"基于特征提取的离线中文手写字符识别","authors":"Yucheng Luo, Rui Xia, M. Abdulghafour","doi":"10.1109/CGIV.2016.83","DOIUrl":null,"url":null,"abstract":"In this paper, new methods were developed to successfully identify Chinese handwriting characters. These methods are based on features extraction as compared and matched with HCL2000 database [1]. Several algorithms were applied for binarization, smoothing, noise reduction and thinning to an image of a single Chinese character. Then the image is given to a structural feature extracting algorithm, which transforms that character into an undirected graph with unique coordinates of all nodes. The resulting graph was compared with 3755 samples from the database one by one, whose features are also extracted and stored in a graph. The total deviation between two characters was obtained by comparing edges from the generated undirected graph representing a character and its counterpart graph which is generated from the image in HCL 2000 database. Based on the measurements of lengths, orientation, and areas between lines, the best match was selected as the result of recognition. Additional principles are also included in order to assure the accuracy. During the matching, the graph may sometimes be slightly transformed or modified to maximize the fitness criteria. Experimental results are accomplished by the use of 1000 random characters to test the effectiveness. The accuracy of the recognition system is significant. Analysis and experimental results are presented.","PeriodicalId":351561,"journal":{"name":"2016 13th International Conference on Computer Graphics, Imaging and Visualization (CGiV)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Offline Chinese Handwriting Character Recognition through Feature Extraction\",\"authors\":\"Yucheng Luo, Rui Xia, M. Abdulghafour\",\"doi\":\"10.1109/CGIV.2016.83\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, new methods were developed to successfully identify Chinese handwriting characters. These methods are based on features extraction as compared and matched with HCL2000 database [1]. Several algorithms were applied for binarization, smoothing, noise reduction and thinning to an image of a single Chinese character. Then the image is given to a structural feature extracting algorithm, which transforms that character into an undirected graph with unique coordinates of all nodes. The resulting graph was compared with 3755 samples from the database one by one, whose features are also extracted and stored in a graph. The total deviation between two characters was obtained by comparing edges from the generated undirected graph representing a character and its counterpart graph which is generated from the image in HCL 2000 database. Based on the measurements of lengths, orientation, and areas between lines, the best match was selected as the result of recognition. Additional principles are also included in order to assure the accuracy. During the matching, the graph may sometimes be slightly transformed or modified to maximize the fitness criteria. Experimental results are accomplished by the use of 1000 random characters to test the effectiveness. The accuracy of the recognition system is significant. Analysis and experimental results are presented.\",\"PeriodicalId\":351561,\"journal\":{\"name\":\"2016 13th International Conference on Computer Graphics, Imaging and Visualization (CGiV)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-03-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 13th International Conference on Computer Graphics, Imaging and Visualization (CGiV)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CGIV.2016.83\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 13th International Conference on Computer Graphics, Imaging and Visualization (CGiV)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CGIV.2016.83","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5

摘要

本文提出了一种新的汉字识别方法。这些方法都是基于特征提取,并与HCL2000数据库进行对比匹配[1]。对单个汉字图像进行了二值化、平滑、降噪和细化等处理。然后将图像交给结构特征提取算法,该算法将该特征转化为所有节点坐标唯一的无向图。将得到的图与数据库中的3755个样本逐一进行比较,提取其特征并存储在图中。通过比较从HCL 2000数据库图像中生成的字符无向图与对应图的边缘,得到两个字符之间的总偏差。通过测量线之间的长度、方向和面积,选择最佳匹配作为识别结果。为了保证准确性,还包括了其他原则。在匹配过程中,有时可能会对图进行轻微的转换或修改,以使适应度标准最大化。通过使用1000个随机字符来验证该方法的有效性。识别系统的准确性是非常重要的。给出了分析和实验结果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Offline Chinese Handwriting Character Recognition through Feature Extraction
In this paper, new methods were developed to successfully identify Chinese handwriting characters. These methods are based on features extraction as compared and matched with HCL2000 database [1]. Several algorithms were applied for binarization, smoothing, noise reduction and thinning to an image of a single Chinese character. Then the image is given to a structural feature extracting algorithm, which transforms that character into an undirected graph with unique coordinates of all nodes. The resulting graph was compared with 3755 samples from the database one by one, whose features are also extracted and stored in a graph. The total deviation between two characters was obtained by comparing edges from the generated undirected graph representing a character and its counterpart graph which is generated from the image in HCL 2000 database. Based on the measurements of lengths, orientation, and areas between lines, the best match was selected as the result of recognition. Additional principles are also included in order to assure the accuracy. During the matching, the graph may sometimes be slightly transformed or modified to maximize the fitness criteria. Experimental results are accomplished by the use of 1000 random characters to test the effectiveness. The accuracy of the recognition system is significant. Analysis and experimental results are presented.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信