基于特征提取的离线中文手写字符识别

2016 13th International Conference on Computer Graphics, Imaging and Visualization (CGiV) Pub Date : 2016-03-01 DOI:10.1109/CGIV.2016.83

Yucheng Luo, Rui Xia, M. Abdulghafour

{"title":"基于特征提取的离线中文手写字符识别","authors":"Yucheng Luo, Rui Xia, M. Abdulghafour","doi":"10.1109/CGIV.2016.83","DOIUrl":null,"url":null,"abstract":"In this paper, new methods were developed to successfully identify Chinese handwriting characters. These methods are based on features extraction as compared and matched with HCL2000 database [1]. Several algorithms were applied for binarization, smoothing, noise reduction and thinning to an image of a single Chinese character. Then the image is given to a structural feature extracting algorithm, which transforms that character into an undirected graph with unique coordinates of all nodes. The resulting graph was compared with 3755 samples from the database one by one, whose features are also extracted and stored in a graph. The total deviation between two characters was obtained by comparing edges from the generated undirected graph representing a character and its counterpart graph which is generated from the image in HCL 2000 database. Based on the measurements of lengths, orientation, and areas between lines, the best match was selected as the result of recognition. Additional principles are also included in order to assure the accuracy. During the matching, the graph may sometimes be slightly transformed or modified to maximize the fitness criteria. Experimental results are accomplished by the use of 1000 random characters to test the effectiveness. The accuracy of the recognition system is significant. Analysis and experimental results are presented.","PeriodicalId":351561,"journal":{"name":"2016 13th International Conference on Computer Graphics, Imaging and Visualization (CGiV)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Offline Chinese Handwriting Character Recognition through Feature Extraction\",\"authors\":\"Yucheng Luo, Rui Xia, M. Abdulghafour\",\"doi\":\"10.1109/CGIV.2016.83\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, new methods were developed to successfully identify Chinese handwriting characters. These methods are based on features extraction as compared and matched with HCL2000 database [1]. Several algorithms were applied for binarization, smoothing, noise reduction and thinning to an image of a single Chinese character. Then the image is given to a structural feature extracting algorithm, which transforms that character into an undirected graph with unique coordinates of all nodes. The resulting graph was compared with 3755 samples from the database one by one, whose features are also extracted and stored in a graph. The total deviation between two characters was obtained by comparing edges from the generated undirected graph representing a character and its counterpart graph which is generated from the image in HCL 2000 database. Based on the measurements of lengths, orientation, and areas between lines, the best match was selected as the result of recognition. Additional principles are also included in order to assure the accuracy. During the matching, the graph may sometimes be slightly transformed or modified to maximize the fitness criteria. Experimental results are accomplished by the use of 1000 random characters to test the effectiveness. The accuracy of the recognition system is significant. Analysis and experimental results are presented.\",\"PeriodicalId\":351561,\"journal\":{\"name\":\"2016 13th International Conference on Computer Graphics, Imaging and Visualization (CGiV)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-03-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 13th International Conference on Computer Graphics, Imaging and Visualization (CGiV)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CGIV.2016.83\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 13th International Conference on Computer Graphics, Imaging and Visualization (CGiV)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CGIV.2016.83","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 5

摘要

本文提出了一种新的汉字识别方法。这些方法都是基于特征提取，并与HCL2000数据库进行对比匹配[1]。对单个汉字图像进行了二值化、平滑、降噪和细化等处理。然后将图像交给结构特征提取算法，该算法将该特征转化为所有节点坐标唯一的无向图。将得到的图与数据库中的3755个样本逐一进行比较，提取其特征并存储在图中。通过比较从HCL 2000数据库图像中生成的字符无向图与对应图的边缘，得到两个字符之间的总偏差。通过测量线之间的长度、方向和面积，选择最佳匹配作为识别结果。为了保证准确性，还包括了其他原则。在匹配过程中，有时可能会对图进行轻微的转换或修改，以使适应度标准最大化。通过使用1000个随机字符来验证该方法的有效性。识别系统的准确性是非常重要的。给出了分析和实验结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Offline Chinese Handwriting Character Recognition through Feature Extraction

In this paper, new methods were developed to successfully identify Chinese handwriting characters. These methods are based on features extraction as compared and matched with HCL2000 database [1]. Several algorithms were applied for binarization, smoothing, noise reduction and thinning to an image of a single Chinese character. Then the image is given to a structural feature extracting algorithm, which transforms that character into an undirected graph with unique coordinates of all nodes. The resulting graph was compared with 3755 samples from the database one by one, whose features are also extracted and stored in a graph. The total deviation between two characters was obtained by comparing edges from the generated undirected graph representing a character and its counterpart graph which is generated from the image in HCL 2000 database. Based on the measurements of lengths, orientation, and areas between lines, the best match was selected as the result of recognition. Additional principles are also included in order to assure the accuracy. During the matching, the graph may sometimes be slightly transformed or modified to maximize the fitness criteria. Experimental results are accomplished by the use of 1000 random characters to test the effectiveness. The accuracy of the recognition system is significant. Analysis and experimental results are presented.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2016 13th International Conference on Computer Graphics, Imaging and Visualization (CGiV)

自引率

0.00%

发文量