使用levenshtein距离的结构化离线手写字符识别

M. Putra, I. Supriana
{"title":"使用levenshtein距离的结构化离线手写字符识别","authors":"M. Putra, I. Supriana","doi":"10.1109/ICEEI.2015.7352465","DOIUrl":null,"url":null,"abstract":"Research in offline handwriting recognition for unconstrained text remains a difficult challenge. Some problems such as noise in image, skew of text, cursive letters, and various handwriting style is still an open problem. Many method has been researched to solve those problems, such as k-NN, Neural Network, SVM and HMM. And to improve the recognition result, there are many methods can be implemented in the prerocessing stage. One of them is Kimura's method for slant correction that using chain code for character slant prediction. Those method consumes time and resource in its computation, meanwhile the accuracy is not improved significantly. Therefore, this paper propose to create handwritten character into graph with string representation based on structural approach. The purpose is to provide ability in improving recognition accuracy without relying on normalisation technique. The similarity distance between graphs measured using levenshtein distance. Experiment conducted to recognize handwritten upper-case letters and digits character images which taken from ETL-1 AIST databases. Levenshtein distance has an accuracy of 84.69% on digits and 67.01% on alphabet with 5% size of data for training and value 10 for string representation length. As a comparison, the Kimura's method are implemented for slant correction which results in a reduction of accuracy until 6%. Comparisons also made with some of previous work.","PeriodicalId":426454,"journal":{"name":"2015 International Conference on Electrical Engineering and Informatics (ICEEI)","volume":"206 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Structural offline handwriting character recognition using levenshtein distance\",\"authors\":\"M. Putra, I. Supriana\",\"doi\":\"10.1109/ICEEI.2015.7352465\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Research in offline handwriting recognition for unconstrained text remains a difficult challenge. Some problems such as noise in image, skew of text, cursive letters, and various handwriting style is still an open problem. Many method has been researched to solve those problems, such as k-NN, Neural Network, SVM and HMM. And to improve the recognition result, there are many methods can be implemented in the prerocessing stage. One of them is Kimura's method for slant correction that using chain code for character slant prediction. Those method consumes time and resource in its computation, meanwhile the accuracy is not improved significantly. Therefore, this paper propose to create handwritten character into graph with string representation based on structural approach. The purpose is to provide ability in improving recognition accuracy without relying on normalisation technique. The similarity distance between graphs measured using levenshtein distance. Experiment conducted to recognize handwritten upper-case letters and digits character images which taken from ETL-1 AIST databases. Levenshtein distance has an accuracy of 84.69% on digits and 67.01% on alphabet with 5% size of data for training and value 10 for string representation length. As a comparison, the Kimura's method are implemented for slant correction which results in a reduction of accuracy until 6%. Comparisons also made with some of previous work.\",\"PeriodicalId\":426454,\"journal\":{\"name\":\"2015 International Conference on Electrical Engineering and Informatics (ICEEI)\",\"volume\":\"206 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-12-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2015 International Conference on Electrical Engineering and Informatics (ICEEI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICEEI.2015.7352465\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 International Conference on Electrical Engineering and Informatics (ICEEI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICEEI.2015.7352465","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6

摘要

无约束文本离线手写识别的研究仍然是一个困难的挑战。一些问题,如图像噪声、文本倾斜、草书字母和各种手写风格仍然是一个悬而未决的问题。人们研究了许多方法来解决这些问题,如k-NN、神经网络、支持向量机和HMM。为了提高识别效果,在预处理阶段可以采用多种方法。其中之一是木村的斜度校正方法,即使用链码进行字符斜度预测。该方法在计算过程中消耗大量时间和资源,且精度没有明显提高。因此,本文提出基于结构化的方法,将手写字符用字符串表示成图形。目的是提供在不依赖于归一化技术的情况下提高识别精度的能力。用levenshtein距离测量的图之间的相似距离。对取自ETL-1 AIST数据库的手写大写字母和数字字符图像进行了识别实验。Levenshtein距离在数字上的准确率为84.69%,在字母上的准确率为67.01%,训练数据大小为5%,字符串表示长度为10。作为比较,木村的方法是实现倾斜校正,导致精度降低到6%。还与之前的一些工作进行了比较。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Structural offline handwriting character recognition using levenshtein distance
Research in offline handwriting recognition for unconstrained text remains a difficult challenge. Some problems such as noise in image, skew of text, cursive letters, and various handwriting style is still an open problem. Many method has been researched to solve those problems, such as k-NN, Neural Network, SVM and HMM. And to improve the recognition result, there are many methods can be implemented in the prerocessing stage. One of them is Kimura's method for slant correction that using chain code for character slant prediction. Those method consumes time and resource in its computation, meanwhile the accuracy is not improved significantly. Therefore, this paper propose to create handwritten character into graph with string representation based on structural approach. The purpose is to provide ability in improving recognition accuracy without relying on normalisation technique. The similarity distance between graphs measured using levenshtein distance. Experiment conducted to recognize handwritten upper-case letters and digits character images which taken from ETL-1 AIST databases. Levenshtein distance has an accuracy of 84.69% on digits and 67.01% on alphabet with 5% size of data for training and value 10 for string representation length. As a comparison, the Kimura's method are implemented for slant correction which results in a reduction of accuracy until 6%. Comparisons also made with some of previous work.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信