A New Approach for Synthesis and Recognition of Large Scale Handwritten Chinese Words

Gang Liu, Lianwen Jin, Kai Ding, Hanyu Yan
{"title":"A New Approach for Synthesis and Recognition of Large Scale Handwritten Chinese Words","authors":"Gang Liu, Lianwen Jin, Kai Ding, Hanyu Yan","doi":"10.1109/ICFHR.2010.94","DOIUrl":null,"url":null,"abstract":"Lacking of dataset is still a serious problem for researchers who study on online handwriting word recognition (HWR). In this paper, a handwritten Chinese word synthesis method is proposed for the first time to generate a large scale handwritten Chinese word dataset. The distributions of shape and position characteristics, such as aspect radio, character interval and the angle of gravity center line in each word sample of the Word8888 dataset have been estimated respectively. Based on this, we synthesize as large as 44,208 categories of 8,311,104 unconstrained handwritten Chinese word samples. To verify the validity of the synthesized dataset, a practical rotation free handwriting Chinese word recognition system is presented based on a new holistic approach. Experimental results for randomly rotated word samples demonstrate that the holistic approach can achieve 91.96% recognition accuracy, which provides evidence for the effectiveness of our method.","PeriodicalId":335044,"journal":{"name":"2010 12th International Conference on Frontiers in Handwriting Recognition","volume":"172 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 12th International Conference on Frontiers in Handwriting Recognition","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICFHR.2010.94","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

Lacking of dataset is still a serious problem for researchers who study on online handwriting word recognition (HWR). In this paper, a handwritten Chinese word synthesis method is proposed for the first time to generate a large scale handwritten Chinese word dataset. The distributions of shape and position characteristics, such as aspect radio, character interval and the angle of gravity center line in each word sample of the Word8888 dataset have been estimated respectively. Based on this, we synthesize as large as 44,208 categories of 8,311,104 unconstrained handwritten Chinese word samples. To verify the validity of the synthesized dataset, a practical rotation free handwriting Chinese word recognition system is presented based on a new holistic approach. Experimental results for randomly rotated word samples demonstrate that the holistic approach can achieve 91.96% recognition accuracy, which provides evidence for the effectiveness of our method.
一种大规模手写汉字合成与识别的新方法
数据集的缺乏一直是困扰在线手写词识别研究人员的一个严重问题。本文首次提出了一种手写体中文词合成方法,用于生成大规模手写体中文词数据集。分别估计了Word8888数据集每个词样本的形状和位置特征的分布,如纵横比、字符间隔和重心中心线角度。在此基础上,我们合成了8,311,104个无约束手写体中文单词样本中多达44,208个类别。为了验证合成数据集的有效性,提出了一种基于整体方法的实用的无旋转手写汉字识别系统。对于随机旋转的单词样本,实验结果表明,整体方法的识别准确率达到91.96%,证明了方法的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信