A Segmentation-driven Handwritten Uighur Word Recognition Algorithm Based on Feedback Structure

Yamei Xu, Jili Xue
{"title":"A Segmentation-driven Handwritten Uighur Word Recognition Algorithm Based on Feedback Structure","authors":"Yamei Xu, Jili Xue","doi":"10.1109/ICSESS47205.2019.9040846","DOIUrl":null,"url":null,"abstract":"Uighur script is cursive in both printed and handwritten forms. For offline handwritten Uighur word, this study proposes a new segmentation-driven recognition algorithm that combines feedback structure and grapheme analysis. Firstly, a handwritten Uighur word is over-segmented into a two-queue grapheme sequence using a MSAC (main segmentation and additional clustering) algorithm. Secondly, a feedback-based grapheme merging strategy is designed to provide the best segmented character sequence and obtain the word recognition result. Three feedback errors accordingly are defined, which are error of grapheme shape, error of character recognition and word matching error. A word recognition rate of 90.82% is obtained during experiments conducted with a database consisting of 11,500 samples.","PeriodicalId":203944,"journal":{"name":"2019 IEEE 10th International Conference on Software Engineering and Service Science (ICSESS)","volume":"140 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE 10th International Conference on Software Engineering and Service Science (ICSESS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSESS47205.2019.9040846","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Uighur script is cursive in both printed and handwritten forms. For offline handwritten Uighur word, this study proposes a new segmentation-driven recognition algorithm that combines feedback structure and grapheme analysis. Firstly, a handwritten Uighur word is over-segmented into a two-queue grapheme sequence using a MSAC (main segmentation and additional clustering) algorithm. Secondly, a feedback-based grapheme merging strategy is designed to provide the best segmented character sequence and obtain the word recognition result. Three feedback errors accordingly are defined, which are error of grapheme shape, error of character recognition and word matching error. A word recognition rate of 90.82% is obtained during experiments conducted with a database consisting of 11,500 samples.
基于反馈结构的分词驱动手写维吾尔语词识别算法
维吾尔文印刷和手写都是草书。针对离线手写维吾尔语单词,本文提出了一种结合反馈结构和字素分析的分词驱动识别算法。首先,使用MSAC(主分词和附加聚类)算法将手写的维吾尔语单词过度分词为双队列字素序列。其次,设计了一种基于反馈的字素合并策略,以提供最佳的字符分割序列,获得词识别结果;据此定义了三种反馈错误,即字素形状错误、字符识别错误和词匹配错误。在包含11500个样本的数据库中进行实验,获得了90.82%的单词识别率。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信