{"title":"A Segmentation-driven Handwritten Uighur Word Recognition Algorithm Based on Feedback Structure","authors":"Yamei Xu, Jili Xue","doi":"10.1109/ICSESS47205.2019.9040846","DOIUrl":null,"url":null,"abstract":"Uighur script is cursive in both printed and handwritten forms. For offline handwritten Uighur word, this study proposes a new segmentation-driven recognition algorithm that combines feedback structure and grapheme analysis. Firstly, a handwritten Uighur word is over-segmented into a two-queue grapheme sequence using a MSAC (main segmentation and additional clustering) algorithm. Secondly, a feedback-based grapheme merging strategy is designed to provide the best segmented character sequence and obtain the word recognition result. Three feedback errors accordingly are defined, which are error of grapheme shape, error of character recognition and word matching error. A word recognition rate of 90.82% is obtained during experiments conducted with a database consisting of 11,500 samples.","PeriodicalId":203944,"journal":{"name":"2019 IEEE 10th International Conference on Software Engineering and Service Science (ICSESS)","volume":"140 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE 10th International Conference on Software Engineering and Service Science (ICSESS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSESS47205.2019.9040846","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Uighur script is cursive in both printed and handwritten forms. For offline handwritten Uighur word, this study proposes a new segmentation-driven recognition algorithm that combines feedback structure and grapheme analysis. Firstly, a handwritten Uighur word is over-segmented into a two-queue grapheme sequence using a MSAC (main segmentation and additional clustering) algorithm. Secondly, a feedback-based grapheme merging strategy is designed to provide the best segmented character sequence and obtain the word recognition result. Three feedback errors accordingly are defined, which are error of grapheme shape, error of character recognition and word matching error. A word recognition rate of 90.82% is obtained during experiments conducted with a database consisting of 11,500 samples.