{"title":"An End-to-End Model for Printed Uyghur Text Recognition","authors":"Zhiwei You, Qiong Li, Chuang Liu","doi":"10.1109/ICARCE55724.2022.10046535","DOIUrl":null,"url":null,"abstract":"The recognition method based on the end-to-end model has a wide range of applications in Chinese and English text recognition, but there are few researches of Uygur text recognition. Using the end-to-end method can effectively avoid the wrong recognition problem caused by the mis-segmentation of Uyghur letters. Based on the Transformer model, we propose a model named EfficientNet-Transformer for printed Uyghur text recognition. By replacing the SE attention of EfficientNet with Triplet attention, the computing ability of the network for spatial and channel attention is improved. The encoder of the original Transformer model is replaced by an improved EfficientNet, which makes the model simpler and fewer parameters. The dataset is expanded by synthesizing words through Uyghur syllable rules, and comparative experiments are carried out on this dataset by using our model with other’s RNN-based model. Experiments show this model is superior to others models in Character error rate, recognition speed and space occupation.","PeriodicalId":416305,"journal":{"name":"2022 International Conference on Automation, Robotics and Computer Engineering (ICARCE)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Automation, Robotics and Computer Engineering (ICARCE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICARCE55724.2022.10046535","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The recognition method based on the end-to-end model has a wide range of applications in Chinese and English text recognition, but there are few researches of Uygur text recognition. Using the end-to-end method can effectively avoid the wrong recognition problem caused by the mis-segmentation of Uyghur letters. Based on the Transformer model, we propose a model named EfficientNet-Transformer for printed Uyghur text recognition. By replacing the SE attention of EfficientNet with Triplet attention, the computing ability of the network for spatial and channel attention is improved. The encoder of the original Transformer model is replaced by an improved EfficientNet, which makes the model simpler and fewer parameters. The dataset is expanded by synthesizing words through Uyghur syllable rules, and comparative experiments are carried out on this dataset by using our model with other’s RNN-based model. Experiments show this model is superior to others models in Character error rate, recognition speed and space occupation.