An Improved Lightweight Framework for Handwritten Chinese Text Recognition Based on CRNN

Lu Shen, Su-Kit Tang, S. Mirri
{"title":"An Improved Lightweight Framework for Handwritten Chinese Text Recognition Based on CRNN","authors":"Lu Shen, Su-Kit Tang, S. Mirri","doi":"10.1109/ISoIRS57349.2022.00010","DOIUrl":null,"url":null,"abstract":"Robots with computer vision and text recognition functions are widely used in industrial production, especially in highly automated factories. However, most robots have an excellent ability to recognize printed characters and show low accuracy in recognition of handwritten characters. Therefore, this paper considers recognizing handwritten text in the intelligent processing of handwritten documents. Its high accuracy prediction results are closely related to the effectiveness of manuscript input, intelligent translation, and intelligent scoring. Handwritten text is more difficult to recognize because it contains sequential information, and the images are more complex than single-character images. This paper proposes a new handwritten Chinese text recognition (HCTR) framework based on existing classical convolutional neural network (CNN) and recurrent neural network (RNN) algorithms. We use a handwritten Chinese text dataset from CASIA-HWDB containing numbers and symbols close to real application scenarios to train the model and compare the performance of various models, such as MobileNetV1 and MobileNetV2, with the proposed model. From the analysis of experimental results, it can be found that the proposed method can achieve higher performance with fewer parameters. In addition, we optimize the dropout rates of input blocks and obtain the best CER of our method is 6.11%.","PeriodicalId":405065,"journal":{"name":"2022 International Symposium on Intelligent Robotics and Systems (ISoIRS)","volume":"57 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Symposium on Intelligent Robotics and Systems (ISoIRS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISoIRS57349.2022.00010","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Robots with computer vision and text recognition functions are widely used in industrial production, especially in highly automated factories. However, most robots have an excellent ability to recognize printed characters and show low accuracy in recognition of handwritten characters. Therefore, this paper considers recognizing handwritten text in the intelligent processing of handwritten documents. Its high accuracy prediction results are closely related to the effectiveness of manuscript input, intelligent translation, and intelligent scoring. Handwritten text is more difficult to recognize because it contains sequential information, and the images are more complex than single-character images. This paper proposes a new handwritten Chinese text recognition (HCTR) framework based on existing classical convolutional neural network (CNN) and recurrent neural network (RNN) algorithms. We use a handwritten Chinese text dataset from CASIA-HWDB containing numbers and symbols close to real application scenarios to train the model and compare the performance of various models, such as MobileNetV1 and MobileNetV2, with the proposed model. From the analysis of experimental results, it can be found that the proposed method can achieve higher performance with fewer parameters. In addition, we optimize the dropout rates of input blocks and obtain the best CER of our method is 6.11%.
基于CRNN的手写体中文文本识别改进轻量级框架
具有计算机视觉和文本识别功能的机器人广泛应用于工业生产,特别是高度自动化的工厂。然而,大多数机器人在识别印刷字符方面具有出色的能力,而在识别手写字符方面却表现出较低的准确性。因此,本文考虑在手写文档的智能处理中对手写文本进行识别。其准确度高的预测结果与稿件输入、智能翻译、智能评分的有效性密切相关。手写文本更难识别,因为它包含顺序信息,而且图像比单字符图像更复杂。本文提出了一种基于经典卷积神经网络(CNN)和递归神经网络(RNN)算法的手写中文文本识别(HCTR)框架。我们使用来自CASIA-HWDB的包含接近实际应用场景的数字和符号的手写体中文文本数据集来训练模型,并将各种模型(如MobileNetV1和MobileNetV2)与所提出的模型的性能进行比较。通过对实验结果的分析,可以发现该方法可以在较少的参数下获得较高的性能。此外,我们对输入块的丢包率进行了优化,得到了该方法的最佳识别率为6.11%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信