基于双向LSTM和CTC的在线手写蒙古语词识别端到端模型

Da Teng, Daoerji Fan, Fengshan Bai, Yuecai Pan
{"title":"基于双向LSTM和CTC的在线手写蒙古语词识别端到端模型","authors":"Da Teng, Daoerji Fan, Fengshan Bai, Yuecai Pan","doi":"10.1109/ICIST55546.2022.9926844","DOIUrl":null,"url":null,"abstract":"An end-to-end model for Traditional Mongolian online handwritten word recognition is proposed in this paper. According to the characteristics of input and output data, the proposed model consists of a bidirectional Long Short-Term Memory(LSTM) network and a Connectionist Temporal Classification(CTC) network. Bidirectional LSTM network is the core of the model, and the CTC network is added to LSTM network. The key step of this research is to switch from the LSTM network output to the conditional probability distribution on the label sequence through the CTC layer. Therefore, for each given input sequence, the model completes the recognition task by choosing the most possible label. In addition, There is not many researchs on online handwritten Mongolian recognition. Therefore, in this study, we will also focus on recognizing wrong labels, finding out the types of errors, and analyzing the possible causes of errors.","PeriodicalId":211213,"journal":{"name":"2022 12th International Conference on Information Science and Technology (ICIST)","volume":"35 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"End-to-End Model Based on Bidirectional LSTM and CTC for Online Handwritten Mongolian Word Recognition\",\"authors\":\"Da Teng, Daoerji Fan, Fengshan Bai, Yuecai Pan\",\"doi\":\"10.1109/ICIST55546.2022.9926844\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"An end-to-end model for Traditional Mongolian online handwritten word recognition is proposed in this paper. According to the characteristics of input and output data, the proposed model consists of a bidirectional Long Short-Term Memory(LSTM) network and a Connectionist Temporal Classification(CTC) network. Bidirectional LSTM network is the core of the model, and the CTC network is added to LSTM network. The key step of this research is to switch from the LSTM network output to the conditional probability distribution on the label sequence through the CTC layer. Therefore, for each given input sequence, the model completes the recognition task by choosing the most possible label. In addition, There is not many researchs on online handwritten Mongolian recognition. Therefore, in this study, we will also focus on recognizing wrong labels, finding out the types of errors, and analyzing the possible causes of errors.\",\"PeriodicalId\":211213,\"journal\":{\"name\":\"2022 12th International Conference on Information Science and Technology (ICIST)\",\"volume\":\"35 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-10-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 12th International Conference on Information Science and Technology (ICIST)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICIST55546.2022.9926844\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 12th International Conference on Information Science and Technology (ICIST)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIST55546.2022.9926844","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

提出了一种传统蒙文在线手写词识别的端到端模型。根据输入输出数据的特点,该模型由双向长短期记忆(LSTM)网络和连接时间分类(CTC)网络组成。双向LSTM网络是模型的核心,在LSTM网络中加入了CTC网络。本研究的关键步骤是通过CTC层将LSTM网络输出转换为标签序列上的条件概率分布。因此,对于每个给定的输入序列,模型通过选择最可能的标签来完成识别任务。此外,关于在线手写体蒙古语识别的研究并不多。因此,在本研究中,我们还将着重于识别错误标签,找出错误的类型,并分析错误的可能原因。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
End-to-End Model Based on Bidirectional LSTM and CTC for Online Handwritten Mongolian Word Recognition
An end-to-end model for Traditional Mongolian online handwritten word recognition is proposed in this paper. According to the characteristics of input and output data, the proposed model consists of a bidirectional Long Short-Term Memory(LSTM) network and a Connectionist Temporal Classification(CTC) network. Bidirectional LSTM network is the core of the model, and the CTC network is added to LSTM network. The key step of this research is to switch from the LSTM network output to the conditional probability distribution on the label sequence through the CTC layer. Therefore, for each given input sequence, the model completes the recognition task by choosing the most possible label. In addition, There is not many researchs on online handwritten Mongolian recognition. Therefore, in this study, we will also focus on recognizing wrong labels, finding out the types of errors, and analyzing the possible causes of errors.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信