Authors: Da Teng, Daoerji Fan, Fengshan Bai, Yuecai Pan
Published in: 2022 12th International Conference on Information Science and Technology (ICIST), 2022-10-14
DOI: 10.1109/ICIST55546.2022.9926844
End-to-End Model Based on Bidirectional LSTM and CTC for Online Handwritten Mongolian Word Recognition
This paper proposes an end-to-end model for online handwritten word recognition of Traditional Mongolian. To match the characteristics of the input and output data, the model consists of a bidirectional Long Short-Term Memory (LSTM) network and a Connectionist Temporal Classification (CTC) network. The bidirectional LSTM network is the core of the model, and the CTC network is added to the LSTM network. The key step is to transform the LSTM network output, through the CTC layer, into a conditional probability distribution over label sequences. For each given input sequence, the model then completes the recognition task by choosing the most probable label sequence. In addition, because little research exists on online handwritten Mongolian recognition, this study also examines the incorrectly recognized labels, identifies the types of errors, and analyzes their possible causes.
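The role of the CTC layer described above can be illustrated with a small, framework-free sketch. It assumes the LSTM output has already been normalized into per-frame probabilities over the label alphabet plus a blank symbol (index 0 here is an assumption, as are the function names). The first function uses the standard CTC forward recursion to compute the conditional probability of one label sequence; the second applies greedy best-path decoding, which is one common approximation to picking the most probable label sequence. The abstract does not state which decoding method the authors used, so this is an illustration rather than their implementation.

```python
from typing import List, Sequence

BLANK = 0  # assumed index of the CTC blank symbol


def ctc_label_probability(probs: Sequence[Sequence[float]],
                          label: Sequence[int],
                          blank: int = BLANK) -> float:
    """Return p(label | input) with the CTC forward recursion.

    probs[t][k] is the per-frame probability of symbol k at time t
    (for example, a softmax over the LSTM outputs). Requires at least one frame.
    """
    # Extended label: a blank before, between, and after every symbol.
    ext: List[int] = [blank]
    for symbol in label:
        ext += [symbol, blank]
    size = len(ext)

    # Initialization: a path may start with a blank or with the first symbol.
    alpha = [0.0] * size
    alpha[0] = probs[0][blank]
    if size > 1:
        alpha[1] = probs[0][ext[1]]

    for frame in probs[1:]:
        new_alpha = [0.0] * size
        for s in range(size):
            total = alpha[s]                      # stay on the same position
            if s >= 1:
                total += alpha[s - 1]             # advance by one position
            # Skipping a blank is allowed only between two different symbols.
            if s >= 2 and ext[s] != blank and ext[s] != ext[s - 2]:
                total += alpha[s - 2]
            new_alpha[s] = total * frame[ext[s]]
        alpha = new_alpha

    # Valid paths end on the last symbol or on the trailing blank.
    return alpha[-1] + (alpha[-2] if size > 1 else 0.0)


def ctc_greedy_decode(probs: Sequence[Sequence[float]],
                      blank: int = BLANK) -> List[int]:
    """Best-path decoding: argmax per frame, merge repeats, drop blanks."""
    best_path = [max(range(len(frame)), key=frame.__getitem__) for frame in probs]
    decoded: List[int] = []
    previous = None
    for symbol in best_path:
        if symbol != previous and symbol != blank:
            decoded.append(symbol)
        previous = symbol
    return decoded


if __name__ == "__main__":
    # Two frames, two classes: index 0 is blank, index 1 is a hypothetical letter.
    toy_probs = [[0.4, 0.6], [0.3, 0.7]]
    print("p([1] | x) =", ctc_label_probability(toy_probs, [1]))
    print("greedy decode:", ctc_greedy_decode(toy_probs))
```

Greedy decoding is cheap but only considers the single most likely alignment; the forward recursion sums over all alignments of a label sequence, which is what the CTC probability distribution is defined over.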