Bappaditya Chakraborty, P. Mukherjee, U. Bhattacharya
{"title":"Bangla online handwriting recognition using recurrent neural network architecture","authors":"Bappaditya Chakraborty, P. Mukherjee, U. Bhattacharya","doi":"10.1145/3009977.3010072","DOIUrl":null,"url":null,"abstract":"Recognition of unconstrained handwritten texts is always a difficult problem, particularly if the style of handwriting is a mixed cursive one. Among various Indian scripts, only Bangla has this additional difficulty of tackling mixed cur-siveness of its handwriting style in the pipeline of a method towards its automatic recognition. A few other common recognition difficulties of handwriting in an Indian script include the large size of its alphabet and the extremely cursive nature of the shapes of its alphabetic characters. These are among the reasons of achieving only limited success in the study of unconstrained handwritten Bangla text recognition. Artificial Neural Network (ANN) models have often been used for solving difficult real-life pattern recognition problems. Recurrent Neural Network models (RNN) have been studied in the literature for modeling sequence data. In this study, we consider Long Short Term Memory (LSTM) network model, a useful member of this family. In fact, Bidirectional Long Short-Term Memory (BLSTM) neural networks is a special kind of RNN and have recently attracted special attention in solving sequence labelling problems. In this article, we present a BLSTM architecture based approach for unconstrained online handwritten Bangla text recognition.","PeriodicalId":93806,"journal":{"name":"Proceedings. Indian Conference on Computer Vision, Graphics & Image Processing","volume":"39 1","pages":"63:1-63:8"},"PeriodicalIF":0.0000,"publicationDate":"2016-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"13","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings. Indian Conference on Computer Vision, Graphics & Image Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3009977.3010072","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 13
Abstract
Recognition of unconstrained handwritten texts is always a difficult problem, particularly if the style of handwriting is a mixed cursive one. Among various Indian scripts, only Bangla has this additional difficulty of tackling mixed cur-siveness of its handwriting style in the pipeline of a method towards its automatic recognition. A few other common recognition difficulties of handwriting in an Indian script include the large size of its alphabet and the extremely cursive nature of the shapes of its alphabetic characters. These are among the reasons of achieving only limited success in the study of unconstrained handwritten Bangla text recognition. Artificial Neural Network (ANN) models have often been used for solving difficult real-life pattern recognition problems. Recurrent Neural Network models (RNN) have been studied in the literature for modeling sequence data. In this study, we consider Long Short Term Memory (LSTM) network model, a useful member of this family. In fact, Bidirectional Long Short-Term Memory (BLSTM) neural networks is a special kind of RNN and have recently attracted special attention in solving sequence labelling problems. In this article, we present a BLSTM architecture based approach for unconstrained online handwritten Bangla text recognition.
识别不受约束的手写文本一直是一个难题,特别是如果手写风格是混合草书。在各种各样的印度文字中,只有孟加拉文在自动识别的过程中遇到了这种额外的困难,即处理其手写风格的混合潦草性。其他一些常见的识别印度文字的困难包括其字母的大尺寸和其字母字符形状的极端草书性质。这些都是在无约束手写体孟加拉文本识别研究中取得有限成功的原因之一。人工神经网络(ANN)模型经常被用于解决现实生活中的模式识别难题。文献中已经研究了递归神经网络模型(RNN)用于序列数据的建模。在本研究中,我们考虑长短期记忆(LSTM)网络模型,这是这个家族的一个有用的成员。事实上,双向长短期记忆(Bidirectional Long - short - Memory, BLSTM)神经网络是一种特殊的RNN,近年来在解决序列标记问题方面受到了特别的关注。在本文中,我们提出了一种基于BLSTM架构的无约束在线手写体孟加拉文本识别方法。