Shahbaz Hassan, Ayesha Irfan, Ali Mirza, I. Siddiqi
{"title":"Cursive Handwritten Text Recognition using Bi-Directional LSTMs: A Case Study on Urdu Handwriting","authors":"Shahbaz Hassan, Ayesha Irfan, Ali Mirza, I. Siddiqi","doi":"10.1109/Deep-ML.2019.00021","DOIUrl":null,"url":null,"abstract":"Recognition of cursive handwritten text is a complex problem due challenges like context sensitive character shapes, non-uniform inter and intra word spacings, complex positioning of dots and diacritics and very low inter class variation among certain classes. This paper presents an effective technique for recognition of cursive handwritten text using Urdu as a case study (though findings can be generalized to other cursive scripts as well). We present an analytical approach based on implicit character segmentation where convolutional neural networks (CNNs) are employed as feature extractors while classification is carried out using a bi-directional Long-Short-Term Memory (LSTM) network. The proposed technique is validated on a dataset of 6000 unique handwritten text lines reporting promising character recognition rates.","PeriodicalId":228378,"journal":{"name":"2019 International Conference on Deep Learning and Machine Learning in Emerging Applications (Deep-ML)","volume":"490 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"20","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 International Conference on Deep Learning and Machine Learning in Emerging Applications (Deep-ML)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/Deep-ML.2019.00021","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 20
Abstract
Recognition of cursive handwritten text is a complex problem due challenges like context sensitive character shapes, non-uniform inter and intra word spacings, complex positioning of dots and diacritics and very low inter class variation among certain classes. This paper presents an effective technique for recognition of cursive handwritten text using Urdu as a case study (though findings can be generalized to other cursive scripts as well). We present an analytical approach based on implicit character segmentation where convolutional neural networks (CNNs) are employed as feature extractors while classification is carried out using a bi-directional Long-Short-Term Memory (LSTM) network. The proposed technique is validated on a dataset of 6000 unique handwritten text lines reporting promising character recognition rates.