Shahbaz Hassan, Ayesha Irfan, Ali Mirza, I. Siddiqi
2019 International Conference on Deep Learning and Machine Learning in Emerging Applications (Deep-ML), published 2019-08-01. DOI: 10.1109/Deep-ML.2019.00021 (https://doi.org/10.1109/Deep-ML.2019.00021)
Cursive Handwritten Text Recognition using Bi-Directional LSTMs: A Case Study on Urdu Handwriting
Recognition of cursive handwritten text is a complex problem due to challenges such as context-sensitive character shapes, non-uniform inter- and intra-word spacing, complex positioning of dots and diacritics, and very low inter-class variation among certain classes. This paper presents an effective technique for recognizing cursive handwritten text, using Urdu as a case study (though the findings can be generalized to other cursive scripts as well). We present an analytical approach based on implicit character segmentation, in which convolutional neural networks (CNNs) are employed as feature extractors while classification is carried out using a bidirectional Long Short-Term Memory (LSTM) network. The proposed technique is validated on a dataset of 6,000 unique handwritten text lines and reports promising character recognition rates.
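As a rough illustration of what implicit segmentation and a character recognition rate can look like in practice, the sketch below turns per-frame class predictions into a character sequence and scores it against a ground-truth line. The abstract states neither the decoding scheme nor the exact metric, so two things here are assumptions rather than the authors' method: a reserved blank label with repeat-collapsing (best-path decoding, as used with CTC-style training), and a rate defined as one minus the edit distance divided by the reference length. All function names and the label indices are hypothetical.

```python
from typing import List, Sequence

BLANK = 0  # assumed index reserved for "no character" frames (not from the paper)


def collapse_frames(frame_labels: Sequence[int], blank: int = BLANK) -> List[int]:
    """Merge runs of identical frame labels, then drop blanks.

    Character boundaries are never located explicitly; they emerge from
    where the per-frame label changes.
    """
    decoded: List[int] = []
    previous = blank
    for label in frame_labels:
        if label != previous and label != blank:
            decoded.append(label)
        previous = label
    return decoded


def edit_distance(reference: Sequence[int], hypothesis: Sequence[int]) -> int:
    """Levenshtein distance computed with a single rolling row."""
    row = list(range(len(hypothesis) + 1))
    for i, ref_item in enumerate(reference, start=1):
        # diagonal holds the value from the previous row, previous column
        diagonal, row[0] = row[0], i
        for j, hyp_item in enumerate(hypothesis, start=1):
            cost = 0 if ref_item == hyp_item else 1
            diagonal, row[j] = row[j], min(
                row[j] + 1,        # character missing from the hypothesis
                row[j - 1] + 1,    # extra character in the hypothesis
                diagonal + cost,   # match or substitution
            )
    return row[-1]


def character_recognition_rate(
    reference: Sequence[int], hypothesis: Sequence[int]
) -> float:
    """Assumed metric: 1 - edit_distance / len(reference), floored at 0."""
    if not reference:
        raise ValueError("reference must not be empty")
    errors = edit_distance(reference, hypothesis)
    return max(0.0, 1.0 - errors / len(reference))


if __name__ == "__main__":
    # Hypothetical per-frame argmax output for one text line.
    frames = [0, 3, 3, 0, 3, 5, 5, 0]
    predicted = collapse_frames(frames)
    truth = [3, 3, 5]
    print(predicted, character_recognition_rate(truth, predicted))
```

In a full system the frame labels would come from the bidirectional LSTM's per-time-step outputs over CNN features; here they are hard-coded so the decoding and scoring steps can be read in isolation.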