Pinaki Ranjan Sarkar, Deepak Mishra, Gorthi R. K. S. S. Manyam
{"title":"Improving Isolated Bangla Compound Character Recognition Through Feature-map Alignment","authors":"Pinaki Ranjan Sarkar, Deepak Mishra, Gorthi R. K. S. S. Manyam","doi":"10.1109/ICAPR.2017.8593008","DOIUrl":null,"url":null,"abstract":"Due to high variability in writing style of different individuals, non-centered and non-uniformly scaled optical characters are very difficult to recognize. Several techniques are proposed in-order to solve the recognition problem. In this work, we highlight that the performance of optical character classifiers which are based on the deep learning framework can be improved through feature-map alignment. Here, we have used spatial transformer network to align the feature maps of a convolutional neural network model which is proposed for the classification problem. We demonstrate that with the proposed framework not only the slight transformed versions which are usually considered in the conventional datasets can be classified with high accuracy, but also highly non-uniform in scale characters can also be fairly recognized with quite higher accuracy. We evaluate our proposed model on CMATERdb 3.1.3 database which consists of isolated Bangla handwritten compound characters and our model obtained 97.86 % recognition accuracy in the original database and 96.34 % on various rotated data in training and testing.","PeriodicalId":239965,"journal":{"name":"2017 Ninth International Conference on Advances in Pattern Recognition (ICAPR)","volume":"222 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 Ninth International Conference on Advances in Pattern Recognition (ICAPR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICAPR.2017.8593008","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
Due to high variability in writing style of different individuals, non-centered and non-uniformly scaled optical characters are very difficult to recognize. Several techniques are proposed in-order to solve the recognition problem. In this work, we highlight that the performance of optical character classifiers which are based on the deep learning framework can be improved through feature-map alignment. Here, we have used spatial transformer network to align the feature maps of a convolutional neural network model which is proposed for the classification problem. We demonstrate that with the proposed framework not only the slight transformed versions which are usually considered in the conventional datasets can be classified with high accuracy, but also highly non-uniform in scale characters can also be fairly recognized with quite higher accuracy. We evaluate our proposed model on CMATERdb 3.1.3 database which consists of isolated Bangla handwritten compound characters and our model obtained 97.86 % recognition accuracy in the original database and 96.34 % on various rotated data in training and testing.