{"title":"From machine generated to handwritten character recognition; a deep learning approach","authors":"Kian Peymani, M. Soryani","doi":"10.1109/PRIA.2017.7983055","DOIUrl":null,"url":null,"abstract":"While the task of Optical Character Recognition is deemed to be a solved problem in many languages, it still requires certain improvements in some languages with more complex script structures such as Farsi. Furthermore, Deep Convolution Neural Networks have reached excellent results in various computer vision tasks, including character recognition. Although, these networks require a great amount of data to be properly learned and (in some cases) lack generalization. In order to address this issue, in this work, we propose a tailored dataset and a delicately designed model that can be trained on only machine-generated character images with various typefaces and not only achieve an excellent result on machine generated images, but also achieve a decent accuracy in detecting handwritten characters.","PeriodicalId":336066,"journal":{"name":"2017 3rd International Conference on Pattern Recognition and Image Analysis (IPRIA)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 3rd International Conference on Pattern Recognition and Image Analysis (IPRIA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/PRIA.2017.7983055","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6
Abstract
While the task of Optical Character Recognition is deemed to be a solved problem in many languages, it still requires certain improvements in some languages with more complex script structures such as Farsi. Furthermore, Deep Convolution Neural Networks have reached excellent results in various computer vision tasks, including character recognition. Although, these networks require a great amount of data to be properly learned and (in some cases) lack generalization. In order to address this issue, in this work, we propose a tailored dataset and a delicately designed model that can be trained on only machine-generated character images with various typefaces and not only achieve an excellent result on machine generated images, but also achieve a decent accuracy in detecting handwritten characters.