{"title":"Character Recognition via a Compact Convolutional Neural Network","authors":"Haifeng Zhao, Yong Hu, Jinxia Zhang","doi":"10.1109/DICTA.2017.8227414","DOIUrl":null,"url":null,"abstract":"Optical Character Recognition (OCR) in the scanned documents has been a well-studied problem in the past. However, when these characters come from the natural scenes, it becomes a much more challenging problem, as there exist many difficulties in these images, e.g., illumination variance, cluttered backgrounds, geometry distortion. In this paper, we propose to use a deep learning method that based on the convolutional neural networks to recognize this kind of characters and word in the scene images. Based on the original VGG-Net, we focus on how to make a compact architecture on this net, and get both the character and word recognition results under the same framework. We conducted several experiments on the benchmark datasets of the natural scene images. The experiments has shown that our method can achieve the state-of-art performance and at the same time has a more compact representation.","PeriodicalId":194175,"journal":{"name":"2017 International Conference on Digital Image Computing: Techniques and Applications (DICTA)","volume":"112 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"12","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 International Conference on Digital Image Computing: Techniques and Applications (DICTA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DICTA.2017.8227414","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 12
Abstract
Optical Character Recognition (OCR) in the scanned documents has been a well-studied problem in the past. However, when these characters come from the natural scenes, it becomes a much more challenging problem, as there exist many difficulties in these images, e.g., illumination variance, cluttered backgrounds, geometry distortion. In this paper, we propose to use a deep learning method that based on the convolutional neural networks to recognize this kind of characters and word in the scene images. Based on the original VGG-Net, we focus on how to make a compact architecture on this net, and get both the character and word recognition results under the same framework. We conducted several experiments on the benchmark datasets of the natural scene images. The experiments has shown that our method can achieve the state-of-art performance and at the same time has a more compact representation.