{"title":"A skeleton based binarization approach for video text recognition","authors":"Haojin Yang, B. Quehl, Harald Sack","doi":"10.1109/WIAMIS.2012.6226754","DOIUrl":null,"url":null,"abstract":"Text in video data comes in different resolutions and with heterogeneous background resulting in difficult contrast ratios that most times prohibit valid OCR (Optical Character Recognition) results. Therefore, the text has to be separated from its background before applying standard OCR process. This pre-processing task can be achieved by a suitable image binarization procedure. In this paper, we propose a novel binarization method for video text images with complex background. The proposed method is based on a seed-region growing strategy. First, the text gradient direction is approximated by analyzing the content distribution of image skeleton maps. Then, the text seed-pixels are selected by calculating the average grayscale value of skeleton pixels. And finally, an automated seed region growing algorithm is applied to obtain the text pixels. The accuracy of the proposed approach is shown by evaluation.","PeriodicalId":346777,"journal":{"name":"2012 13th International Workshop on Image Analysis for Multimedia Interactive Services","volume":"46 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 13th International Workshop on Image Analysis for Multimedia Interactive Services","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WIAMIS.2012.6226754","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
Text in video data comes in different resolutions and with heterogeneous background resulting in difficult contrast ratios that most times prohibit valid OCR (Optical Character Recognition) results. Therefore, the text has to be separated from its background before applying standard OCR process. This pre-processing task can be achieved by a suitable image binarization procedure. In this paper, we propose a novel binarization method for video text images with complex background. The proposed method is based on a seed-region growing strategy. First, the text gradient direction is approximated by analyzing the content distribution of image skeleton maps. Then, the text seed-pixels are selected by calculating the average grayscale value of skeleton pixels. And finally, an automated seed region growing algorithm is applied to obtain the text pixels. The accuracy of the proposed approach is shown by evaluation.