{"title":"Farsi/Arabic text extraction from video images by corner detection","authors":"Mohieddin Moradi, S. Mozaffari, A. Orouji","doi":"10.1109/IRANIANMVIP.2010.5941145","DOIUrl":null,"url":null,"abstract":"Video text information plays an important role in semantic-based video analysis, indexing and retrieval. In this paper, we proposed a novel Farsi text detection approach based on intrinsic characteristics of Farsi text lines, which is more robust to complex backgrounds and various font styles. First, by an edge detector operator, all the possible edges in vertical, horizontal, 45 and 135 degrees are extracted. Then, for extracting text strokes, some pre-processing such as dilation and erosion are done according to the font size. Afterward, by finding the edges cross points, corners map is extracted. To discard non-text corners and finding real font size, histogram analysis is done. After finding real font size, input image is rescaled and a new corner map is extracted. Finally, the detected candidate text areas undergo the empirical rules analysis to identify text areas and project profile analysis for verification and text lines extraction. Experimental results demonstrate that the proposed method is robust to font size, font colour, and background complexity.","PeriodicalId":350778,"journal":{"name":"2010 6th Iranian Conference on Machine Vision and Image Processing","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"36","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 6th Iranian Conference on Machine Vision and Image Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IRANIANMVIP.2010.5941145","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 36
Abstract
Video text information plays an important role in semantic-based video analysis, indexing and retrieval. In this paper, we proposed a novel Farsi text detection approach based on intrinsic characteristics of Farsi text lines, which is more robust to complex backgrounds and various font styles. First, by an edge detector operator, all the possible edges in vertical, horizontal, 45 and 135 degrees are extracted. Then, for extracting text strokes, some pre-processing such as dilation and erosion are done according to the font size. Afterward, by finding the edges cross points, corners map is extracted. To discard non-text corners and finding real font size, histogram analysis is done. After finding real font size, input image is rescaled and a new corner map is extracted. Finally, the detected candidate text areas undergo the empirical rules analysis to identify text areas and project profile analysis for verification and text lines extraction. Experimental results demonstrate that the proposed method is robust to font size, font colour, and background complexity.