Text localization in natural scene images with stroke width histogram and superpixel
Yu Zhou, Shuang Liu, Yongzheng Zhang, Yipeng Wang, Weiyao Lin
Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific. Published 2014-12-01.
DOI: 10.1109/APSIPA.2014.7041656 (https://doi.org/10.1109/APSIPA.2014.7041656)
Citations: 7
Abstract
We propose a novel stroke-based method for localizing text in unconstrained natural scene images. First, to improve edge detection in difficult cases where text is partially occluded or noisy, we use a stroke width histogram to guide the generation of a series of superpixels. Second, we present a novel way of using the distance transform and the Sobel operator to extract character skeletons, which are then used to improve stroke-width accuracy. We evaluated the method on two standard datasets, ICDAR 2005 and ICDAR 2011, and the experimental results show that it achieves state-of-the-art performance.
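
The abstract does not give implementation details, so the following is only a minimal illustrative sketch of the general idea of reading stroke widths off a skeleton, not the authors' algorithm. It assumes a binary character mask as input, uses a two-pass city-block distance transform, and substitutes a simple local-maximum ridge test for the paper's distance transform plus Sobel skeleton extraction. All function names (`distance_transform`, `stroke_width_histogram`) are hypothetical.

```python
from collections import Counter


def distance_transform(mask):
    """City-block distance from each foreground pixel to the nearest
    background pixel, computed with the classic two-pass scan.
    Pixels outside the image are treated as background (assumption)."""
    h, w = len(mask), len(mask[0])
    inf = h + w
    d = [[inf if mask[y][x] else 0 for x in range(w)] for y in range(h)]
    # Forward pass: propagate distances from the top and left.
    for y in range(h):
        for x in range(w):
            if d[y][x]:
                up = d[y - 1][x] if y > 0 else 0
                left = d[y][x - 1] if x > 0 else 0
                d[y][x] = min(d[y][x], up + 1, left + 1)
    # Backward pass: propagate distances from the bottom and right.
    for y in range(h - 1, -1, -1):
        for x in range(w - 1, -1, -1):
            if d[y][x]:
                down = d[y + 1][x] if y < h - 1 else 0
                right = d[y][x + 1] if x < w - 1 else 0
                d[y][x] = min(d[y][x], down + 1, right + 1)
    return d


def stroke_width_histogram(mask):
    """Histogram of stroke widths sampled at ridge (skeleton-like) pixels.
    A ridge pixel is a foreground pixel whose distance value is not
    exceeded by any 4-neighbour. The width estimate 2*d - 1 is exact for
    odd widths and one pixel short for even widths under this metric."""
    d = distance_transform(mask)
    h, w = len(d), len(d[0])
    hist = Counter()
    for y in range(h):
        for x in range(w):
            if d[y][x] == 0:
                continue
            neighbours = [
                d[ny][nx]
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1))
                if 0 <= ny < h and 0 <= nx < w
            ]
            if all(d[y][x] >= n for n in neighbours):
                hist[2 * d[y][x] - 1] += 1
    return hist


# Toy input: a 3-pixel-wide, 9-pixel-tall vertical bar on an 11x7 background.
mask = [[1 if 1 <= y <= 9 and 2 <= x <= 4 else 0 for x in range(7)]
        for y in range(11)]
hist = stroke_width_histogram(mask)
dominant_width = hist.most_common(1)[0][0]
print(dominant_width)  # 3
```

On this toy bar the corner pixels also pass the ridge test and contribute a few width-1 samples, which is why the sketch reads the dominant width from the histogram mode rather than from any single pixel.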