{"title":"一种改进的场景文本和文档图像二值化方案","authors":"R. Ghoshal, A. Banerjee","doi":"10.1109/RAIT.2018.8389021","DOIUrl":null,"url":null,"abstract":"Identification of text portions have a crucial impact on intelligent transport systems, document image processing, robotics and content based image retrieval systems. So, an accurate text identification method is necessary for text based scene image processing tasks such as OCR. Scene text image binarization plays an important role in any text identification algorithm and hence in the OCR performance. In this work a novel approach to natural scene text image binarization by tracking the text boundary based on edge and gray level variance information. Further, broken boundaries are linked to construct the complete boundary map. Here, an adaptive threshold is determined based on boundary edge information to binarize the image effectively. Compared to other well known binarization methods, our method has been proved more effective in cases where the natural scene images have low contrast, low resolution, non-uniform illumination and noise. Our experiments are conducted on the datasets of ICDAR 2003 Robust Reading Competition, ICDAR 2011 Born Digital Dataset, Street View Text (SVT) Dataset, DIBCO dataset and our laboratory made Bangla Dataset. The experimental results are satisfactory.","PeriodicalId":219972,"journal":{"name":"2018 4th International Conference on Recent Advances in Information Technology (RAIT)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":"{\"title\":\"An improved scene text and document image binarization scheme\",\"authors\":\"R. Ghoshal, A. Banerjee\",\"doi\":\"10.1109/RAIT.2018.8389021\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Identification of text portions have a crucial impact on intelligent transport systems, document image processing, robotics and content based image retrieval systems. So, an accurate text identification method is necessary for text based scene image processing tasks such as OCR. Scene text image binarization plays an important role in any text identification algorithm and hence in the OCR performance. In this work a novel approach to natural scene text image binarization by tracking the text boundary based on edge and gray level variance information. Further, broken boundaries are linked to construct the complete boundary map. Here, an adaptive threshold is determined based on boundary edge information to binarize the image effectively. Compared to other well known binarization methods, our method has been proved more effective in cases where the natural scene images have low contrast, low resolution, non-uniform illumination and noise. Our experiments are conducted on the datasets of ICDAR 2003 Robust Reading Competition, ICDAR 2011 Born Digital Dataset, Street View Text (SVT) Dataset, DIBCO dataset and our laboratory made Bangla Dataset. The experimental results are satisfactory.\",\"PeriodicalId\":219972,\"journal\":{\"name\":\"2018 4th International Conference on Recent Advances in Information Technology (RAIT)\",\"volume\":\"19 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-03-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"10\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 4th International Conference on Recent Advances in Information Technology (RAIT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/RAIT.2018.8389021\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 4th International Conference on Recent Advances in Information Technology (RAIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/RAIT.2018.8389021","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 10
摘要
文本部分的识别对智能运输系统、文档图像处理、机器人技术和基于内容的图像检索系统具有至关重要的影响。因此,对于OCR等基于文本的场景图像处理任务,需要一种准确的文本识别方法。场景文本图像二值化在文本识别算法中起着重要的作用,影响着OCR的性能。本文提出了一种基于边缘和灰度方差信息跟踪文本边界的自然场景文本图像二值化方法。此外,将破碎的边界连接起来构建完整的边界图。该方法基于边界边缘信息确定自适应阈值,实现图像的有效二值化。与其他二值化方法相比,该方法在自然场景图像对比度低、分辨率低、光照不均匀和有噪声的情况下更为有效。实验采用ICDAR 2003鲁棒阅读大赛、ICDAR 2011 Born Digital数据集、街景文本(SVT)数据集、DIBCO数据集和我们实验室制作的孟加拉语数据集进行。实验结果令人满意。
An improved scene text and document image binarization scheme
Identification of text portions have a crucial impact on intelligent transport systems, document image processing, robotics and content based image retrieval systems. So, an accurate text identification method is necessary for text based scene image processing tasks such as OCR. Scene text image binarization plays an important role in any text identification algorithm and hence in the OCR performance. In this work a novel approach to natural scene text image binarization by tracking the text boundary based on edge and gray level variance information. Further, broken boundaries are linked to construct the complete boundary map. Here, an adaptive threshold is determined based on boundary edge information to binarize the image effectively. Compared to other well known binarization methods, our method has been proved more effective in cases where the natural scene images have low contrast, low resolution, non-uniform illumination and noise. Our experiments are conducted on the datasets of ICDAR 2003 Robust Reading Competition, ICDAR 2011 Born Digital Dataset, Street View Text (SVT) Dataset, DIBCO dataset and our laboratory made Bangla Dataset. The experimental results are satisfactory.