An improved scene text and document image binarization scheme

R. Ghoshal, A. Banerjee
{"title":"An improved scene text and document image binarization scheme","authors":"R. Ghoshal, A. Banerjee","doi":"10.1109/RAIT.2018.8389021","DOIUrl":null,"url":null,"abstract":"Identification of text portions have a crucial impact on intelligent transport systems, document image processing, robotics and content based image retrieval systems. So, an accurate text identification method is necessary for text based scene image processing tasks such as OCR. Scene text image binarization plays an important role in any text identification algorithm and hence in the OCR performance. In this work a novel approach to natural scene text image binarization by tracking the text boundary based on edge and gray level variance information. Further, broken boundaries are linked to construct the complete boundary map. Here, an adaptive threshold is determined based on boundary edge information to binarize the image effectively. Compared to other well known binarization methods, our method has been proved more effective in cases where the natural scene images have low contrast, low resolution, non-uniform illumination and noise. Our experiments are conducted on the datasets of ICDAR 2003 Robust Reading Competition, ICDAR 2011 Born Digital Dataset, Street View Text (SVT) Dataset, DIBCO dataset and our laboratory made Bangla Dataset. The experimental results are satisfactory.","PeriodicalId":219972,"journal":{"name":"2018 4th International Conference on Recent Advances in Information Technology (RAIT)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 4th International Conference on Recent Advances in Information Technology (RAIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/RAIT.2018.8389021","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 10

Abstract

Identification of text portions have a crucial impact on intelligent transport systems, document image processing, robotics and content based image retrieval systems. So, an accurate text identification method is necessary for text based scene image processing tasks such as OCR. Scene text image binarization plays an important role in any text identification algorithm and hence in the OCR performance. In this work a novel approach to natural scene text image binarization by tracking the text boundary based on edge and gray level variance information. Further, broken boundaries are linked to construct the complete boundary map. Here, an adaptive threshold is determined based on boundary edge information to binarize the image effectively. Compared to other well known binarization methods, our method has been proved more effective in cases where the natural scene images have low contrast, low resolution, non-uniform illumination and noise. Our experiments are conducted on the datasets of ICDAR 2003 Robust Reading Competition, ICDAR 2011 Born Digital Dataset, Street View Text (SVT) Dataset, DIBCO dataset and our laboratory made Bangla Dataset. The experimental results are satisfactory.
一种改进的场景文本和文档图像二值化方案
文本部分的识别对智能运输系统、文档图像处理、机器人技术和基于内容的图像检索系统具有至关重要的影响。因此,对于OCR等基于文本的场景图像处理任务,需要一种准确的文本识别方法。场景文本图像二值化在文本识别算法中起着重要的作用,影响着OCR的性能。本文提出了一种基于边缘和灰度方差信息跟踪文本边界的自然场景文本图像二值化方法。此外,将破碎的边界连接起来构建完整的边界图。该方法基于边界边缘信息确定自适应阈值,实现图像的有效二值化。与其他二值化方法相比,该方法在自然场景图像对比度低、分辨率低、光照不均匀和有噪声的情况下更为有效。实验采用ICDAR 2003鲁棒阅读大赛、ICDAR 2011 Born Digital数据集、街景文本(SVT)数据集、DIBCO数据集和我们实验室制作的孟加拉语数据集进行。实验结果令人满意。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信