Local Contrast Based Thresholding for Document Binarization

Mohammad Kamrul Hasan, Md. Mujibur Rahman Majumder, Orvila Sarker, Abdul Matin
{"title":"Local Contrast Based Thresholding for Document Binarization","authors":"Mohammad Kamrul Hasan, Md. Mujibur Rahman Majumder, Orvila Sarker, Abdul Matin","doi":"10.1109/CEEICT.2018.8628120","DOIUrl":null,"url":null,"abstract":"Global thresholding and local thresholding are two basic ways for image binarization. Global thresholding, not suitable for complex documents, may produce noise along the page borders when the intensity of an image is non-uniform. For degraded, noisy and illuminated images local thresholding is preferred. This paper presents a local thresholding scheme when the image is a collection of text, pictures, and background. It calculates the local mean and local contrast to separate the background from the foreground. Local contrast, not suitable for integral images, can be achieved by the differing local maximum, and local minimum intensity. The proposed approach doesn’t calculate standard deviation like much other local thresholding method but locally adapted with local mean and local contrast. This proposed algorithm has been tested with different types of images including ancient documents, noisy background, and pictures. The test results are compared with a global and some traditional local thresholding techniques in terms of MSE and PSNR. The comparison shows that the proposed method results improved PSNR and better extraction in OCR operation.","PeriodicalId":417359,"journal":{"name":"2018 4th International Conference on Electrical Engineering and Information & Communication Technology (iCEEiCT)","volume":"35 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 4th International Conference on Electrical Engineering and Information & Communication Technology (iCEEiCT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CEEICT.2018.8628120","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

Global thresholding and local thresholding are two basic ways for image binarization. Global thresholding, not suitable for complex documents, may produce noise along the page borders when the intensity of an image is non-uniform. For degraded, noisy and illuminated images local thresholding is preferred. This paper presents a local thresholding scheme when the image is a collection of text, pictures, and background. It calculates the local mean and local contrast to separate the background from the foreground. Local contrast, not suitable for integral images, can be achieved by the differing local maximum, and local minimum intensity. The proposed approach doesn’t calculate standard deviation like much other local thresholding method but locally adapted with local mean and local contrast. This proposed algorithm has been tested with different types of images including ancient documents, noisy background, and pictures. The test results are compared with a global and some traditional local thresholding techniques in terms of MSE and PSNR. The comparison shows that the proposed method results improved PSNR and better extraction in OCR operation.
基于局部对比度阈值的文档二值化
全局阈值法和局部阈值法是图像二值化的两种基本方法。全局阈值法不适用于复杂文档,当图像强度不均匀时,可能会在页面边缘产生噪声。对于退化的、有噪声的和光照的图像,首选局部阈值。本文提出了一种局部阈值分割算法,用于图像是文本、图片和背景的集合。它计算局部均值和局部对比度来分离背景和前景。局部对比度可以通过局部最大值和局部最小值强度的不同来实现,但不适用于积分图像。该方法不像许多其他局部阈值法那样计算标准差,而是局部适应于局部均值和局部对比度。该算法已经在不同类型的图像上进行了测试,包括古代文档、噪声背景和图片。在MSE和PSNR方面,将测试结果与全局阈值法和一些传统的局部阈值法进行了比较。对比表明,该方法在OCR操作中提高了PSNR,提取效果更好。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信