基于两种互补算法组合的退化文档图像二值化

M. Valizadeh, M. Komeili, N. Armanfard, E. Kabir
{"title":"基于两种互补算法组合的退化文档图像二值化","authors":"M. Valizadeh, M. Komeili, N. Armanfard, E. Kabir","doi":"10.1109/ACTEA.2009.5227898","DOIUrl":null,"url":null,"abstract":"In this paper we combine two binarization algorithms that are complementary to each other. The main idea is to select the better algorithm in each part of document image. There are algorithms that properly distinguish the text from the background in the regions close to the text, but get wrong in the regions far from the text and introduce some part of background as text. We propose a new binarization algorithm that effectively eliminates background and reliably extracts some parts of each character. Then according to the distance of each pixel form the text, the appropriate algorithm is selected to binarize that pixel. Proposed method is applicable for various types of degraded document images. After extensive experiment, the proposed binarization algorithm demonstrate superior performance against four well-know binarization algorithms on a set of degraded document images captured with camera.","PeriodicalId":308909,"journal":{"name":"2009 International Conference on Advances in Computational Tools for Engineering Applications","volume":"53 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"16","resultStr":"{\"title\":\"Degraded document image binarization based on combination of two complementary algorithms\",\"authors\":\"M. Valizadeh, M. Komeili, N. Armanfard, E. Kabir\",\"doi\":\"10.1109/ACTEA.2009.5227898\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper we combine two binarization algorithms that are complementary to each other. The main idea is to select the better algorithm in each part of document image. There are algorithms that properly distinguish the text from the background in the regions close to the text, but get wrong in the regions far from the text and introduce some part of background as text. We propose a new binarization algorithm that effectively eliminates background and reliably extracts some parts of each character. Then according to the distance of each pixel form the text, the appropriate algorithm is selected to binarize that pixel. Proposed method is applicable for various types of degraded document images. After extensive experiment, the proposed binarization algorithm demonstrate superior performance against four well-know binarization algorithms on a set of degraded document images captured with camera.\",\"PeriodicalId\":308909,\"journal\":{\"name\":\"2009 International Conference on Advances in Computational Tools for Engineering Applications\",\"volume\":\"53 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2009-07-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"16\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2009 International Conference on Advances in Computational Tools for Engineering Applications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ACTEA.2009.5227898\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 International Conference on Advances in Computational Tools for Engineering Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ACTEA.2009.5227898","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 16

摘要

本文结合了两种互补的二值化算法。其主要思想是在文档图像的各个部分选择较好的算法。有些算法在接近文本的区域可以正确地区分文本和背景,但在远离文本的区域会出错,并将背景的一部分作为文本引入。我们提出了一种新的二值化算法,该算法可以有效地去除背景,并可靠地提取每个字符的某些部分。然后根据每个像素与文本的距离,选择合适的算法对像素进行二值化。该方法适用于各种类型的退化文档图像。经过大量的实验,本文提出的二值化算法在一组用相机捕获的退化文档图像上,比四种常用的二值化算法表现出了更好的性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Degraded document image binarization based on combination of two complementary algorithms
In this paper we combine two binarization algorithms that are complementary to each other. The main idea is to select the better algorithm in each part of document image. There are algorithms that properly distinguish the text from the background in the regions close to the text, but get wrong in the regions far from the text and introduce some part of background as text. We propose a new binarization algorithm that effectively eliminates background and reliably extracts some parts of each character. Then according to the distance of each pixel form the text, the appropriate algorithm is selected to binarize that pixel. Proposed method is applicable for various types of degraded document images. After extensive experiment, the proposed binarization algorithm demonstrate superior performance against four well-know binarization algorithms on a set of degraded document images captured with camera.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信