Review of Different Binarization Approaches on Degraded Document Images

W. Mustafa, H. Aziz, W. Khairunizam, Zunaidi Ibrahim, A. Shahriman, Z. Razlan
{"title":"Review of Different Binarization Approaches on Degraded Document Images","authors":"W. Mustafa, H. Aziz, W. Khairunizam, Zunaidi Ibrahim, A. Shahriman, Z. Razlan","doi":"10.1109/ICASSDA.2018.8477621","DOIUrl":null,"url":null,"abstract":"Binarization is used to read text documents automatically by using optical character recognition. It is a very important step to segment foreground text form background images. Binarization processes become a challenging task when it comes to old document images which usually suffer from degradations. The different types of document degradation such as uneven illumination, image contrast variation and bleeding-through, binarization surely become an enormous challenge for all researchers. Binary image representation is the essential format for document analysis. This paper presents comparisons of several image binarization techniques in order to find the best approach for the binarizing document image. Several binarization techniques such as Bernsen, Multiple Thresholding, Deghost, Fuzzy C-Means and Triangle methods have been selected for comparison and applied on H-DIBCO 2013 dataset. According to the image quality assessment (IQA) results, it is obvious to state that the Fuzzy C-Means method is successful and effective compared to other methods. Hence, the implications of this image analysis would give researchers a direction for future research.","PeriodicalId":185167,"journal":{"name":"2018 International Conference on Computational Approach in Smart Systems Design and Applications (ICASSDA)","volume":"207 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"13","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 International Conference on Computational Approach in Smart Systems Design and Applications (ICASSDA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICASSDA.2018.8477621","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 13

Abstract

Binarization is used to read text documents automatically by using optical character recognition. It is a very important step to segment foreground text form background images. Binarization processes become a challenging task when it comes to old document images which usually suffer from degradations. The different types of document degradation such as uneven illumination, image contrast variation and bleeding-through, binarization surely become an enormous challenge for all researchers. Binary image representation is the essential format for document analysis. This paper presents comparisons of several image binarization techniques in order to find the best approach for the binarizing document image. Several binarization techniques such as Bernsen, Multiple Thresholding, Deghost, Fuzzy C-Means and Triangle methods have been selected for comparison and applied on H-DIBCO 2013 dataset. According to the image quality assessment (IQA) results, it is obvious to state that the Fuzzy C-Means method is successful and effective compared to other methods. Hence, the implications of this image analysis would give researchers a direction for future research.
退化文档图像二值化方法综述
利用光学字符识别,采用二值化技术自动读取文本文档。从背景图像中分割前景文本是非常重要的一步。当涉及到经常遭受退化的旧文档图像时,二值化过程成为一项具有挑战性的任务。不同类型的文档退化,如光照不均匀,图像对比度变化和透血,二值化无疑成为所有研究人员面临的巨大挑战。二值图像表示是文档分析的基本格式。本文对几种图像二值化技术进行了比较,以期找到对文档图像进行二值化的最佳方法。选择Bernsen、Multiple Thresholding、Deghost、Fuzzy C-Means和Triangle等二值化方法对H-DIBCO 2013数据集进行比较和应用。从图像质量评价(IQA)结果可以明显看出,模糊c -均值方法与其他方法相比是成功有效的。因此,这一图像分析的意义将为研究人员未来的研究提供一个方向。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信