A PDE system based on background estimation for document binarization with non-uniform background

IF 2.9 2区 数学 Q1 MATHEMATICS, APPLIED
Yu Wang, Chuanjiang He
{"title":"A PDE system based on background estimation for document binarization with non-uniform background","authors":"Yu Wang,&nbsp;Chuanjiang He","doi":"10.1016/j.camwa.2025.06.009","DOIUrl":null,"url":null,"abstract":"<div><div>Document binarization, despite extensive research, still remains a challenging task for images with non-uniform background (NUB). Background estimation (BE) is an effective preprocessing technique to cope with this challenge. Previous BE-based binarization estimates a background component from the input and uses that estimate to preprocess the input to yield the compensated image, followed by a commonly sophisticated processing step for binarization. In this paper, inspired by BE, we introduce a binarization model in the framework of partial differential equation (PDE) for NUB document images. Specifically, we first model the NUB degradation as an uneven scaling map that adjusts the contrast of a normal document to produce the NUB document image. In other words, a NUB document image can be represented as the pixel-wise product of the uneven scaling map and the normal image (i.e., compensated image). The problem of compensating for the background is therefore rendered into the restoration problem of recovering the normal document image from the input NUB document image. Based on the NUB model, we propose a PDE system for binarization of document images with NUB degradation, which consists of an evolution PDE (w.r.t the scaling component) with the input NUB image as the initial condition, and a nonlinear diffusion equation (w.r.t the compensated component) with a pure white image as the initial condition. This PDE system estimates the scaling component and the compensated image dynamically and alternately during evolution. In the final compensated image, the text and background pixels are divided into two primary modes with a separation of 0.5, thereby enabling us to achieve binarization of the input NUB document image just by a binary projection separated by 0.5. Finally, the PDE system is evaluated on twenty-nine NUB document images from publicly available DIBCO datasets, in comparison to seven related PDE binarization models. Experimental results show that the proposed PDE system outperforms the seven models for comparison in terms of binarization of NUB document images.</div></div>","PeriodicalId":55218,"journal":{"name":"Computers & Mathematics with Applications","volume":"194 ","pages":"Pages 158-176"},"PeriodicalIF":2.9000,"publicationDate":"2025-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers & Mathematics with Applications","FirstCategoryId":"100","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0898122125002561","RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MATHEMATICS, APPLIED","Score":null,"Total":0}
引用次数: 0

Abstract

Document binarization, despite extensive research, still remains a challenging task for images with non-uniform background (NUB). Background estimation (BE) is an effective preprocessing technique to cope with this challenge. Previous BE-based binarization estimates a background component from the input and uses that estimate to preprocess the input to yield the compensated image, followed by a commonly sophisticated processing step for binarization. In this paper, inspired by BE, we introduce a binarization model in the framework of partial differential equation (PDE) for NUB document images. Specifically, we first model the NUB degradation as an uneven scaling map that adjusts the contrast of a normal document to produce the NUB document image. In other words, a NUB document image can be represented as the pixel-wise product of the uneven scaling map and the normal image (i.e., compensated image). The problem of compensating for the background is therefore rendered into the restoration problem of recovering the normal document image from the input NUB document image. Based on the NUB model, we propose a PDE system for binarization of document images with NUB degradation, which consists of an evolution PDE (w.r.t the scaling component) with the input NUB image as the initial condition, and a nonlinear diffusion equation (w.r.t the compensated component) with a pure white image as the initial condition. This PDE system estimates the scaling component and the compensated image dynamically and alternately during evolution. In the final compensated image, the text and background pixels are divided into two primary modes with a separation of 0.5, thereby enabling us to achieve binarization of the input NUB document image just by a binary projection separated by 0.5. Finally, the PDE system is evaluated on twenty-nine NUB document images from publicly available DIBCO datasets, in comparison to seven related PDE binarization models. Experimental results show that the proposed PDE system outperforms the seven models for comparison in terms of binarization of NUB document images.
基于背景估计的PDE系统用于非均匀背景下的文档二值化
文档二值化虽然已经得到了广泛的研究,但对于具有非均匀背景的图像,二值化仍然是一项具有挑战性的任务。背景估计(BE)是一种有效的预处理技术。先前基于be的二值化从输入中估计背景分量,并使用该估计对输入进行预处理以产生补偿图像,然后进行通常复杂的二值化处理步骤。在本文中,受BE的启发,我们在偏微分方程(PDE)框架下引入了一种用于NUB文档图像的二值化模型。具体来说,我们首先将NUB退化建模为一个不均匀缩放映射,该映射调整正常文档的对比度以产生NUB文档图像。换句话说,NUB文档图像可以表示为不均匀缩放图和正常图像(即补偿图像)的逐像素乘积。因此,补偿背景的问题就变成了从输入的NUB文档图像中恢复正常文档图像的恢复问题。基于NUB模型,提出了一种用于NUB退化文档图像二值化的PDE系统,该系统由以输入NUB图像为初始条件的演化PDE (w.r.t为缩放分量)和以纯白图像为初始条件的非线性扩散方程(w.r.t为补偿分量)组成。该系统在演化过程中动态交替估计缩放分量和被补偿图像。在最终的补偿图像中,文本和背景像素被划分为两个主模式,间隔为0.5,从而使我们能够仅通过间隔为0.5的二值投影来实现输入NUB文档图像的二值化。最后,对来自DIBCO公开数据集的29个NUB文档图像进行PDE系统评估,并与7个相关的PDE二值化模型进行比较。实验结果表明,本文提出的PDE系统在NUB文档图像的二值化方面优于7种比较模型。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Computers & Mathematics with Applications
Computers & Mathematics with Applications 工程技术-计算机:跨学科应用
CiteScore
5.10
自引率
10.30%
发文量
396
审稿时长
9.9 weeks
期刊介绍: Computers & Mathematics with Applications provides a medium of exchange for those engaged in fields contributing to building successful simulations for science and engineering using Partial Differential Equations (PDEs).
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信