历史手写文件的混合二值化方法

IF 0.7 4区 计算机科学 Q4 COMPUTER SCIENCE, SOFTWARE ENGINEERING
D. G. Asatryan, M. E. Haroutunian, G. S. Sazhumyan, A. V. Kupriyanov, R. A. Paringer, D. V. Kirsh
{"title":"历史手写文件的混合二值化方法","authors":"D. G. Asatryan, M. E. Haroutunian, G. S. Sazhumyan, A. V. Kupriyanov, R. A. Paringer, D. V. Kirsh","doi":"10.1134/s0361768823090037","DOIUrl":null,"url":null,"abstract":"<h3 data-test=\"abstract-sub-heading\">Abstract</h3><p>Binarization of historical documents is a rather complex task that is being intensively studied by researchers all over the world. A large number of approaches, procedures, and binarization algorithms have been proposed, but methods that work equally well in all cases have not yet been proposed. The literature offers various criteria for assessing the quality of the binarization result. In the case of binarization of ancient handwritten texts, the criterion for the quality of the binarization algorithm is the degree of readability of the text using a visual method or technical means. One of the approaches proposed in the literature to improve the quality of the binarization result is pre-processing the original image using filtering methods, morphological analysis, spectral analysis, etc. This article proposes a hybrid binarization method, consisting of an arbitrary global or adaptive binarization algorithm and a special segmentation procedure for selecting segments of certain sizes. The proposed procedure makes it possible to identify objects of certain sizes in an image, in particular artifacts that exist in a binarized image. This work experimentally explores the possibility of improving the quality of a binary image by applying the proposed procedure.</p>","PeriodicalId":54555,"journal":{"name":"Programming and Computer Software","volume":null,"pages":null},"PeriodicalIF":0.7000,"publicationDate":"2024-01-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Hybrid Binarization Method for Historical Handwritten Documents\",\"authors\":\"D. G. Asatryan, M. E. Haroutunian, G. S. Sazhumyan, A. V. Kupriyanov, R. A. Paringer, D. V. Kirsh\",\"doi\":\"10.1134/s0361768823090037\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<h3 data-test=\\\"abstract-sub-heading\\\">Abstract</h3><p>Binarization of historical documents is a rather complex task that is being intensively studied by researchers all over the world. A large number of approaches, procedures, and binarization algorithms have been proposed, but methods that work equally well in all cases have not yet been proposed. The literature offers various criteria for assessing the quality of the binarization result. In the case of binarization of ancient handwritten texts, the criterion for the quality of the binarization algorithm is the degree of readability of the text using a visual method or technical means. One of the approaches proposed in the literature to improve the quality of the binarization result is pre-processing the original image using filtering methods, morphological analysis, spectral analysis, etc. This article proposes a hybrid binarization method, consisting of an arbitrary global or adaptive binarization algorithm and a special segmentation procedure for selecting segments of certain sizes. The proposed procedure makes it possible to identify objects of certain sizes in an image, in particular artifacts that exist in a binarized image. This work experimentally explores the possibility of improving the quality of a binary image by applying the proposed procedure.</p>\",\"PeriodicalId\":54555,\"journal\":{\"name\":\"Programming and Computer Software\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.7000,\"publicationDate\":\"2024-01-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Programming and Computer Software\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://doi.org/10.1134/s0361768823090037\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"COMPUTER SCIENCE, SOFTWARE ENGINEERING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Programming and Computer Software","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1134/s0361768823090037","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}
引用次数: 0

摘要

摘要 历史文献的二值化是一项相当复杂的任务,全世界的研究人员都在对其进行深入研究。人们提出了大量的方法、程序和二值化算法,但尚未提出在所有情况下都同样有效的方法。文献提供了各种评估二值化结果质量的标准。就古代手写文本的二值化而言,二值化算法质量的标准是使用视觉方法或技术手段对文本的可读程度进行评估。文献中提出的提高二值化结果质量的方法之一是使用滤波方法、形态分析、光谱分析等对原始图像进行预处理。本文提出了一种混合二值化方法,由任意全局或自适应二值化算法和用于选择特定大小片段的特殊分割程序组成。所提出的程序可以识别图像中特定大小的物体,特别是二值化图像中存在的伪影。这项工作通过实验探索了应用所建议的程序提高二值化图像质量的可能性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Hybrid Binarization Method for Historical Handwritten Documents

Hybrid Binarization Method for Historical Handwritten Documents

Abstract

Binarization of historical documents is a rather complex task that is being intensively studied by researchers all over the world. A large number of approaches, procedures, and binarization algorithms have been proposed, but methods that work equally well in all cases have not yet been proposed. The literature offers various criteria for assessing the quality of the binarization result. In the case of binarization of ancient handwritten texts, the criterion for the quality of the binarization algorithm is the degree of readability of the text using a visual method or technical means. One of the approaches proposed in the literature to improve the quality of the binarization result is pre-processing the original image using filtering methods, morphological analysis, spectral analysis, etc. This article proposes a hybrid binarization method, consisting of an arbitrary global or adaptive binarization algorithm and a special segmentation procedure for selecting segments of certain sizes. The proposed procedure makes it possible to identify objects of certain sizes in an image, in particular artifacts that exist in a binarized image. This work experimentally explores the possibility of improving the quality of a binary image by applying the proposed procedure.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Programming and Computer Software
Programming and Computer Software 工程技术-计算机:软件工程
CiteScore
1.60
自引率
28.60%
发文量
35
审稿时长
>12 weeks
期刊介绍: Programming and Computer Software is a peer reviewed journal devoted to problems in all areas of computer science: operating systems, compiler technology, software engineering, artificial intelligence, etc.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信