Hybrid Binarization Method for Historical Handwritten Documents

IF 0.5, Zone 4 (Computer Science), JCR Q4 (COMPUTER SCIENCE, SOFTWARE ENGINEERING)

Programming and Computer Software Pub Date : 2024-01-26 DOI:10.1134/s0361768823090037

D. G. Asatryan, M. E. Haroutunian, G. S. Sazhumyan, A. V. Kupriyanov, R. A. Paringer, D. V. Kirsh


Citations: 0



Abstract

Binarization of historical documents is a rather complex task that is being intensively studied by researchers all over the world. A large number of approaches, procedures, and binarization algorithms have been proposed, but methods that work equally well in all cases have not yet been proposed. The literature offers various criteria for assessing the quality of the binarization result. In the case of binarization of ancient handwritten texts, the criterion for the quality of the binarization algorithm is the degree of readability of the text using a visual method or technical means. One of the approaches proposed in the literature to improve the quality of the binarization result is pre-processing the original image using filtering methods, morphological analysis, spectral analysis, etc. This article proposes a hybrid binarization method, consisting of an arbitrary global or adaptive binarization algorithm and a special segmentation procedure for selecting segments of certain sizes. The proposed procedure makes it possible to identify objects of certain sizes in an image, in particular artifacts that exist in a binarized image. This work experimentally explores the possibility of improving the quality of a binary image by applying the proposed procedure.
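The two-stage idea in the abstract (an arbitrary global or adaptive binarization algorithm followed by a procedure that selects segments of certain sizes) can be illustrated with a minimal sketch. The abstract does not specify the segmentation procedure, so everything below is an assumption made for illustration: Otsu's method stands in for the "arbitrary global algorithm", the size selection is a plain 4-connected component area filter, and the names `otsu_threshold`, `binarize`, `filter_segments_by_size`, and `hybrid_binarize` are hypothetical rather than the authors' implementation.

```python
from collections import deque


def otsu_threshold(gray):
    """Otsu's global threshold for a 2D list of integers in 0..255."""
    hist = [0] * 256
    for row in gray:
        for value in row:
            hist[value] += 1
    total = sum(hist)
    sum_all = sum(level * count for level, count in enumerate(hist))
    best_t, best_var = 0, -1.0
    w0 = 0    # number of pixels at or below the candidate threshold
    sum0 = 0  # intensity sum of those pixels
    for t in range(256):
        w0 += hist[t]
        if w0 == 0:
            continue
        w1 = total - w0
        if w1 == 0:
            break
        sum0 += t * hist[t]
        m0 = sum0 / w0
        m1 = (sum_all - sum0) / w1
        between_var = w0 * w1 * (m0 - m1) ** 2  # between-class variance (unnormalized)
        if between_var > best_var:
            best_var, best_t = between_var, t
    return best_t


def binarize(gray, threshold):
    """Mark dark pixels (ink) as 1 and light pixels (background) as 0."""
    return [[1 if value <= threshold else 0 for value in row] for row in gray]


def filter_segments_by_size(binary, min_size, max_size=None):
    """Keep 4-connected foreground segments whose pixel count is within the given range.

    Assumption: "segments of certain sizes" is modeled here as connected-component area.
    """
    height, width = len(binary), len(binary[0])
    seen = [[False] * width for _ in range(height)]
    out = [[0] * width for _ in range(height)]
    for y in range(height):
        for x in range(width):
            if binary[y][x] != 1 or seen[y][x]:
                continue
            # Breadth-first search collects one connected segment.
            seen[y][x] = True
            queue = deque([(y, x)])
            segment = []
            while queue:
                cy, cx = queue.popleft()
                segment.append((cy, cx))
                for ny, nx in ((cy - 1, cx), (cy + 1, cx), (cy, cx - 1), (cy, cx + 1)):
                    if 0 <= ny < height and 0 <= nx < width:
                        if binary[ny][nx] == 1 and not seen[ny][nx]:
                            seen[ny][nx] = True
                            queue.append((ny, nx))
            size = len(segment)
            if size >= min_size and (max_size is None or size <= max_size):
                for py, px in segment:
                    out[py][px] = 1
    return out


def hybrid_binarize(gray, min_size, max_size=None):
    """Global binarization followed by size-based segment selection."""
    threshold = otsu_threshold(gray)
    return filter_segments_by_size(binarize(gray, threshold), min_size, max_size)
```

Calling `hybrid_binarize(gray, min_size=2)` on a small grayscale grid would drop isolated single-pixel specks while keeping longer strokes. In practice the size bounds would need tuning per document, and an adaptive thresholding step could replace the Otsu stage without changing the size filter.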


Source journal

Programming and Computer Software (Engineering & Technology, Computer Science: Software Engineering)

CiteScore

1.60

Self-citation rate

28.60%

Articles published

Review time

>12 weeks

Journal description: Programming and Computer Software is a peer-reviewed journal devoted to problems in all areas of computer science: operating systems, compiler technology, software engineering, artificial intelligence, etc.