D. G. Asatryan, M. E. Haroutunian, G. S. Sazhumyan, A. V. Kupriyanov, R. A. Paringer, D. V. Kirsh
{"title":"历史手写文件的混合二值化方法","authors":"D. G. Asatryan, M. E. Haroutunian, G. S. Sazhumyan, A. V. Kupriyanov, R. A. Paringer, D. V. Kirsh","doi":"10.1134/s0361768823090037","DOIUrl":null,"url":null,"abstract":"<h3 data-test=\"abstract-sub-heading\">Abstract</h3><p>Binarization of historical documents is a rather complex task that is being intensively studied by researchers all over the world. A large number of approaches, procedures, and binarization algorithms have been proposed, but methods that work equally well in all cases have not yet been proposed. The literature offers various criteria for assessing the quality of the binarization result. In the case of binarization of ancient handwritten texts, the criterion for the quality of the binarization algorithm is the degree of readability of the text using a visual method or technical means. One of the approaches proposed in the literature to improve the quality of the binarization result is pre-processing the original image using filtering methods, morphological analysis, spectral analysis, etc. This article proposes a hybrid binarization method, consisting of an arbitrary global or adaptive binarization algorithm and a special segmentation procedure for selecting segments of certain sizes. The proposed procedure makes it possible to identify objects of certain sizes in an image, in particular artifacts that exist in a binarized image. This work experimentally explores the possibility of improving the quality of a binary image by applying the proposed procedure.</p>","PeriodicalId":54555,"journal":{"name":"Programming and Computer Software","volume":null,"pages":null},"PeriodicalIF":0.7000,"publicationDate":"2024-01-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Hybrid Binarization Method for Historical Handwritten Documents\",\"authors\":\"D. G. Asatryan, M. E. Haroutunian, G. S. Sazhumyan, A. V. Kupriyanov, R. A. Paringer, D. V. Kirsh\",\"doi\":\"10.1134/s0361768823090037\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<h3 data-test=\\\"abstract-sub-heading\\\">Abstract</h3><p>Binarization of historical documents is a rather complex task that is being intensively studied by researchers all over the world. A large number of approaches, procedures, and binarization algorithms have been proposed, but methods that work equally well in all cases have not yet been proposed. The literature offers various criteria for assessing the quality of the binarization result. In the case of binarization of ancient handwritten texts, the criterion for the quality of the binarization algorithm is the degree of readability of the text using a visual method or technical means. One of the approaches proposed in the literature to improve the quality of the binarization result is pre-processing the original image using filtering methods, morphological analysis, spectral analysis, etc. This article proposes a hybrid binarization method, consisting of an arbitrary global or adaptive binarization algorithm and a special segmentation procedure for selecting segments of certain sizes. The proposed procedure makes it possible to identify objects of certain sizes in an image, in particular artifacts that exist in a binarized image. This work experimentally explores the possibility of improving the quality of a binary image by applying the proposed procedure.</p>\",\"PeriodicalId\":54555,\"journal\":{\"name\":\"Programming and Computer Software\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.7000,\"publicationDate\":\"2024-01-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Programming and Computer Software\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://doi.org/10.1134/s0361768823090037\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"COMPUTER SCIENCE, SOFTWARE ENGINEERING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Programming and Computer Software","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1134/s0361768823090037","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}
Hybrid Binarization Method for Historical Handwritten Documents
Abstract
Binarization of historical documents is a rather complex task that is being intensively studied by researchers all over the world. A large number of approaches, procedures, and binarization algorithms have been proposed, but methods that work equally well in all cases have not yet been proposed. The literature offers various criteria for assessing the quality of the binarization result. In the case of binarization of ancient handwritten texts, the criterion for the quality of the binarization algorithm is the degree of readability of the text using a visual method or technical means. One of the approaches proposed in the literature to improve the quality of the binarization result is pre-processing the original image using filtering methods, morphological analysis, spectral analysis, etc. This article proposes a hybrid binarization method, consisting of an arbitrary global or adaptive binarization algorithm and a special segmentation procedure for selecting segments of certain sizes. The proposed procedure makes it possible to identify objects of certain sizes in an image, in particular artifacts that exist in a binarized image. This work experimentally explores the possibility of improving the quality of a binary image by applying the proposed procedure.
期刊介绍:
Programming and Computer Software is a peer reviewed journal devoted to problems in all areas of computer science: operating systems, compiler technology, software engineering, artificial intelligence, etc.