{"title":"Towards Physical Distortion Identification and Removal in Document Images","authors":"Tan Lu, A. Dooms","doi":"10.1109/EUVIP.2018.8611786","DOIUrl":null,"url":null,"abstract":"Physical distortions, next to digital artefacts, are commonly seen in document images. Their presence sabotages the optical character recognition (OCR) process which not only leads to a reduced amount of automatically retrievable content, but also deteriorates the performance of other document analysis algorithms that rely on layout analysis or content recognition. This paper proposes a method to identify and remove certain types of physical distortions from document images. By exploiting the intensity and spatial relation of distorted pixels, we construct a conditional random field (CRF) based method for distortion identification. Furthermore, a peak searching method is proposed so that the model parameters of the energy functions in the conditional probability are automatically learnt from the image. Discrimination of the pixels from original document content and those from physical noises is obtained by maximizing the conditional probability in the CRF model. Examples from real-life image samples demonstrate the effectiveness of the proposed method.","PeriodicalId":252212,"journal":{"name":"2018 7th European Workshop on Visual Information Processing (EUVIP)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 7th European Workshop on Visual Information Processing (EUVIP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/EUVIP.2018.8611786","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Physical distortions, next to digital artefacts, are commonly seen in document images. Their presence sabotages the optical character recognition (OCR) process which not only leads to a reduced amount of automatically retrievable content, but also deteriorates the performance of other document analysis algorithms that rely on layout analysis or content recognition. This paper proposes a method to identify and remove certain types of physical distortions from document images. By exploiting the intensity and spatial relation of distorted pixels, we construct a conditional random field (CRF) based method for distortion identification. Furthermore, a peak searching method is proposed so that the model parameters of the energy functions in the conditional probability are automatically learnt from the image. Discrimination of the pixels from original document content and those from physical noises is obtained by maximizing the conditional probability in the CRF model. Examples from real-life image samples demonstrate the effectiveness of the proposed method.