{"title":"Region based adaptive binarization for optical character recognition purposes","authors":"H. Michalak, K. Okarma","doi":"10.1109/IIPHDW.2018.8388391","DOIUrl":null,"url":null,"abstract":"Optical Character Recognition (OCR) applications usually require the use of uniformly illuminated images which can be obtained using the flatbed scanners. However, rapid development of mobile technologies causes the growing popularity of document images captured by built-in cameras being the integral parts of modern mobile phones and tablets. Many companies and administration offices accept not only the scanned documents but also high resolution photos which can be enough e.g. for insurance purposes. Unfortunately such images can be unevenly illuminated causing some problems for the OCR applications used for text recognition especially if the QR, Aztec or some other popular 2D codes are not present. Proper text recognition from camera images requires image preprocessing including its binarization which cannot be conducted using typical global thresholding due to the presence of local intensity changes. On the other hand the use of pixel-based adaptive methods is time-consuming and not always leads to satisfactory results. To fill this gap and balance the recognition accuracy and high processing speed a region based approach to image binarization is proposed in this paper being an extension of well-known Niblack thresholding algorithm.","PeriodicalId":405270,"journal":{"name":"2018 International Interdisciplinary PhD Workshop (IIPhDW)","volume":"164 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 International Interdisciplinary PhD Workshop (IIPhDW)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IIPHDW.2018.8388391","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 11
Abstract
Optical Character Recognition (OCR) applications usually require the use of uniformly illuminated images which can be obtained using the flatbed scanners. However, rapid development of mobile technologies causes the growing popularity of document images captured by built-in cameras being the integral parts of modern mobile phones and tablets. Many companies and administration offices accept not only the scanned documents but also high resolution photos which can be enough e.g. for insurance purposes. Unfortunately such images can be unevenly illuminated causing some problems for the OCR applications used for text recognition especially if the QR, Aztec or some other popular 2D codes are not present. Proper text recognition from camera images requires image preprocessing including its binarization which cannot be conducted using typical global thresholding due to the presence of local intensity changes. On the other hand the use of pixel-based adaptive methods is time-consuming and not always leads to satisfactory results. To fill this gap and balance the recognition accuracy and high processing speed a region based approach to image binarization is proposed in this paper being an extension of well-known Niblack thresholding algorithm.