{"title":"Image processing based degraded camera captured document enhancement for improved OCR accuracy","authors":"Pooja Sharma, Shanu Sharma","doi":"10.1109/CONFLUENCE.2016.7508160","DOIUrl":null,"url":null,"abstract":"Over the past decade the document analysis and processing related to camera based document images has gained the interest of research community. Nowadays, cameras are easily available in the smart phones that can be carried in the small space of our pockets while being lightweight, portable and relieving us from the burden of walking down to a scanner for a digital copy of a document. But even though capturing a document image through a phone camera appears simple, the chances of obtaining a perfect picture are scanty. As when the picture is captured in an unconstrained environment, there are chances of degradation to creep in that will hamper the visual quality of the document image which further effect the readability(in terms of OCR accuracy). Low quality documents give poor results. Document images contain various degradations such as blur, uneven illumination, perspective distortion, low resolution, smear etc. Quality enhancement is helpful to recognize a camera captured document more accurately and if not completely removing the degradations, it can be used for suppressing them and making the text more readable. This paper evaluates the performance of various deblurring techniques for noisy and blurred camera captured documents.","PeriodicalId":299044,"journal":{"name":"2016 6th International Conference - Cloud System and Big Data Engineering (Confluence)","volume":"42 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 6th International Conference - Cloud System and Big Data Engineering (Confluence)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CONFLUENCE.2016.7508160","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 9
Abstract
Over the past decade the document analysis and processing related to camera based document images has gained the interest of research community. Nowadays, cameras are easily available in the smart phones that can be carried in the small space of our pockets while being lightweight, portable and relieving us from the burden of walking down to a scanner for a digital copy of a document. But even though capturing a document image through a phone camera appears simple, the chances of obtaining a perfect picture are scanty. As when the picture is captured in an unconstrained environment, there are chances of degradation to creep in that will hamper the visual quality of the document image which further effect the readability(in terms of OCR accuracy). Low quality documents give poor results. Document images contain various degradations such as blur, uneven illumination, perspective distortion, low resolution, smear etc. Quality enhancement is helpful to recognize a camera captured document more accurately and if not completely removing the degradations, it can be used for suppressing them and making the text more readable. This paper evaluates the performance of various deblurring techniques for noisy and blurred camera captured documents.