Content retrieval from degraded document images using binarization technique
S. Tamilselvan, S. Sowmya
2014 International Conference on Computation of Power, Energy, Information and Communication (ICCPEIC), 2014-04-16
DOI: 10.1109/ICCPEIC.2014.6915401
Retrieving content from a degraded document image is a difficult task because of the high variation between the document background and the foreground text. To simplify the task, a novel system first constructs a contrast image based on the result of the local maximum-minimum algorithm. It then detects text stroke edges, which lie on high-contrast pixels in dark regions, using the Canny edge detection method and Otsu's method. Next, a local threshold value is calculated for the document image, and each pixel is compared with that threshold to give a result of '0' or '1'; hence the technique is called binarization. Finally, post-processing is applied to sharpen the text edges.
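
The steps described above can be illustrated with a minimal pure-Python sketch. It is not the authors' implementation, and several details are assumptions that the abstract does not state: the contrast formula `(max - min) / (max + min)`, the 3x3 window, the rule that a pixel is text when it is darker than the mean of nearby high-contrast pixels, and the convention that `0` marks text and `1` marks background. The Canny edge detection and post-processing steps are omitted here, so high-contrast pixels are selected with Otsu's method alone. All function names are hypothetical.

```python
EPS = 1e-9  # avoids division by zero in an all-black window


def window(img, r, c, radius):
    """Values in the square neighbourhood of (r, c), clipped at the borders."""
    rows, cols = len(img), len(img[0])
    return [img[i][j]
            for i in range(max(0, r - radius), min(rows, r + radius + 1))
            for j in range(max(0, c - radius), min(cols, c + radius + 1))]


def local_contrast(img, radius=1):
    """Assumed local max-min contrast: (max - min) / (max + min + EPS)."""
    out = []
    for r in range(len(img)):
        row = []
        for c in range(len(img[0])):
            w = window(img, r, c, radius)
            hi, lo = max(w), min(w)
            row.append((hi - lo) / (hi + lo + EPS))
        out.append(row)
    return out


def otsu_threshold(values, bins=256):
    """Otsu's method on values in [0, 1]: maximise between-class variance."""
    hist = [0] * bins
    for v in values:
        hist[min(bins - 1, int(v * bins))] += 1
    total = len(values)
    sum_all = sum(i * h for i, h in enumerate(hist))
    best_t, best_var = 0, -1.0
    w0 = sum0 = 0
    for t in range(bins):
        w0 += hist[t]
        sum0 += t * hist[t]
        w1 = total - w0
        if w0 == 0 or w1 == 0:
            continue
        m0, m1 = sum0 / w0, (sum_all - sum0) / w1
        var = w0 * w1 * (m0 - m1) ** 2
        if var > best_var:
            best_var, best_t = var, t
    return (best_t + 1) / bins  # values at or above this are "high contrast"


def binarize(img, radius=1):
    """Return a 0/1 image: 0 = text, 1 = background (assumed convention)."""
    contrast = local_contrast(img, radius)
    thr = otsu_threshold([v for row in contrast for v in row])
    # Keep intensities only at high-contrast pixels (candidate stroke edges).
    masked = [[p if v >= thr else None for p, v in zip(prow, crow)]
              for prow, crow in zip(img, contrast)]
    out = []
    for r in range(len(img)):
        row = []
        for c in range(len(img[0])):
            near = [v for v in window(masked, r, c, radius) if v is not None]
            if not near:
                row.append(1)  # no stroke edge nearby: treat as background
            else:
                local_thr = sum(near) / len(near)  # assumed local threshold
                row.append(0 if img[r][c] < local_thr else 1)
        out.append(row)
    return out


# A bright page (200) with one dark vertical stroke (30) in the middle column.
page = [[200, 200, 30, 200, 200] for _ in range(5)]
result = binarize(page)
```

For this synthetic page, every row of `result` is `[1, 1, 0, 1, 1]`: only the dark stroke column is labelled as text. Real degraded documents would need a larger window and a more robust threshold rule than the plain mean used in this sketch.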