Detection and segmentation of lines and words in Gurmukhi handwritten text

2010 IEEE 2nd International Advance Computing Conference (IACC) Pub Date : 2010-03-01 DOI:10.1109/IADCC.2010.5422927

Rajiv Kumar, Amardeep Singh

{"title":"Detection and segmentation of lines and words in Gurmukhi handwritten text","authors":"Rajiv Kumar, Amardeep Singh","doi":"10.1109/IADCC.2010.5422927","DOIUrl":null,"url":null,"abstract":"The scanned text image is a non editable image though it has the text but one can not edit it or make any change, if required, to that scanned document. This provides a basis for the optical character recognition (OCR) theory. OCR is the process of recognizing a segmented part of the scanned image as a character. The overall OCR process consists of three major sub processes like pre processing, segmentation and then recognition. Out of these three, the segmentation process is the back bone of the overall OCR process. We can say that the segmentation process is the most significant process because if the segmentation is incorrect then we can not have the correct results; it is just like garbage in and garbage out. But it is not an easy job, because segmentation is one of the complex processes. It is more difficult if the document is handwritten because in that case only few points are there which can be used to make segmentation. In this paper, we formulate an approach to segment the scanned document image. As per this approach, initially this considers the whole image as one large window. Then this large window is broken into less large windows giving lines, once the lines are identified then each window consisting of a line is used to find a word present in that line and finally to characters. For that purpose we used the concept of variable sized window, that is, the window whose size can be adjusted according to needs. This concept was implemented and results were analyzed. After the analysis the same concept was modified and finally tried on different documents and we got good reasonable results.","PeriodicalId":249763,"journal":{"name":"2010 IEEE 2nd International Advance Computing Conference (IACC)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"34","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 IEEE 2nd International Advance Computing Conference (IACC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IADCC.2010.5422927","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 34

Abstract

The scanned text image is a non editable image though it has the text but one can not edit it or make any change, if required, to that scanned document. This provides a basis for the optical character recognition (OCR) theory. OCR is the process of recognizing a segmented part of the scanned image as a character. The overall OCR process consists of three major sub processes like pre processing, segmentation and then recognition. Out of these three, the segmentation process is the back bone of the overall OCR process. We can say that the segmentation process is the most significant process because if the segmentation is incorrect then we can not have the correct results; it is just like garbage in and garbage out. But it is not an easy job, because segmentation is one of the complex processes. It is more difficult if the document is handwritten because in that case only few points are there which can be used to make segmentation. In this paper, we formulate an approach to segment the scanned document image. As per this approach, initially this considers the whole image as one large window. Then this large window is broken into less large windows giving lines, once the lines are identified then each window consisting of a line is used to find a word present in that line and finally to characters. For that purpose we used the concept of variable sized window, that is, the window whose size can be adjusted according to needs. This concept was implemented and results were analyzed. After the analysis the same concept was modified and finally tried on different documents and we got good reasonable results.

查看原文本刊更多论文

古穆克语手写文本的行字检测与分词

扫描的文本图像是一个不可编辑的图像，虽然它有文本，但一个人不能编辑它或做任何改变，如果需要，扫描的文件。这为光学字符识别(OCR)理论提供了基础。OCR是将扫描图像的分割部分识别为字符的过程。整个OCR过程包括预处理、分割和识别三个主要的子过程。在这三个过程中，分割过程是整个OCR过程的骨干。我们可以说，分割过程是最重要的过程，因为如果分割不正确，我们就无法得到正确的结果;这就像垃圾进垃圾出。但这不是一件容易的工作，因为分割是一个复杂的过程。如果文档是手写的，这就更困难了，因为在这种情况下，只有很少的点可以用来分割。本文提出了一种对扫描文档图像进行分割的方法。根据这种方法，最初它将整个图像视为一个大窗口。然后这个大窗口被分成几个较小的窗口，给出行，一旦行被识别出来，每个由行组成的窗口被用来查找该行中出现的单词，最后是字符。为此，我们使用了可变大小窗口的概念，即窗口的大小可以根据需要进行调整。对这一概念进行了实施，并对结果进行了分析。经过分析，对相同的概念进行了修改，最后在不同的文档上进行了尝试，得到了很好的合理结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2010 IEEE 2nd International Advance Computing Conference (IACC)

自引率

0.00%

发文量