Title: Use of the Hough transform to separate merged text/graphics in forms
Author: J. Gloger
DOI: 10.1109/ICPR.1992.201770
URL: https://doi.org/10.1109/ICPR.1992.201770
Journal: 模式识别与人工智能 (volume 1 1, pages 268-271)
Published: 1992-08-30
Citations: 16
Abstract
Presents a new method for separating merged text and form-structure components in forms. The technique uses a modified version of the Hough transform to detect the structure of the form. The closed contours of the connected components are approximated by piecewise linear segments. The parameters of the Hesse normal form of each line segment serve as input to the Hough transform. Compared with the vectorized boundaries of characters, the lines of the form structure consist of appreciably more line segments sharing the same orientation and distance from the origin. The problem of detecting the form structure in the database of line segments can therefore be reduced to detecting local peaks in the Hough space. Subsequent processing steps reconstruct the remaining contour fragments into characters.
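The idea of voting with Hesse normal form parameters and then looking for local peaks can be illustrated with a minimal sketch. This is not the paper's modified Hough transform, whose details the abstract does not give; it is a generic segment-based accumulator. The function names (`hesse_params`, `accumulate`, `find_peaks`), the length-weighted voting, the bin sizes, and the `min_votes` threshold are all assumptions made for illustration.

```python
import math
from collections import defaultdict


def hesse_params(x1, y1, x2, y2):
    """Return (theta, rho) for the line through a segment.

    The line satisfies x*cos(theta) + y*sin(theta) = rho, with theta
    normalized to [0, pi) and rho signed.
    """
    dx, dy = x2 - x1, y2 - y1
    # The normal is the segment direction rotated by 90 degrees: (-dy, dx).
    theta = math.atan2(dx, -dy) % math.pi
    rho = x1 * math.cos(theta) + y1 * math.sin(theta)
    return theta, rho


def accumulate(segments, theta_step=math.radians(2.0), rho_step=2.0):
    """Vote each segment into a sparse (theta, rho) grid.

    Votes are weighted by segment length (an assumption of this sketch).
    """
    acc = defaultdict(float)
    n_theta = round(math.pi / theta_step)
    for x1, y1, x2, y2 in segments:
        length = math.hypot(x2 - x1, y2 - y1)
        if length == 0:
            continue  # a degenerate segment has no orientation
        theta, rho = hesse_params(x1, y1, x2, y2)
        t = round(theta / theta_step)
        if t >= n_theta:
            # theta close to pi describes the same line as theta = 0
            # with the sign of rho flipped.
            t, rho = 0, -rho
        acc[(t, round(rho / rho_step))] += length
    return acc


def find_peaks(acc, min_votes):
    """Return cells that reach min_votes and are not exceeded by any of
    their 8 neighbours, strongest first."""
    peaks = []
    for (t, r), votes in acc.items():
        if votes < min_votes:
            continue
        neighbours = [
            acc.get((t + dt, r + dr), 0.0)
            for dt in (-1, 0, 1)
            for dr in (-1, 0, 1)
            if (dt, dr) != (0, 0)
        ]
        if all(votes >= n for n in neighbours):
            peaks.append((t, r, votes))
    return sorted(peaks, key=lambda p: -p[2])


# Hypothetical contour segments: three collinear pieces of a horizontal
# form line at y = 50, plus four short strokes of a touching character.
segments = [
    (0, 50, 40, 50), (60, 50, 120, 50), (140, 50, 200, 50),
    (50, 40, 53, 46), (53, 46, 50, 52), (50, 52, 47, 46), (47, 46, 50, 40),
]
peaks = find_peaks(accumulate(segments), min_votes=50.0)
for t, r, votes in peaks:
    print(f"theta bin {t}, rho bin {r}, votes {votes:.1f}")
```

In this toy input the three collinear pieces of the form line fall into the same accumulator cell and add up, while the short character strokes scatter across cells with few votes each, which is the contrast the abstract relies on. A real form would need the bin sizes and threshold tuned to the scan resolution and line thickness.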