{"title":"A methodology of separating images from text using an OCR approach","authors":"N. Bourbakis","doi":"10.1109/IJSIS.1996.565084","DOIUrl":null,"url":null,"abstract":"This paper presents a document processing methodology based on an OCR approach. The document methodology separates text from images by keeping their relationships for a possible reconstruction of the original page. The text separation and extraction is based on a hierarchical framing process. The process starts with the framing a single character, after its recognition, continues with the framing of a word, and ends with the framing of all text lines.","PeriodicalId":437491,"journal":{"name":"Proceedings IEEE International Joint Symposia on Intelligence and Systems","volume":"34 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1996-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings IEEE International Joint Symposia on Intelligence and Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IJSIS.1996.565084","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
This paper presents a document processing methodology based on an OCR approach. The document methodology separates text from images by keeping their relationships for a possible reconstruction of the original page. The text separation and extraction is based on a hierarchical framing process. The process starts with the framing a single character, after its recognition, continues with the framing of a word, and ends with the framing of all text lines.