{"title":"A simple text detection in document images using classification-based techniques","authors":"Khanabhorn Kawattikul, P. Chomphuwiset","doi":"10.1109/ISCMI.2017.8279610","DOIUrl":null,"url":null,"abstract":"Text regions can be useful to computer vision applications. It can be used to label and train automatic layout learning systems or to detect and locate the title, keywords, subheadings, paragraphs and image regions in images. This work proposes a technique to separate text regions from image documents. Images are divided into small non-overlapping windows. Textural features are extracted from these image windows before a classification is performed. Two refinement processes are carried out to reject misclassified windows, i.e window merging and Markov Random Files (MRFs). Window merging determine the similarity of a window and its neighbouring windows (based-on a distance-based technique). MRF examines the relationships between each window and it's neighbouring one using an energy minimization technique. The experimental results demonstrate that the refinement method is superior to the original classification without a refinement.","PeriodicalId":119111,"journal":{"name":"2017 IEEE 4th International Conference on Soft Computing & Machine Intelligence (ISCMI)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE 4th International Conference on Soft Computing & Machine Intelligence (ISCMI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISCMI.2017.8279610","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Text regions can be useful to computer vision applications. It can be used to label and train automatic layout learning systems or to detect and locate the title, keywords, subheadings, paragraphs and image regions in images. This work proposes a technique to separate text regions from image documents. Images are divided into small non-overlapping windows. Textural features are extracted from these image windows before a classification is performed. Two refinement processes are carried out to reject misclassified windows, i.e window merging and Markov Random Files (MRFs). Window merging determine the similarity of a window and its neighbouring windows (based-on a distance-based technique). MRF examines the relationships between each window and it's neighbouring one using an energy minimization technique. The experimental results demonstrate that the refinement method is superior to the original classification without a refinement.