{"title":"Segmentation and validation of commercial documents logical structure","authors":"Miguel Diogenes Matrakas, Flávio Bortolozzi","doi":"10.1109/ITCC.2000.844221","DOIUrl":null,"url":null,"abstract":"The main objective of the work is to present an approach to extract and validate the logical structure from the images that compose a commercial document. The nearest neighbor rule algorithm was used for labeling the elements, and the Run Length Smoothing Algorithm (RLSA) was used to segment the image of a commercial document of the type letter, official letter or memo. The most common classes considered are: date, logotype, text body, signature, addressee, invocation and greeting. The labeling of the elements is accomplished using the nearest neighbor rule algorithm with a vector comprising 28 characteristics. The accomplished study presented a good result for the classification of elements on commercial documents. It was created and used a base composed of 283 images of commercial documents in 256 gray levels for document element classification.","PeriodicalId":146581,"journal":{"name":"Proceedings International Conference on Information Technology: Coding and Computing (Cat. No.PR00540)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2000-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings International Conference on Information Technology: Coding and Computing (Cat. No.PR00540)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ITCC.2000.844221","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
The main objective of the work is to present an approach to extract and validate the logical structure from the images that compose a commercial document. The nearest neighbor rule algorithm was used for labeling the elements, and the Run Length Smoothing Algorithm (RLSA) was used to segment the image of a commercial document of the type letter, official letter or memo. The most common classes considered are: date, logotype, text body, signature, addressee, invocation and greeting. The labeling of the elements is accomplished using the nearest neighbor rule algorithm with a vector comprising 28 characteristics. The accomplished study presented a good result for the classification of elements on commercial documents. It was created and used a base composed of 283 images of commercial documents in 256 gray levels for document element classification.