{"title":"Boundary feature extraction from gray-scale document images","authors":"H. Nishida","doi":"10.1109/ICDAR.1997.619828","DOIUrl":null,"url":null,"abstract":"A novel method is presented for extracting closed boundaries of document components such as characters and symbols directly from gray-scale document images based on the surface data structures along with structural features. The method is based on the simple model assuming that a closed boundary of document components can be approximated as a series of horizontal line segments and can be extracted by linking surface components with steep gradients which share commonly intersecting horizontal planes. The proposed algorithm is compared with some binarization algorithms, shown to be effective for improving recognition accuracy for very poor quality data.","PeriodicalId":435320,"journal":{"name":"Proceedings of the Fourth International Conference on Document Analysis and Recognition","volume":"183 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1997-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the Fourth International Conference on Document Analysis and Recognition","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDAR.1997.619828","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
A novel method is presented for extracting closed boundaries of document components such as characters and symbols directly from gray-scale document images based on the surface data structures along with structural features. The method is based on the simple model assuming that a closed boundary of document components can be approximated as a series of horizontal line segments and can be extracted by linking surface components with steep gradients which share commonly intersecting horizontal planes. The proposed algorithm is compared with some binarization algorithms, shown to be effective for improving recognition accuracy for very poor quality data.