{"title":"A new framework based on signature patches, micro registration, and sparse representation for optical text recognition","authors":"R. F. Moghaddam, F. F. Moghaddam, M. Cheriet","doi":"10.1109/ISSPA.2012.6310485","DOIUrl":null,"url":null,"abstract":"A framework for development of segmentation-free optical recognizers of ancient manuscripts, which work free from line, word, and character segmentation, is proposed. The framework introduces a new representation of visual text using the concept of signature patches. These patches which are free from traditional guidelines of text, such as the baseline, are registered to each other using a microscale registration method based on the estimation of the active regions using a multilevel classifier, the directional map. Then, an one-dimensional feature vector is extracted from the registered signature patches, named spiral features. The incremental learning process is performed using a sparse representation using a dictionary of spiral feature atoms. The framework is applied to the George Washington database with promising results.","PeriodicalId":248763,"journal":{"name":"2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA)","volume":"85 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISSPA.2012.6310485","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6
Abstract
A framework for development of segmentation-free optical recognizers of ancient manuscripts, which work free from line, word, and character segmentation, is proposed. The framework introduces a new representation of visual text using the concept of signature patches. These patches which are free from traditional guidelines of text, such as the baseline, are registered to each other using a microscale registration method based on the estimation of the active regions using a multilevel classifier, the directional map. Then, an one-dimensional feature vector is extracted from the registered signature patches, named spiral features. The incremental learning process is performed using a sparse representation using a dictionary of spiral feature atoms. The framework is applied to the George Washington database with promising results.