{"title":"CSM-based feature extraction for degraded machine printed character recognition","authors":"A. Namane, M. Maamoun, E. Soubari, P. Meyrueis","doi":"10.1109/UKRICIS.2010.5898105","DOIUrl":null,"url":null,"abstract":"This paper presents an OCR method for degraded character recognition applied to typewritten document produced by typesetting machine. The complementary similarity measure method (CSM) is a well known classification method and widely applied in the area of character recognition. In this work the CSM method is not only used as a classifier but also introduced as a feature extractor, and applied to degraded character recognition. The resulted CSM feature vector is used to train a multi layered perceptron (MLP). The use of the CSM as a feature extractor tends to boost the MLP and makes it very powerful and very well suited for rejection. Experimental results on n typewritten A4 page documents show the ability of the model to yield relevant and robust recognition on poor quality printed document characters.","PeriodicalId":359942,"journal":{"name":"2010 IEEE 9th International Conference on Cyberntic Intelligent Systems","volume":"33 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 IEEE 9th International Conference on Cyberntic Intelligent Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/UKRICIS.2010.5898105","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
This paper presents an OCR method for degraded character recognition applied to typewritten document produced by typesetting machine. The complementary similarity measure method (CSM) is a well known classification method and widely applied in the area of character recognition. In this work the CSM method is not only used as a classifier but also introduced as a feature extractor, and applied to degraded character recognition. The resulted CSM feature vector is used to train a multi layered perceptron (MLP). The use of the CSM as a feature extractor tends to boost the MLP and makes it very powerful and very well suited for rejection. Experimental results on n typewritten A4 page documents show the ability of the model to yield relevant and robust recognition on poor quality printed document characters.