{"title":"基于csm的退化机器打印字符识别特征提取","authors":"A. Namane, M. Maamoun, E. Soubari, P. Meyrueis","doi":"10.1109/UKRICIS.2010.5898105","DOIUrl":null,"url":null,"abstract":"This paper presents an OCR method for degraded character recognition applied to typewritten document produced by typesetting machine. The complementary similarity measure method (CSM) is a well known classification method and widely applied in the area of character recognition. In this work the CSM method is not only used as a classifier but also introduced as a feature extractor, and applied to degraded character recognition. The resulted CSM feature vector is used to train a multi layered perceptron (MLP). The use of the CSM as a feature extractor tends to boost the MLP and makes it very powerful and very well suited for rejection. Experimental results on n typewritten A4 page documents show the ability of the model to yield relevant and robust recognition on poor quality printed document characters.","PeriodicalId":359942,"journal":{"name":"2010 IEEE 9th International Conference on Cyberntic Intelligent Systems","volume":"33 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"CSM-based feature extraction for degraded machine printed character recognition\",\"authors\":\"A. Namane, M. Maamoun, E. Soubari, P. Meyrueis\",\"doi\":\"10.1109/UKRICIS.2010.5898105\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper presents an OCR method for degraded character recognition applied to typewritten document produced by typesetting machine. The complementary similarity measure method (CSM) is a well known classification method and widely applied in the area of character recognition. In this work the CSM method is not only used as a classifier but also introduced as a feature extractor, and applied to degraded character recognition. The resulted CSM feature vector is used to train a multi layered perceptron (MLP). The use of the CSM as a feature extractor tends to boost the MLP and makes it very powerful and very well suited for rejection. Experimental results on n typewritten A4 page documents show the ability of the model to yield relevant and robust recognition on poor quality printed document characters.\",\"PeriodicalId\":359942,\"journal\":{\"name\":\"2010 IEEE 9th International Conference on Cyberntic Intelligent Systems\",\"volume\":\"33 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2010-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2010 IEEE 9th International Conference on Cyberntic Intelligent Systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/UKRICIS.2010.5898105\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 IEEE 9th International Conference on Cyberntic Intelligent Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/UKRICIS.2010.5898105","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
CSM-based feature extraction for degraded machine printed character recognition
This paper presents an OCR method for degraded character recognition applied to typewritten document produced by typesetting machine. The complementary similarity measure method (CSM) is a well known classification method and widely applied in the area of character recognition. In this work the CSM method is not only used as a classifier but also introduced as a feature extractor, and applied to degraded character recognition. The resulted CSM feature vector is used to train a multi layered perceptron (MLP). The use of the CSM as a feature extractor tends to boost the MLP and makes it very powerful and very well suited for rejection. Experimental results on n typewritten A4 page documents show the ability of the model to yield relevant and robust recognition on poor quality printed document characters.