Li-xin Zhang, Yannan Zhao, Zehong Yang, Jiaxin Wang
{"title":"手写体汉字识别中的特征选择","authors":"Li-xin Zhang, Yannan Zhao, Zehong Yang, Jiaxin Wang","doi":"10.1109/ICMLC.2002.1167382","DOIUrl":null,"url":null,"abstract":"Recognition of handwritten Chinese characters is a large-scale pattern recognition task, which is difficult and time consuming to build the corresponding classifiers. In this paper, two feature selection methods are proposed to reduce the complexity and speed up the handwritten Chinese recognition: one is the ReliefF-Wrapper method which evaluates the original features with the ReliefF method, and then uses the wrapper method to decide the number of features to be selected; and the other is GA-Wrapper that uses genetic algorithm to search the optimal subset of features with high training accuracy. Experiments were performed on 800 most frequently used Chinese characters, with 80,000 handwritten samples. Results show that the ReliefF-Wrapper method has good interpretation and high speed and GA-Wrapper gains higher accuracy. Limitations of the both methods and future work are also discussed.","PeriodicalId":90702,"journal":{"name":"Proceedings. International Conference on Machine Learning and Cybernetics","volume":"20 1","pages":"1158-1162 vol.3"},"PeriodicalIF":0.0000,"publicationDate":"2002-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":"{\"title\":\"Feature selection in recognition of handwritten Chinese characters\",\"authors\":\"Li-xin Zhang, Yannan Zhao, Zehong Yang, Jiaxin Wang\",\"doi\":\"10.1109/ICMLC.2002.1167382\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Recognition of handwritten Chinese characters is a large-scale pattern recognition task, which is difficult and time consuming to build the corresponding classifiers. In this paper, two feature selection methods are proposed to reduce the complexity and speed up the handwritten Chinese recognition: one is the ReliefF-Wrapper method which evaluates the original features with the ReliefF method, and then uses the wrapper method to decide the number of features to be selected; and the other is GA-Wrapper that uses genetic algorithm to search the optimal subset of features with high training accuracy. Experiments were performed on 800 most frequently used Chinese characters, with 80,000 handwritten samples. Results show that the ReliefF-Wrapper method has good interpretation and high speed and GA-Wrapper gains higher accuracy. Limitations of the both methods and future work are also discussed.\",\"PeriodicalId\":90702,\"journal\":{\"name\":\"Proceedings. International Conference on Machine Learning and Cybernetics\",\"volume\":\"20 1\",\"pages\":\"1158-1162 vol.3\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2002-11-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"9\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings. International Conference on Machine Learning and Cybernetics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICMLC.2002.1167382\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings. International Conference on Machine Learning and Cybernetics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMLC.2002.1167382","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Feature selection in recognition of handwritten Chinese characters
Recognition of handwritten Chinese characters is a large-scale pattern recognition task, which is difficult and time consuming to build the corresponding classifiers. In this paper, two feature selection methods are proposed to reduce the complexity and speed up the handwritten Chinese recognition: one is the ReliefF-Wrapper method which evaluates the original features with the ReliefF method, and then uses the wrapper method to decide the number of features to be selected; and the other is GA-Wrapper that uses genetic algorithm to search the optimal subset of features with high training accuracy. Experiments were performed on 800 most frequently used Chinese characters, with 80,000 handwritten samples. Results show that the ReliefF-Wrapper method has good interpretation and high speed and GA-Wrapper gains higher accuracy. Limitations of the both methods and future work are also discussed.