{"title":"New features for Chinese character recognition","authors":"T. Caesar","doi":"10.1109/ICDAR.1997.620571","DOIUrl":null,"url":null,"abstract":"The wide range of shape variations for Chinese characters requires an adequate representation of the discriminating features for classification. For the recognition of Latin characters or numerals pixel values of a normalized raster image are proper features to reach very good recognition rates. But Chinese characters require a much higher resolution of the normalized raster image to enable a discrimination of complex shaped characters which leads to a feature space dimensionality of prohibitive computational effort for classification. Therefore feature extraction algorithms are needed which capture the discriminative characteristics of character shapes in a compact form. Several algorithms were proposed in the past and many of them are based on the contour data. This paper also introduces a contour based approach which is very time efficient and overcomes the problem of vanishing lines during anisotropic size normalization.","PeriodicalId":435320,"journal":{"name":"Proceedings of the Fourth International Conference on Document Analysis and Recognition","volume":"198 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1997-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the Fourth International Conference on Document Analysis and Recognition","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDAR.1997.620571","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The wide range of shape variations for Chinese characters requires an adequate representation of the discriminating features for classification. For the recognition of Latin characters or numerals pixel values of a normalized raster image are proper features to reach very good recognition rates. But Chinese characters require a much higher resolution of the normalized raster image to enable a discrimination of complex shaped characters which leads to a feature space dimensionality of prohibitive computational effort for classification. Therefore feature extraction algorithms are needed which capture the discriminative characteristics of character shapes in a compact form. Several algorithms were proposed in the past and many of them are based on the contour data. This paper also introduces a contour based approach which is very time efficient and overcomes the problem of vanishing lines during anisotropic size normalization.