{"title":"基于选择的特征形状复杂度的写作者识别","authors":"A. Bensefia, Chawki Djeddi","doi":"10.1145/3415048.3416102","DOIUrl":null,"url":null,"abstract":"Writer Identification task has attracted a lot of research interests due to its wide variety of applications. Different approaches based on various features exist in the literature. However, all these approaches use all the information available in the handwritten sample to identify the writer (relevant or irrelevant). In this paper, we propose an original approach based on a double feature selection process, where the features are represented by graphemes resulting from a segmentation process. These features are analyzed based on their shape complexity, using the Fourier Elliptic transform, and the complexity score is assigned to each grapheme (FECS). The second phase of feature selection is to eliminate the redundancy among the resulting using a sequential clustering algorithm. Two similarity measures are proposed to evaluate the proposed system on 100 writers of the IAM dataset. We obtained a good identification rate of 96% using only 25 graphemes, which is equivalent to 3--4 words.","PeriodicalId":122511,"journal":{"name":"Proceedings of the 2020 International Conference on Pattern Recognition and Intelligent Systems","volume":"13 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-07-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Feature's Selection-Based Shape Complexity for Writer Identification Task\",\"authors\":\"A. Bensefia, Chawki Djeddi\",\"doi\":\"10.1145/3415048.3416102\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Writer Identification task has attracted a lot of research interests due to its wide variety of applications. Different approaches based on various features exist in the literature. However, all these approaches use all the information available in the handwritten sample to identify the writer (relevant or irrelevant). In this paper, we propose an original approach based on a double feature selection process, where the features are represented by graphemes resulting from a segmentation process. These features are analyzed based on their shape complexity, using the Fourier Elliptic transform, and the complexity score is assigned to each grapheme (FECS). The second phase of feature selection is to eliminate the redundancy among the resulting using a sequential clustering algorithm. Two similarity measures are proposed to evaluate the proposed system on 100 writers of the IAM dataset. We obtained a good identification rate of 96% using only 25 graphemes, which is equivalent to 3--4 words.\",\"PeriodicalId\":122511,\"journal\":{\"name\":\"Proceedings of the 2020 International Conference on Pattern Recognition and Intelligent Systems\",\"volume\":\"13 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-07-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2020 International Conference on Pattern Recognition and Intelligent Systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3415048.3416102\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2020 International Conference on Pattern Recognition and Intelligent Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3415048.3416102","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Feature's Selection-Based Shape Complexity for Writer Identification Task
Writer Identification task has attracted a lot of research interests due to its wide variety of applications. Different approaches based on various features exist in the literature. However, all these approaches use all the information available in the handwritten sample to identify the writer (relevant or irrelevant). In this paper, we propose an original approach based on a double feature selection process, where the features are represented by graphemes resulting from a segmentation process. These features are analyzed based on their shape complexity, using the Fourier Elliptic transform, and the complexity score is assigned to each grapheme (FECS). The second phase of feature selection is to eliminate the redundancy among the resulting using a sequential clustering algorithm. Two similarity measures are proposed to evaluate the proposed system on 100 writers of the IAM dataset. We obtained a good identification rate of 96% using only 25 graphemes, which is equivalent to 3--4 words.