Feature's Selection-Based Shape Complexity for Writer Identification Task

Proceedings of the 2020 International Conference on Pattern Recognition and Intelligent Systems
Pub Date: 2020-07-30
DOI: 10.1145/3415048.3416102

A. Bensefia, Chawki Djeddi


Citations: 1

Abstract



The writer identification task has attracted considerable research interest because of its wide range of applications. The literature offers different approaches based on various features. However, all of these approaches use all the information available in the handwritten sample, relevant or irrelevant, to identify the writer. In this paper, we propose an original approach based on a double feature selection process, where the features are graphemes produced by a segmentation process. In the first phase, these features are analyzed by their shape complexity using the Fourier elliptic transform, and a complexity score (FECS) is assigned to each grapheme. The second phase of feature selection eliminates redundancy among the resulting graphemes using a sequential clustering algorithm. Two similarity measures are proposed to evaluate the system on 100 writers of the IAM dataset. We obtained an identification rate of 96% using only 25 graphemes, which is equivalent to 3 to 4 words.
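The two selection phases can be illustrated with a minimal sketch. The abstract does not define how the FECS score is computed or which sequential clustering variant is used, so everything below is an assumption for illustration only: the coefficients follow the standard Kuhl-Giardina elliptic Fourier formulation, `complexity_score` is a hypothetical score (the share of harmonic power beyond the first harmonic), and `sequential_clustering` is a simple leader-style single-pass scheme with a hypothetical distance `threshold`.

```python
import numpy as np


def elliptic_fourier_coeffs(contour, order=10):
    """Kuhl-Giardina elliptic Fourier coefficients of a closed 2-D contour.

    Returns an (order, 4) array whose rows are (a_n, b_n, c_n, d_n).
    """
    pts = np.asarray(contour, dtype=float)
    # Close the contour and compute the segment vectors between points.
    deltas = np.diff(np.vstack([pts, pts[:1]]), axis=0)
    seg_len = np.hypot(deltas[:, 0], deltas[:, 1])
    keep = seg_len > 0                # drop zero-length segments
    deltas, seg_len = deltas[keep], seg_len[keep]
    t = np.cumsum(seg_len)            # arc length at the end of each segment
    t_prev = t - seg_len              # arc length at the start of each segment
    total = t[-1]                     # contour perimeter
    slope_x = deltas[:, 0] / seg_len
    slope_y = deltas[:, 1] / seg_len

    coeffs = np.zeros((order, 4))
    for n in range(1, order + 1):
        w = 2.0 * np.pi * n / total
        scale = total / (2.0 * n**2 * np.pi**2)
        d_cos = np.cos(w * t) - np.cos(w * t_prev)
        d_sin = np.sin(w * t) - np.sin(w * t_prev)
        coeffs[n - 1] = scale * np.array([
            np.sum(slope_x * d_cos), np.sum(slope_x * d_sin),
            np.sum(slope_y * d_cos), np.sum(slope_y * d_sin),
        ])
    return coeffs


def complexity_score(contour, order=10):
    """Hypothetical score (not the paper's FECS definition):
    share of harmonic power that lies beyond the first harmonic."""
    power = np.sum(elliptic_fourier_coeffs(contour, order) ** 2, axis=1)
    return float(1.0 - power[0] / power.sum())


def sequential_clustering(vectors, threshold):
    """Leader-style single-pass clustering (an assumed variant).

    Each vector joins the nearest existing cluster if its representative
    is within `threshold`; otherwise it starts a new cluster.
    Returns (indices of representatives, cluster label per vector).
    """
    vectors = np.asarray(vectors, dtype=float)
    representatives, labels = [], []
    for i, v in enumerate(vectors):
        dists = [np.linalg.norm(v - vectors[r]) for r in representatives]
        if dists and min(dists) <= threshold:
            labels.append(int(np.argmin(dists)))
        else:
            representatives.append(i)
            labels.append(len(representatives) - 1)
    return representatives, labels


# Toy contours standing in for grapheme outlines.
theta = np.linspace(0.0, 2.0 * np.pi, 200, endpoint=False)
circle = np.column_stack([np.cos(theta), np.sin(theta)])
radius = 1.0 + 0.5 * np.cos(5.0 * theta)
star = np.column_stack([radius * np.cos(theta), radius * np.sin(theta)])
print(complexity_score(circle), complexity_score(star))
```

Under these assumptions, a circular outline scores close to zero because a single harmonic describes it, while the five-lobed outline scores higher. Keeping one representative per cluster then mimics the redundancy-elimination phase; the paper's actual scoring and clustering rules may differ.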
