{"title":"Text-independent writer identification by feature fusion","authors":"Yongjie Hu, Wenming Yang, Youbin Chen","doi":"10.1109/ICICIP.2014.7010295","DOIUrl":null,"url":null,"abstract":"This paper explores text-independent writer identification by combining Bag of Features (BoF), contour-hinge and SIFT scales feature. The BoF method adopted differs from the common BoF approach for writer identification in that it extracts SIFT descriptors and uses Locality-constrained Linear Coding to get feature vector of each document. The Locality-constrained Linear Coding (LLC) tries to reconstruct each feature through locality constraint and has much more discriminative power than the common used Vector Quantization (VQ). Contour-hinge feature can capture orientation and curvature of the ink trace. Modification is made to the original contour-hinge to improve the identification rate. Besides, we also use SIFT scale information and integrate these three kinds of features together. Experiments are conducted the challenging ICDAR2013 writer identification contest dataset and dataset for \"ICFHR2012 Writer Identification Contest, Challenge 1: Latin Documents\". The experiment results show that the proposed BoF approach outperforms the common ones that adopt VQ, and after the integration, our method achieves the best result on the entire ICDAR2013 and ICFHR2012 dataset under soft evaluation.","PeriodicalId":408041,"journal":{"name":"Fifth International Conference on Intelligent Control and Information Processing","volume":"124 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Fifth International Conference on Intelligent Control and Information Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICICIP.2014.7010295","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
This paper explores text-independent writer identification by combining Bag of Features (BoF), contour-hinge and SIFT scales feature. The BoF method adopted differs from the common BoF approach for writer identification in that it extracts SIFT descriptors and uses Locality-constrained Linear Coding to get feature vector of each document. The Locality-constrained Linear Coding (LLC) tries to reconstruct each feature through locality constraint and has much more discriminative power than the common used Vector Quantization (VQ). Contour-hinge feature can capture orientation and curvature of the ink trace. Modification is made to the original contour-hinge to improve the identification rate. Besides, we also use SIFT scale information and integrate these three kinds of features together. Experiments are conducted the challenging ICDAR2013 writer identification contest dataset and dataset for "ICFHR2012 Writer Identification Contest, Challenge 1: Latin Documents". The experiment results show that the proposed BoF approach outperforms the common ones that adopt VQ, and after the integration, our method achieves the best result on the entire ICDAR2013 and ICFHR2012 dataset under soft evaluation.
本文结合特征包(Bag of Features, BoF)、轮廓铰链(contour-hinge)和SIFT尺度特征,探讨了与文本无关的作者识别。所采用的BoF方法不同于常用的BoF方法,它提取SIFT描述符,并使用位置约束线性编码(Locality-constrained Linear Coding)得到每个文档的特征向量。局域约束线性编码(LLC)试图通过局域约束来重构每个特征,比常用的矢量量化(VQ)具有更强的判别能力。轮廓铰链特征可以捕捉墨迹的方向和曲率。对原轮廓铰链进行了改进,提高了识别率。此外,我们还利用SIFT尺度信息,将这三种特征融合在一起。对ICDAR2013作家识别大赛挑战性数据集和“ICFHR2012作家识别大赛挑战一:拉丁文献”数据集进行了实验。实验结果表明,本文提出的BoF方法优于常用的VQ方法,并且在集成后,在整个ICDAR2013和ICFHR2012数据集上进行软评价,得到了最好的结果。