{"title":"CUE: Compound Uniform Encoding for Writer Retrieval","authors":"Jiakai Luo, Hongwei Lu, Xin Nie, Shenghao Liu, Xianjun Deng, Chenlu Zhu","doi":"10.1109/MSN57253.2022.00125","DOIUrl":null,"url":null,"abstract":"Writer retrieval is crucial in document forensics and historical document analysis. However, due to the difference in syntactic structure between Chinese and other languages, the existing methods may not be directly applied to Chinese writer retrieval. Previous work on Chinese writer retrieval does not overcome the performance degradation problem when the number of samples grows. In this paper, we propose a novel compound uniform encoding algorithm (CUE) for Chinese writer retrieval, which mainly consists of a combined feature extraction module (CFE) and a prototype substitution module (PS). The CFE module combines two complementary features from image filter response and character contour. It counts local symmetries and edge co-occurrence pairs. PS module substitutes the outliers with the class prototypes to alleviate the influence of the outliers. Finally, the weighted Chi-square distance is applied to measure the similarity between writer and text. To verify the superiority of our proposed method, experiments are conducted on four public datasets and our built dataset. The results validate that CUE outperforms the state-of-the-art algorithms on mAP metric.","PeriodicalId":114459,"journal":{"name":"2022 18th International Conference on Mobility, Sensing and Networking (MSN)","volume":"35 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 18th International Conference on Mobility, Sensing and Networking (MSN)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MSN57253.2022.00125","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Writer retrieval is crucial in document forensics and historical document analysis. However, due to the difference in syntactic structure between Chinese and other languages, the existing methods may not be directly applied to Chinese writer retrieval. Previous work on Chinese writer retrieval does not overcome the performance degradation problem when the number of samples grows. In this paper, we propose a novel compound uniform encoding algorithm (CUE) for Chinese writer retrieval, which mainly consists of a combined feature extraction module (CFE) and a prototype substitution module (PS). The CFE module combines two complementary features from image filter response and character contour. It counts local symmetries and edge co-occurrence pairs. PS module substitutes the outliers with the class prototypes to alleviate the influence of the outliers. Finally, the weighted Chi-square distance is applied to measure the similarity between writer and text. To verify the superiority of our proposed method, experiments are conducted on four public datasets and our built dataset. The results validate that CUE outperforms the state-of-the-art algorithms on mAP metric.