CUE: Compound Uniform Encoding for Writer Retrieval

Jiakai Luo, Hongwei Lu, Xin Nie, Shenghao Liu, Xianjun Deng, Chenlu Zhu
{"title":"CUE: Compound Uniform Encoding for Writer Retrieval","authors":"Jiakai Luo, Hongwei Lu, Xin Nie, Shenghao Liu, Xianjun Deng, Chenlu Zhu","doi":"10.1109/MSN57253.2022.00125","DOIUrl":null,"url":null,"abstract":"Writer retrieval is crucial in document forensics and historical document analysis. However, due to the difference in syntactic structure between Chinese and other languages, the existing methods may not be directly applied to Chinese writer retrieval. Previous work on Chinese writer retrieval does not overcome the performance degradation problem when the number of samples grows. In this paper, we propose a novel compound uniform encoding algorithm (CUE) for Chinese writer retrieval, which mainly consists of a combined feature extraction module (CFE) and a prototype substitution module (PS). The CFE module combines two complementary features from image filter response and character contour. It counts local symmetries and edge co-occurrence pairs. PS module substitutes the outliers with the class prototypes to alleviate the influence of the outliers. Finally, the weighted Chi-square distance is applied to measure the similarity between writer and text. To verify the superiority of our proposed method, experiments are conducted on four public datasets and our built dataset. The results validate that CUE outperforms the state-of-the-art algorithms on mAP metric.","PeriodicalId":114459,"journal":{"name":"2022 18th International Conference on Mobility, Sensing and Networking (MSN)","volume":"35 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 18th International Conference on Mobility, Sensing and Networking (MSN)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MSN57253.2022.00125","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Writer retrieval is crucial in document forensics and historical document analysis. However, due to the difference in syntactic structure between Chinese and other languages, the existing methods may not be directly applied to Chinese writer retrieval. Previous work on Chinese writer retrieval does not overcome the performance degradation problem when the number of samples grows. In this paper, we propose a novel compound uniform encoding algorithm (CUE) for Chinese writer retrieval, which mainly consists of a combined feature extraction module (CFE) and a prototype substitution module (PS). The CFE module combines two complementary features from image filter response and character contour. It counts local symmetries and edge co-occurrence pairs. PS module substitutes the outliers with the class prototypes to alleviate the influence of the outliers. Finally, the weighted Chi-square distance is applied to measure the similarity between writer and text. To verify the superiority of our proposed method, experiments are conducted on four public datasets and our built dataset. The results validate that CUE outperforms the state-of-the-art algorithms on mAP metric.
作者检索在文献取证和历史文献分析中是至关重要的。然而,由于汉语和其他语言在句法结构上的差异,现有的方法可能无法直接应用于汉语作者检索。以往的中文作者检索工作并没有克服当样本数量增加时性能下降的问题。本文提出了一种新的中文作者检索复合统一编码算法(CUE),该算法主要由组合特征提取模块(CFE)和原型替换模块(PS)组成。CFE模块结合了图像滤波器响应和特征轮廓两个互补的特征。它计算局部对称性和边共现对。PS模块用类原型代替离群值,减轻离群值的影响。最后,采用加权卡方距离来衡量作者与文本之间的相似度。为了验证我们提出的方法的优越性,我们在四个公共数据集和我们构建的数据集上进行了实验。结果验证了CUE在mAP度量上优于最先进的算法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信