Handwriting Style Mixture Adaptation

2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR) Pub Date : 2017-11-01 DOI:10.1109/ICDAR.2017.166

Hong-Ming Yang, Xu-Yao Zhang, Fei Yin, Cheng-Lin Liu

{"title":"Handwriting Style Mixture Adaptation","authors":"Hong-Ming Yang, Xu-Yao Zhang, Fei Yin, Cheng-Lin Liu","doi":"10.1109/ICDAR.2017.166","DOIUrl":null,"url":null,"abstract":"In handwriting recognition, the test data usually come from multiple writers which are not shown in the training data. Therefore, adapting the base classifier towards the new style of each writer can significantly improve the generalization performance. Traditional writer adaptation methods usually assume that there is only one writer (one style) in the test data, and we call this situation as style-clear adaptation. However, a more common situation is that multiple handwriting styles exist in the test data, which is widely appeared in multi-font documents and handwriting data produced by the cooperation of multiple writers. We call the adaptation in this situation as style-mixture adaptation. To deal with this problem, in this paper, we propose a novel method called K-style mixture adaptation (K-SMA) with the assumption that there are totally K styles in the test data. Specifically, we first partition the test data into K groups (style clustering) according to their style consistency, which is measured by a newly designed style feature that can eliminate class (category) information and keep handwriting style information. After that, in each group, a style transfer mapping (STM) is used for writer adaptation. Since the initial style clustering may be not reliable, we repeat this process iteratively to improve the adaptation performance. The K-SMA model is fully unsupervised which do not require either the class label or the style index. Moreover, the K-SMA model can be effectively combined with the benchmark convolutional neural network (CNN) models. Experiments on the online Chinese handwriting database CASIA-OLHWDB demonstrate that K-SMA is an efficient and effective solution for style-mixture adaptation.","PeriodicalId":433676,"journal":{"name":"2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDAR.2017.166","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

In handwriting recognition, the test data usually come from multiple writers which are not shown in the training data. Therefore, adapting the base classifier towards the new style of each writer can significantly improve the generalization performance. Traditional writer adaptation methods usually assume that there is only one writer (one style) in the test data, and we call this situation as style-clear adaptation. However, a more common situation is that multiple handwriting styles exist in the test data, which is widely appeared in multi-font documents and handwriting data produced by the cooperation of multiple writers. We call the adaptation in this situation as style-mixture adaptation. To deal with this problem, in this paper, we propose a novel method called K-style mixture adaptation (K-SMA) with the assumption that there are totally K styles in the test data. Specifically, we first partition the test data into K groups (style clustering) according to their style consistency, which is measured by a newly designed style feature that can eliminate class (category) information and keep handwriting style information. After that, in each group, a style transfer mapping (STM) is used for writer adaptation. Since the initial style clustering may be not reliable, we repeat this process iteratively to improve the adaptation performance. The K-SMA model is fully unsupervised which do not require either the class label or the style index. Moreover, the K-SMA model can be effectively combined with the benchmark convolutional neural network (CNN) models. Experiments on the online Chinese handwriting database CASIA-OLHWDB demonstrate that K-SMA is an efficient and effective solution for style-mixture adaptation.

查看原文本刊更多论文

笔迹风格混合适应

在手写识别中，测试数据通常来自多个书写者，这些书写者不会在训练数据中显示出来。因此，根据每个编写者的新风格调整基分类器可以显著提高泛化性能。传统的作者适应方法通常假设测试数据中只有一个作者(一种风格)，我们称这种情况为风格清晰的适应。然而，更常见的情况是测试数据中存在多种笔迹样式，这种情况广泛出现在多字体文档和多个写作者合作产生的笔迹数据中。我们把这种适应称为风格混合适应。为了解决这一问题，在本文中，我们提出了一种新的方法，称为K-style混合自适应(K- sma)，假设测试数据中总共有K种风格。具体而言，我们首先根据风格一致性将测试数据划分为K组(风格聚类)，并通过新设计的风格特征来衡量，该特征可以消除类(类别)信息并保留手写风格信息。然后，在每组中，使用风格迁移映射(STM)进行作者改编。由于初始的风格聚类可能不可靠，我们迭代地重复这一过程以提高自适应性能。K-SMA模型是完全无监督的，既不需要类标签也不需要样式索引。此外，K-SMA模型可以有效地与基准卷积神经网络(CNN)模型相结合。在在线中文手写体数据库CASIA-OLHWDB上的实验表明，K-SMA是一种高效的混合风格自适应方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR)

自引率

0.00%

发文量