Representation Learning for Tablet and Paper Domain Adaptation in Favor of Online Handwriting Recognition

Felix Ott, David Rügamer, Lucas Heublein, Bernd Bischl, Christopher Mutschler
{"title":"Representation Learning for Tablet and Paper Domain Adaptation in Favor of Online Handwriting Recognition","authors":"Felix Ott, David Rügamer, Lucas Heublein, Bernd Bischl, Christopher Mutschler","doi":"10.48550/arXiv.2301.06293","DOIUrl":null,"url":null,"abstract":"The performance of a machine learning model degrades when it is applied to data from a similar but different domain than the data it has initially been trained on. The goal of domain adaptation (DA) is to mitigate this domain shift problem by searching for an optimal feature transformation to learn a domain-invariant representation. Such a domain shift can appear in handwriting recognition (HWR) applications where the motion pattern of the hand and with that the motion pattern of the pen is different for writing on paper and on tablet. This becomes visible in the sensor data for online handwriting (OnHW) from pens with integrated inertial measurement units. This paper proposes a supervised DA approach to enhance learning for OnHW recognition between tablet and paper data. Our method exploits loss functions such as maximum mean discrepancy and correlation alignment to learn a domain-invariant feature representation (i.e., similar covariances between tablet and paper features). We use a triplet loss that takes negative samples of the auxiliary domain (i.e., paper samples) to increase the amount of samples of the tablet dataset. We conduct an evaluation on novel sequence-based OnHW datasets (i.e., words) and show an improvement on the paper domain with an early fusion strategy by using pairwise learning.","PeriodicalId":391161,"journal":{"name":"ICPR Workshops","volume":"48 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-01-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ICPR Workshops","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.48550/arXiv.2301.06293","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

The performance of a machine learning model degrades when it is applied to data from a similar but different domain than the data it has initially been trained on. The goal of domain adaptation (DA) is to mitigate this domain shift problem by searching for an optimal feature transformation to learn a domain-invariant representation. Such a domain shift can appear in handwriting recognition (HWR) applications where the motion pattern of the hand and with that the motion pattern of the pen is different for writing on paper and on tablet. This becomes visible in the sensor data for online handwriting (OnHW) from pens with integrated inertial measurement units. This paper proposes a supervised DA approach to enhance learning for OnHW recognition between tablet and paper data. Our method exploits loss functions such as maximum mean discrepancy and correlation alignment to learn a domain-invariant feature representation (i.e., similar covariances between tablet and paper features). We use a triplet loss that takes negative samples of the auxiliary domain (i.e., paper samples) to increase the amount of samples of the tablet dataset. We conduct an evaluation on novel sequence-based OnHW datasets (i.e., words) and show an improvement on the paper domain with an early fusion strategy by using pairwise learning.
面向平板和纸张领域适应的表示学习支持在线手写识别
当机器学习模型应用于与最初训练的数据相似但不同的领域时,它的性能会下降。领域自适应(DA)的目标是通过搜索最优特征变换来学习领域不变表示来缓解这种领域转移问题。这种域移位可能出现在手写识别(HWR)应用程序中,其中手的运动模式和笔的运动模式在纸上和平板电脑上书写时是不同的。这在带有集成惯性测量单元的笔的在线手写(OnHW)传感器数据中是可见的。本文提出了一种有监督的数据处理方法来增强平板电脑和纸张数据之间OnHW识别的学习。我们的方法利用损失函数,如最大平均差异和相关对齐来学习域不变特征表示(即平板电脑和纸张特征之间的相似协方差)。我们使用三联体损失,取辅助域的负样本(即纸质样本)来增加片剂数据集的样本数量。我们对新的基于序列的OnHW数据集(即单词)进行了评估,并通过使用成对学习的早期融合策略展示了对纸张领域的改进。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信