Test-Time Adaptation With Self-Supervised Learning for Gaze Estimation

IF 4.3 2区 计算机科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC
Pengwei Yin;Jingjing Wang;Xiaojun Wu
{"title":"Test-Time Adaptation With Self-Supervised Learning for Gaze Estimation","authors":"Pengwei Yin;Jingjing Wang;Xiaojun Wu","doi":"10.1109/TCE.2024.3523486","DOIUrl":null,"url":null,"abstract":"Gaze estimation plays a significant role in consumer electronics, particularly in the realm of user interface and interactive technology. While existing methods rely on either few-shot adaptation requiring annotated samples or unsupervised domain adaptation necessitating source domain data, these approaches face limitations due to the high cost of annotation and data privacy concerns. This paper addresses this critical gap by introducing a novel test-time adaptation framework for gaze estimation that operates without the need for source domain data or annotated samples for adaptation. Here, we present a dual-objective training strategy that combines supervised and self-supervised learning on the source domain, with a particular focus on a face and eye reconstruction task designed to enhance the learning of head pose and eye direction features crucial for gaze estimation. At test time, our model undergoes adaptation solely through fine-tuning with the self-supervised objective, optimizing the model’s ability to estimate gaze in new, unseen scenarios. Our extensive experiments on benchmarks validate the effectiveness of our approach, demonstrating improved generalization capabilities without the dependency on expensive annotations or sensitive source domain data.","PeriodicalId":13208,"journal":{"name":"IEEE Transactions on Consumer Electronics","volume":"71 1","pages":"75-89"},"PeriodicalIF":4.3000,"publicationDate":"2024-12-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Consumer Electronics","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10817535/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

Abstract

Gaze estimation plays a significant role in consumer electronics, particularly in the realm of user interface and interactive technology. While existing methods rely on either few-shot adaptation requiring annotated samples or unsupervised domain adaptation necessitating source domain data, these approaches face limitations due to the high cost of annotation and data privacy concerns. This paper addresses this critical gap by introducing a novel test-time adaptation framework for gaze estimation that operates without the need for source domain data or annotated samples for adaptation. Here, we present a dual-objective training strategy that combines supervised and self-supervised learning on the source domain, with a particular focus on a face and eye reconstruction task designed to enhance the learning of head pose and eye direction features crucial for gaze estimation. At test time, our model undergoes adaptation solely through fine-tuning with the self-supervised objective, optimizing the model’s ability to estimate gaze in new, unseen scenarios. Our extensive experiments on benchmarks validate the effectiveness of our approach, demonstrating improved generalization capabilities without the dependency on expensive annotations or sensitive source domain data.
基于自监督学习的测试时间自适应注视估计
注视估计在消费类电子产品中扮演着重要的角色,特别是在用户界面和交互技术领域。虽然现有方法依赖于需要注释样本的少量自适应或需要源域数据的无监督域自适应,但由于注释成本高和数据隐私问题,这些方法面临局限性。本文通过引入一种新的测试时间自适应框架来解决这一关键问题,该框架不需要源域数据或注释样本进行自适应。在这里,我们提出了一种双目标训练策略,该策略结合了源域上的监督学习和自监督学习,特别关注面部和眼睛重建任务,旨在增强对凝视估计至关重要的头部姿势和眼睛方向特征的学习。在测试时,我们的模型仅通过与自监督目标的微调来适应,优化模型在新的、未见过的场景中估计凝视的能力。我们在基准测试上的大量实验验证了我们方法的有效性,展示了改进的泛化能力,而不依赖于昂贵的注释或敏感的源域数据。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
7.70
自引率
9.30%
发文量
59
审稿时长
3.3 months
期刊介绍: The main focus for the IEEE Transactions on Consumer Electronics is the engineering and research aspects of the theory, design, construction, manufacture or end use of mass market electronics, systems, software and services for consumers.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信