{"title":"SLYKLatent: A Learning Framework for Gaze Estimation Using Deep Facial Feature Learning","authors":"Samuel Adebayo;Joost C. Dessing;Seán McLoone","doi":"10.1109/THMS.2025.3553404","DOIUrl":null,"url":null,"abstract":"In this research, we present self-learn your key latent (SLYKLatent), a novel approach for enhancing gaze estimation by addressing appearance instability challenges in datasets due to aleatoric uncertainties, covariant shifts, and test domain generalization. SLYKLatent utilizes self-supervised learning for initial training with facial expression datasets, followed by refinement with a patch-based tribranch network and an inverse explained variance weighted training loss function. Our evaluation on benchmark datasets achieves a 10.98% improvement on Gaze360, supersedes the top result with 3.83% improvement on MPIIFaceGaze, and leads on a subset of ETH-XGaze by 11.59%, surpassing existing methods by significant margins. In addition, adaptability tests on RAF-DB and Affectnet show 86.4% and 60.9% accuracies, respectively. Ablation studies confirm the effectiveness of SLYKLatent's novel components. This approach has strong potential in human–robot interaction.","PeriodicalId":48916,"journal":{"name":"IEEE Transactions on Human-Machine Systems","volume":"55 3","pages":"333-346"},"PeriodicalIF":4.4000,"publicationDate":"2025-03-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Human-Machine Systems","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10982270/","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
In this research, we present self-learn your key latent (SLYKLatent), a novel approach for enhancing gaze estimation by addressing appearance instability challenges in datasets due to aleatoric uncertainties, covariant shifts, and test domain generalization. SLYKLatent utilizes self-supervised learning for initial training with facial expression datasets, followed by refinement with a patch-based tribranch network and an inverse explained variance weighted training loss function. Our evaluation on benchmark datasets achieves a 10.98% improvement on Gaze360, supersedes the top result with 3.83% improvement on MPIIFaceGaze, and leads on a subset of ETH-XGaze by 11.59%, surpassing existing methods by significant margins. In addition, adaptability tests on RAF-DB and Affectnet show 86.4% and 60.9% accuracies, respectively. Ablation studies confirm the effectiveness of SLYKLatent's novel components. This approach has strong potential in human–robot interaction.
期刊介绍:
The scope of the IEEE Transactions on Human-Machine Systems includes the fields of human machine systems. It covers human systems and human organizational interactions including cognitive ergonomics, system test and evaluation, and human information processing concerns in systems and organizations.