Appearance based user-independent gaze estimation

Nanxiang Li
{"title":"Appearance based user-independent gaze estimation","authors":"Nanxiang Li","doi":"10.1145/2663204.2666288","DOIUrl":null,"url":null,"abstract":"An ideal gaze user interface should be able to accurately estimates the user's gaze direction in a non-intrusive setting. Most studies on gaze estimation focus on the accuracy of the estimation results, imposing important constraints on the user such as no head movement, intrusive head mount setting and repetitive calibration process. Due to these limitations, most graphic user interfaces (GUIs) are reluctant to include gaze as an input modality. We envision user-independent gaze detectors for user computer interaction that do not impose any constraints on the users. We believe the appearance of the eye pairs, which implicitly reveals head pose, provides conclusive information on the gaze direction. More importantly, the relative appearance changes in the eye pairs due to the different gaze direction should be consistent among different human subjects. We collected a multimodal corpus (MSP-GAZE) to study and evaluate user independent, appearance based gaze estimation approaches. This corpus considers important factors that affect the appearance based gaze estimation: the individual difference, the head movement, and the distance between the user and the interface's screen. Using this database, our initial study focused on the eye pair appearance eigenspace approach, where the projections into the eye appearance eigenspace basis are used to build regression models to estimate the gaze position. We compare the results between user dependent (training and testing on the same subject) and user independent (testing subject is not included in the training data) models. As expected, due to the individual differences between subjects, the performance decreases when the models are trained without data from the target user. The study aims to reduce the gap between user dependent and user independent conditions.","PeriodicalId":389037,"journal":{"name":"Proceedings of the 16th International Conference on Multimodal Interaction","volume":"80 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 16th International Conference on Multimodal Interaction","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2663204.2666288","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

An ideal gaze user interface should be able to accurately estimate the user's gaze direction in a non-intrusive setting. Most studies on gaze estimation focus on the accuracy of the estimation results, imposing significant constraints on the user, such as restricting head movement, requiring an intrusive head-mounted setup, and demanding a repetitive calibration process. Due to these limitations, most graphical user interfaces (GUIs) do not include gaze as an input modality. We envision user-independent gaze detectors for user-computer interaction that do not impose any constraints on the users. We believe the appearance of the eye pairs, which implicitly reveals head pose, provides conclusive information on the gaze direction. More importantly, the relative appearance changes in the eye pairs caused by different gaze directions should be consistent across human subjects. We collected a multimodal corpus (MSP-GAZE) to study and evaluate user-independent, appearance-based gaze estimation approaches. This corpus considers important factors that affect appearance-based gaze estimation: individual differences, head movement, and the distance between the user and the interface's screen. Using this database, our initial study focuses on an eye-pair appearance eigenspace approach, where the projections onto the eye appearance eigenspace basis are used to build regression models that estimate the gaze position. We compare the results between user-dependent models (training and testing on the same subject) and user-independent models (the test subject is not included in the training data). As expected, due to individual differences between subjects, performance decreases when the models are trained without data from the target user. The study aims to reduce the gap between the user-dependent and user-independent conditions.
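The abstract outlines a concrete pipeline: project eye-pair images onto an appearance eigenspace, regress the on-screen gaze position from the projections, and compare user-dependent against user-independent (leave-one-subject-out) training. A minimal sketch of that idea follows, assuming flattened eye-pair crops and using scikit-learn's PCA and ridge regression; the function names, the regressor choice, and the array layout are illustrative assumptions, not the paper's actual implementation.

```python
# Minimal sketch of the eigenspace-projection + regression idea from the
# abstract. All names, shapes, and the choice of scikit-learn components
# are assumptions for illustration, not the authors' implementation.
import numpy as np
from sklearn.decomposition import PCA
from sklearn.linear_model import Ridge

def train_gaze_model(eye_images, gaze_xy, n_components=50):
    """Fit an eigenspace basis on eye-pair crops and a regressor on the
    projections.

    eye_images : (n_samples, height*width) flattened eye-pair crops
    gaze_xy    : (n_samples, 2) on-screen gaze coordinates
    """
    pca = PCA(n_components=n_components)           # appearance eigenspace basis
    features = pca.fit_transform(eye_images)       # projections onto the basis
    reg = Ridge(alpha=1.0).fit(features, gaze_xy)  # regression to gaze position
    return pca, reg

def predict_gaze(pca, reg, eye_images):
    """Project new eye-pair crops and predict gaze coordinates."""
    return reg.predict(pca.transform(eye_images))

def leave_one_subject_out(eye_images, gaze_xy, subject_ids, test_subject):
    """User-independent condition: hold out every sample from one subject,
    train on the rest, and report mean Euclidean error on the held-out data."""
    train = subject_ids != test_subject
    pca, reg = train_gaze_model(eye_images[train], gaze_xy[train])
    pred = predict_gaze(pca, reg, eye_images[~train])
    return np.mean(np.linalg.norm(pred - gaze_xy[~train], axis=1))
```

Holding out every sample from the test subject, as in `leave_one_subject_out`, mirrors the user-independent condition; training and testing on the same subject's data gives the user-dependent baseline whose gap the study aims to reduce.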