Eye Tracking and Emotion Recognition Using Multiple Spatial-Temporal Networks
Eprian Junan Setianto, Esmeralda Contessa Djamal, Fikri Nugraha, Fatan Kasyidi
2022 International Conference on Data Science and Its Applications (ICoDSA), 2022-07-06
DOI: 10.1109/ICoDSA55874.2022.9862881 (https://doi.org/10.1109/ICoDSA55874.2022.9862881)
Citations: 0
Abstract
E-commerce products need to be evaluated more objectively through viewer responses, for example by identifying emotional expressions or by eye tracking. Capturing these two variables from video provides a more thorough evaluation of interest and emotional response. This study proposes a spatial-temporal multi-network method that combines Convolutional Neural Networks (CNN) and Recurrent Neural Networks (RNN) on 60-second videos. The results showed that two classes of emotional expression with four eye-tracking directions gave better accuracy (95.83%) than three emotion classes with four eye-tracking directions (91.67%). Experiments also showed that using CNN-LSTM significantly increased accuracy, while the weight-correction technique had little effect. The evaluated F1 scores confirm the consistency of the proposed model.
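To make the spatial-temporal idea concrete, the sketch below shows one way a CNN-LSTM video classifier of this kind can be wired up in PyTorch: a CNN extracts spatial features from each frame, and an LSTM models the sequence of frame features over time. This is a minimal illustration under assumed settings (30 sampled frames, 64x64 RGB input, the layer sizes shown), not the authors' actual architecture; the paper's multi-network design would use separate branches like this for emotion expression and eye-tracking direction.

```python
import torch
import torch.nn as nn


class CNNLSTM(nn.Module):
    """Per-frame CNN features fed to an LSTM over time (spatial-temporal)."""

    def __init__(self, num_classes: int, hidden_size: int = 128):
        super().__init__()
        # Spatial feature extractor, applied independently to every frame.
        # Layer sizes here are illustrative assumptions, not from the paper.
        self.cnn = nn.Sequential(
            nn.Conv2d(3, 16, kernel_size=3, padding=1), nn.ReLU(),
            nn.MaxPool2d(2),
            nn.Conv2d(16, 32, kernel_size=3, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d((4, 4)),
            nn.Flatten(),  # -> 32 * 4 * 4 = 512 features per frame
        )
        # Temporal model over the sequence of per-frame features.
        self.lstm = nn.LSTM(input_size=512, hidden_size=hidden_size,
                            batch_first=True)
        self.head = nn.Linear(hidden_size, num_classes)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, time, channels, height, width)
        b, t, c, h, w = x.shape
        # Fold time into the batch dimension so the CNN sees single frames.
        feats = self.cnn(x.view(b * t, c, h, w)).view(b, t, -1)
        _, (h_n, _) = self.lstm(feats)  # last hidden state summarizes the clip
        return self.head(h_n[-1])       # class logits


# Usage: two emotion classes, a clip of 30 sampled 64x64 RGB frames.
model = CNNLSTM(num_classes=2)
clip = torch.randn(1, 30, 3, 64, 64)
logits = model(clip)
print(logits.shape)  # torch.Size([1, 2])
```

A separate branch with `num_classes=4` would cover the four eye-tracking directions; training either branch reduces to standard sequence classification with cross-entropy loss.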