Visual Task Classification using Classic Machine Learning and CNNs

World Congress on Electrical Engineering and Computer Systems and Science Pub Date : 2022-07-01 DOI:10.11159/mhci22.110

Devangi Vilas Chinchankarame, Noha M. Elfiky, Nada Attar

引用次数: 0

Abstract

- Our eyes actively perform tasks including, but not limited to, searching, comparing, and counting. This includes tasks in front of a computer, whether it be trivial activities like reading email, or video gaming, or more serious activities like drone management, or flight simulation. Understanding what type of visual task is being performed is important to develop intelligent user interfaces. In this work, we investigated standard machine and deep learning methods to identify the task type using eye-tracking data - including both raw numerical data and the visual representations of the user gaze scan paths and pupil size. To this end, we experimented with computer vision algorithms such as Convolutional Neural Networks (CNNs) and compared the results to classic machine learning algorithms. We found that Machine learning-based methods performed with high accuracy classifying tasks that involve minimal visual search, while CNNs techniques do better in situations where visual search task is included.

查看原文本刊更多论文

基于经典机器学习和cnn的视觉任务分类

-我们的眼睛积极地执行任务，包括但不限于搜索、比较和计数。这包括在电脑前的任务，无论是琐碎的活动，如阅读电子邮件，或视频游戏，或更严肃的活动，如无人机管理，或飞行模拟。了解正在执行的视觉任务类型对于开发智能用户界面非常重要。在这项工作中，我们研究了使用眼动追踪数据识别任务类型的标准机器和深度学习方法——包括原始数值数据和用户凝视扫描路径和瞳孔大小的视觉表示。为此，我们对卷积神经网络(cnn)等计算机视觉算法进行了实验，并将结果与经典的机器学习算法进行了比较。我们发现，基于机器学习的方法在涉及最小视觉搜索的分类任务中表现出较高的准确率，而cnn技术在包含视觉搜索任务的情况下表现更好。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

World Congress on Electrical Engineering and Computer Systems and Science

自引率

0.00%

发文量