Automatic scanpath generation with deep recurrent neural networks

Daniel Simon, S. Sridharan, Shagan Sah, R. Ptucha, Christopher Kanan, Reynold J. Bailey

Proceedings of the ACM Symposium on Applied Perception, July 22, 2016. DOI: 10.1145/2931002.2948726

Citations: 6
Abstract
Many computer vision algorithms are biologically inspired and designed based on the human visual system. Convolutional neural networks (CNNs) are similarly inspired by the primary visual cortex in the human brain. However, a key difference between current visual models and the human visual system is how visual information is gathered and processed. We make eye movements to collect information from the environment for navigation and task performance. We also make specific eye movements to important regions in the stimulus to perform the task at hand quickly and efficiently. Researchers have used expert scanpaths to train novices to improve the accuracy of visual search tasks. One limitation of such a system is that an expert must examine each visual stimulus beforehand to generate the scanpaths. In order to extend the idea of gaze guidance to a new, unseen stimulus, there is a need for a computational model that can automatically generate expert-like scanpaths. We propose a model for automatic scanpath generation using a convolutional neural network (CNN) and long short-term memory (LSTM) modules. Our model uses LSTMs due to the temporal nature of eye movement data (scanpaths): the system predicts each fixation based on the previously examined locations.
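The abstract does not specify the network's exact layers or how fixations are encoded, so the following is only a minimal sketch of the general CNN + LSTM idea it describes: a CNN encodes the stimulus image into a feature vector, and an LSTM predicts the next fixation from those features together with the previous fixation location. The layer sizes, the small stand-in CNN backbone, and the choice of normalized (x, y) coordinates as the fixation representation are assumptions, not details from the paper.

```python
import torch
import torch.nn as nn

class ScanpathPredictor(nn.Module):
    """Sketch of a CNN + LSTM scanpath generator (assumed architecture)."""

    def __init__(self, feat_dim=256, hidden_dim=512):
        super().__init__()
        # Small CNN feature extractor (a stand-in for a pretrained backbone).
        self.cnn = nn.Sequential(
            nn.Conv2d(3, 32, kernel_size=5, stride=2, padding=2), nn.ReLU(),
            nn.Conv2d(32, 64, kernel_size=3, stride=2, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
            nn.Linear(64, feat_dim), nn.ReLU(),
        )
        # The LSTM consumes [image features ; previous fixation] at each step.
        self.lstm = nn.LSTM(feat_dim + 2, hidden_dim, batch_first=True)
        self.head = nn.Linear(hidden_dim, 2)  # next fixation as normalized (x, y)

    def forward(self, image, prev_fixations):
        # image: (B, 3, H, W); prev_fixations: (B, T, 2) with values in [0, 1]
        feats = self.cnn(image)                                    # (B, feat_dim)
        feats = feats.unsqueeze(1).expand(-1, prev_fixations.size(1), -1)
        x = torch.cat([feats, prev_fixations], dim=-1)             # (B, T, feat_dim + 2)
        out, _ = self.lstm(x)
        return torch.sigmoid(self.head(out))                       # (B, T, 2) predicted fixations

# Example: predict the next 8 fixations for a single 224x224 image,
# conditioned on 8 previous (randomly generated) fixation locations.
model = ScanpathPredictor()
img = torch.randn(1, 3, 224, 224)
prev = torch.rand(1, 8, 2)
next_fix = model(img, prev)  # shape (1, 8, 2)
```

The recurrent state is what lets each predicted fixation depend on the locations examined before it, which is the temporal property the abstract cites as the reason for using LSTMs.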