Predicting Visual Fixations.

IF 5 2区 医学 Q1 NEUROSCIENCES
Matthias Kümmerer, Matthias Bethge
{"title":"Predicting Visual Fixations.","authors":"Matthias Kümmerer,&nbsp;Matthias Bethge","doi":"10.1146/annurev-vision-120822-072528","DOIUrl":null,"url":null,"abstract":"<p><p>As we navigate and behave in the world, we are constantly deciding, a few times per second, where to look next. The outcomes of these decisions in response to visual input are comparatively easy to measure as trajectories of eye movements, offering insight into many unconscious and conscious visual and cognitive processes. In this article, we review recent advances in predicting where we look. We focus on evaluating and comparing models: How can we consistently measure how well models predict eye movements, and how can we judge the contribution of different mechanisms? Probabilistic models facilitate a unified approach to fixation prediction that allows us to use explainable information explained to compare different models across different settings, such as static and video saliency, as well as scanpath prediction. We review how the large variety of saliency maps and scanpath models can be translated into this unifying framework, how much different factors contribute, and how we can select the most informative examples for model comparison. We conclude that the universal scale of information gain offers a powerful tool for the inspection of candidate mechanisms and experimental design that helps us understand the continual decision-making process that determines where we look.</p>","PeriodicalId":48658,"journal":{"name":"Annual Review of Vision Science","volume":"9 ","pages":"269-291"},"PeriodicalIF":5.0000,"publicationDate":"2023-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Annual Review of Vision Science","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1146/annurev-vision-120822-072528","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2023/7/7 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"NEUROSCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

As we navigate and behave in the world, we are constantly deciding, a few times per second, where to look next. The outcomes of these decisions in response to visual input are comparatively easy to measure as trajectories of eye movements, offering insight into many unconscious and conscious visual and cognitive processes. In this article, we review recent advances in predicting where we look. We focus on evaluating and comparing models: How can we consistently measure how well models predict eye movements, and how can we judge the contribution of different mechanisms? Probabilistic models facilitate a unified approach to fixation prediction that allows us to use explainable information explained to compare different models across different settings, such as static and video saliency, as well as scanpath prediction. We review how the large variety of saliency maps and scanpath models can be translated into this unifying framework, how much different factors contribute, and how we can select the most informative examples for model comparison. We conclude that the universal scale of information gain offers a powerful tool for the inspection of candidate mechanisms and experimental design that helps us understand the continual decision-making process that determines where we look.

预测视觉修复。
当我们在这个世界上导航和行为时,我们会不断地决定,每秒几次,下一步该往哪里看。这些决定对视觉输入的反应结果相对容易测量为眼球运动的轨迹,从而深入了解许多无意识和有意识的视觉和认知过程。在这篇文章中,我们回顾了预测我们所处位置的最新进展。我们专注于评估和比较模型:我们如何始终如一地衡量模型预测眼球运动的效果,以及我们如何判断不同机制的贡献?概率模型促进了固定预测的统一方法,使我们能够使用解释的信息来比较不同设置下的不同模型,如静态和视频显著性,以及扫描路径预测。我们回顾了如何将各种显著性图和扫描路径模型转化为这个统一的框架,不同因素的贡献有多大,以及我们如何选择信息最丰富的例子进行模型比较。我们得出的结论是,信息获取的普遍规模为检查候选机制和实验设计提供了一个强大的工具,有助于我们理解决定我们目光的持续决策过程。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Annual Review of Vision Science
Annual Review of Vision Science Medicine-Ophthalmology
CiteScore
11.10
自引率
1.70%
发文量
19
期刊介绍: The Annual Review of Vision Science reviews progress in the visual sciences, a cross-cutting set of disciplines which intersect psychology, neuroscience, computer science, cell biology and genetics, and clinical medicine. The journal covers a broad range of topics and techniques, including optics, retina, central visual processing, visual perception, eye movements, visual development, vision models, computer vision, and the mechanisms of visual disease, dysfunction, and sight restoration. The study of vision is central to progress in many areas of science, and this new journal will explore and expose the connections that link it to biology, behavior, computation, engineering, and medicine.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信