Horizon estimation: perceptual and computational experiments

Christian Herdtweck, C. Wallraven
{"title":"Horizon estimation: perceptual and computational experiments","authors":"Christian Herdtweck, C. Wallraven","doi":"10.1145/1836248.1836257","DOIUrl":null,"url":null,"abstract":"The human visual system is able to quickly and robustly infer a wealth of scene information -- the scene \"gist\" - already after 100 milliseconds of image presentation. Here, we investigated the ability to estimate the position of the horizon in briefly shown images. Being able to judge the horizon position quickly and accurately will help in inferring viewer orientation and scene structure in general and thus might be an important factor of scene gist. In the first, perceptual study, we investigated participants' horizon estimates after a 150 millisecond, masked presentation of typical outdoor scenes from different scene categories. All images were shown in upright, blurred, inverted, and cropped conditions to investigate the influence of different information types on the perceptual decision. We found that despite individual variations, horizon estimates were fairly consistent across participants and conformed well to annotated data. In addition, inversion resulted in significant differences in performance, whereas blurring did not yield any different results, highlighting the importance of global, low-frequency information for making judgments about horizon position. In the second, computational experiment, we then correlated the performance of several algorithms for horizon estimation with the human data -- algorithms ranged from simple estimations of bright-dark-transitions to more sophisticated frequency spectrum analyses motivated by previous computational modeling of scene classification results. Surprisingly, the best fits to human data were obtained with one very simple gradient method and the most complex, trained method. Overall, global frequency spectrum analysis provided the best fit to human estimates, which together with the perceptual data suggests that the human visual system might use similar mechanisms to quickly judge horizon position as part of the scene gist.","PeriodicalId":89458,"journal":{"name":"Proceedings APGV : ... Symposium on Applied Perception in Graphics and Visualization. Symposium on Applied Perception in Graphics and Visualization","volume":"27 1","pages":"49-56"},"PeriodicalIF":0.0000,"publicationDate":"2010-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings APGV : ... Symposium on Applied Perception in Graphics and Visualization. Symposium on Applied Perception in Graphics and Visualization","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/1836248.1836257","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 7

Abstract

The human visual system is able to quickly and robustly infer a wealth of scene information -- the scene "gist" - already after 100 milliseconds of image presentation. Here, we investigated the ability to estimate the position of the horizon in briefly shown images. Being able to judge the horizon position quickly and accurately will help in inferring viewer orientation and scene structure in general and thus might be an important factor of scene gist. In the first, perceptual study, we investigated participants' horizon estimates after a 150 millisecond, masked presentation of typical outdoor scenes from different scene categories. All images were shown in upright, blurred, inverted, and cropped conditions to investigate the influence of different information types on the perceptual decision. We found that despite individual variations, horizon estimates were fairly consistent across participants and conformed well to annotated data. In addition, inversion resulted in significant differences in performance, whereas blurring did not yield any different results, highlighting the importance of global, low-frequency information for making judgments about horizon position. In the second, computational experiment, we then correlated the performance of several algorithms for horizon estimation with the human data -- algorithms ranged from simple estimations of bright-dark-transitions to more sophisticated frequency spectrum analyses motivated by previous computational modeling of scene classification results. Surprisingly, the best fits to human data were obtained with one very simple gradient method and the most complex, trained method. Overall, global frequency spectrum analysis provided the best fit to human estimates, which together with the perceptual data suggests that the human visual system might use similar mechanisms to quickly judge horizon position as part of the scene gist.
视界估计:知觉与计算实验
人类视觉系统能够在100毫秒的图像呈现后快速而强大地推断出丰富的场景信息——场景“要点”。在这里,我们研究了在简要显示的图像中估计地平线位置的能力。能够快速准确地判断地平线位置将有助于推断观看者的方向和场景结构,因此可能是场景要点的重要因素。在第一项感知研究中,我们调查了参与者在不同场景类别的典型户外场景被蒙面呈现150毫秒后的视界估计。所有的图像显示在直立,模糊,倒置和裁剪的条件下,以调查不同的信息类型对感知决策的影响。我们发现,尽管存在个体差异,但参与者的视界估计值相当一致,并且与注释数据很好地吻合。此外,反演会导致性能上的显著差异,而模糊则不会产生任何不同的结果,这突出了全局低频信息对于判断地平线位置的重要性。在第二个计算实验中,我们将几种地平线估计算法的性能与人类数据相关联,这些算法从简单的明暗转换估计到更复杂的频谱分析,这些分析是由之前的场景分类结果的计算建模驱动的。令人惊讶的是,最适合人类数据的是一种非常简单的梯度方法和最复杂的训练方法。总体而言,全球频谱分析提供了最适合人类估计的结果,这与感知数据一起表明,人类视觉系统可能使用类似的机制来快速判断地平线位置,作为场景要点的一部分。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信