Estimating the distribution of numerosity and non-numerical visual magnitudes in natural scenes using computer vision.

IF 2.2 3区 心理学 Q2 PSYCHOLOGY, EXPERIMENTAL
Kuinan Hou, Marco Zorzi, Alberto Testolin
{"title":"Estimating the distribution of numerosity and non-numerical visual magnitudes in natural scenes using computer vision.","authors":"Kuinan Hou, Marco Zorzi, Alberto Testolin","doi":"10.1007/s00426-024-02064-2","DOIUrl":null,"url":null,"abstract":"<p><p>Humans share with many animal species the ability to perceive and approximately represent the number of objects in visual scenes. This ability improves throughout childhood, suggesting that learning and development play a key role in shaping our number sense. This hypothesis is further supported by computational investigations based on deep learning, which have shown that numerosity perception can spontaneously emerge in neural networks that learn the statistical structure of images with a varying number of items. However, neural network models are usually trained using synthetic datasets that might not faithfully reflect the statistical structure of natural environments, and there is also growing interest in using more ecological visual stimuli to investigate numerosity perception in humans. In this work, we exploit recent advances in computer vision algorithms to design and implement an original pipeline that can be used to estimate the distribution of numerosity and non-numerical magnitudes in large-scale datasets containing thousands of real images depicting objects in daily life situations. We show that in natural visual scenes the frequency of appearance of different numerosities follows a power law distribution. Moreover, we show that the correlational structure for numerosity and continuous magnitudes is stable across datasets and scene types (homogeneous vs. heterogeneous object sets). We suggest that considering such \"ecological\" pattern of covariance is important to understand the influence of non-numerical visual cues on numerosity judgements.</p>","PeriodicalId":48184,"journal":{"name":"Psychological Research-Psychologische Forschung","volume":"89 1","pages":"31"},"PeriodicalIF":2.2000,"publicationDate":"2024-12-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Psychological Research-Psychologische Forschung","FirstCategoryId":"102","ListUrlMain":"https://doi.org/10.1007/s00426-024-02064-2","RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"PSYCHOLOGY, EXPERIMENTAL","Score":null,"Total":0}
引用次数: 0

Abstract

Humans share with many animal species the ability to perceive and approximately represent the number of objects in visual scenes. This ability improves throughout childhood, suggesting that learning and development play a key role in shaping our number sense. This hypothesis is further supported by computational investigations based on deep learning, which have shown that numerosity perception can spontaneously emerge in neural networks that learn the statistical structure of images with a varying number of items. However, neural network models are usually trained using synthetic datasets that might not faithfully reflect the statistical structure of natural environments, and there is also growing interest in using more ecological visual stimuli to investigate numerosity perception in humans. In this work, we exploit recent advances in computer vision algorithms to design and implement an original pipeline that can be used to estimate the distribution of numerosity and non-numerical magnitudes in large-scale datasets containing thousands of real images depicting objects in daily life situations. We show that in natural visual scenes the frequency of appearance of different numerosities follows a power law distribution. Moreover, we show that the correlational structure for numerosity and continuous magnitudes is stable across datasets and scene types (homogeneous vs. heterogeneous object sets). We suggest that considering such "ecological" pattern of covariance is important to understand the influence of non-numerical visual cues on numerosity judgements.

利用计算机视觉估计自然场景中数量级和非数量级的视觉星等分布。
人类和许多动物一样具有感知和近似表示视觉场景中物体数量的能力。这种能力在整个童年时期都在提高,这表明学习和发展在塑造我们的数字感方面起着关键作用。基于深度学习的计算研究进一步支持了这一假设,这些研究表明,在学习具有不同数量的图像的统计结构的神经网络中,数字感知可以自发地出现。然而,神经网络模型通常使用合成数据集进行训练,这些数据集可能不能忠实地反映自然环境的统计结构,并且人们对使用更多的生态视觉刺激来研究人类的数字感知也越来越感兴趣。在这项工作中,我们利用计算机视觉算法的最新进展来设计和实现一个原始管道,该管道可用于估计包含数千个描绘日常生活中物体的真实图像的大规模数据集中的数量级和非数值量级的分布。我们证明了在自然视觉场景中,不同数字出现的频率遵循幂律分布。此外,我们表明,在数据集和场景类型(同质与异构对象集)中,数量和连续数量级的相关结构是稳定的。我们认为,考虑这种“生态”的协方差模式对于理解非数字视觉线索对数量判断的影响是重要的。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
5.10
自引率
8.70%
发文量
137
期刊介绍: Psychological Research/Psychologische Forschung publishes articles that contribute to a basic understanding of human perception, attention, memory, and action. The Journal is devoted to the dissemination of knowledge based on firm experimental ground, but not to particular approaches or schools of thought. Theoretical and historical papers are welcome to the extent that they serve this general purpose; papers of an applied nature are acceptable if they contribute to basic understanding or serve to bridge the often felt gap between basic and applied research in the field covered by the Journal.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信