What is the doggest dog? Examination of typicality perception in ImageNet-trained networks

IF 6 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Katarzyna Filus, Joanna Domańska
{"title":"What is the doggest dog? Examination of typicality perception in ImageNet-trained networks","authors":"Katarzyna Filus,&nbsp;Joanna Domańska","doi":"10.1016/j.neunet.2025.107425","DOIUrl":null,"url":null,"abstract":"<div><div>Due to the emergence of numerous model architectures in recent years, researchers finally have access to models that are diverse enough to properly study them from the perspective of cognitive psychology theories, e.g. Prototype Theory. The theory assumes that the degree of membership in a basic-level category is graded. As a result, some concepts are perceived as more central (typical) than others. The most typical category is called a prototype. It can be perceived as the clearest example of a category, reflecting the redundancy structure of the category as a whole. Its inverse is called an anti-prototype. Reasonable perception of prototypes and anti-prototypes is important for accurate projection of the world structure onto the class space and more human-like world perception beyond simple memorization. That is why it is beneficial to study deep models from the perspective of prototype theory. To enable it, we propose 3 methods that return the prototypes and anti-prototypes perceived by deep networks for a specific basic-level category. Additionally, one of our methods allows to visualize the centrality of objects. The results on a wide range of 42 networks trained on ImageNet (Convolutional Networks, Vision Transformers, ConvNeXts and hybrid models) reveal that the networks share the typicality perception to a large extent and that this perception does not lie so far from the human one. We release the dataset with per-network prototypes and anti-prototypes resulting from our work to enable further research on this topic.</div></div>","PeriodicalId":49763,"journal":{"name":"Neural Networks","volume":"188 ","pages":"Article 107425"},"PeriodicalIF":6.0000,"publicationDate":"2025-04-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neural Networks","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0893608025003041","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

Due to the emergence of numerous model architectures in recent years, researchers finally have access to models that are diverse enough to properly study them from the perspective of cognitive psychology theories, e.g. Prototype Theory. The theory assumes that the degree of membership in a basic-level category is graded. As a result, some concepts are perceived as more central (typical) than others. The most typical category is called a prototype. It can be perceived as the clearest example of a category, reflecting the redundancy structure of the category as a whole. Its inverse is called an anti-prototype. Reasonable perception of prototypes and anti-prototypes is important for accurate projection of the world structure onto the class space and more human-like world perception beyond simple memorization. That is why it is beneficial to study deep models from the perspective of prototype theory. To enable it, we propose 3 methods that return the prototypes and anti-prototypes perceived by deep networks for a specific basic-level category. Additionally, one of our methods allows to visualize the centrality of objects. The results on a wide range of 42 networks trained on ImageNet (Convolutional Networks, Vision Transformers, ConvNeXts and hybrid models) reveal that the networks share the typicality perception to a large extent and that this perception does not lie so far from the human one. We release the dataset with per-network prototypes and anti-prototypes resulting from our work to enable further research on this topic.
什么狗是最大的狗?imagenet训练的网络中典型感知的检验
由于近年来出现了大量的模型架构,研究人员终于有了足够多样化的模型,可以从认知心理学理论的角度来适当地研究它们,例如原型理论。该理论假定在基本级别类别中的隶属度是分级的。因此,一些概念被认为比其他概念更核心(典型)。最典型的一类被称为原型。它可以被看作是一个类别最清晰的例子,反映了整个类别的冗余结构。它的逆称为反原型。对于原型和反原型的合理感知,对于将世界结构准确地投射到类空间上,超越简单的记忆,获得更像人的世界感知是非常重要的。这就是为什么从原型理论的角度研究深层模型是有益的。为了实现它,我们提出了3种方法,这些方法返回深度网络对特定基本级别类别感知的原型和反原型。此外,我们的一种方法允许可视化对象的中心。在ImageNet上训练的42个网络(卷积网络,视觉变形,ConvNeXts和混合模型)的结果表明,这些网络在很大程度上共享典型感知,并且这种感知与人类的感知相差不大。我们发布了基于网络原型和反原型的数据集,以进一步研究这一主题。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Neural Networks
Neural Networks 工程技术-计算机:人工智能
CiteScore
13.90
自引率
7.70%
发文量
425
审稿时长
67 days
期刊介绍: Neural Networks is a platform that aims to foster an international community of scholars and practitioners interested in neural networks, deep learning, and other approaches to artificial intelligence and machine learning. Our journal invites submissions covering various aspects of neural networks research, from computational neuroscience and cognitive modeling to mathematical analyses and engineering applications. By providing a forum for interdisciplinary discussions between biology and technology, we aim to encourage the development of biologically-inspired artificial intelligence.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信