{"title":"What is the doggest dog? Examination of typicality perception in ImageNet-trained networks","authors":"Katarzyna Filus, Joanna Domańska","doi":"10.1016/j.neunet.2025.107425","DOIUrl":null,"url":null,"abstract":"<div><div>Due to the emergence of numerous model architectures in recent years, researchers finally have access to models that are diverse enough to properly study them from the perspective of cognitive psychology theories, e.g. Prototype Theory. The theory assumes that the degree of membership in a basic-level category is graded. As a result, some concepts are perceived as more central (typical) than others. The most typical category is called a prototype. It can be perceived as the clearest example of a category, reflecting the redundancy structure of the category as a whole. Its inverse is called an anti-prototype. Reasonable perception of prototypes and anti-prototypes is important for accurate projection of the world structure onto the class space and more human-like world perception beyond simple memorization. That is why it is beneficial to study deep models from the perspective of prototype theory. To enable it, we propose 3 methods that return the prototypes and anti-prototypes perceived by deep networks for a specific basic-level category. Additionally, one of our methods allows to visualize the centrality of objects. The results on a wide range of 42 networks trained on ImageNet (Convolutional Networks, Vision Transformers, ConvNeXts and hybrid models) reveal that the networks share the typicality perception to a large extent and that this perception does not lie so far from the human one. We release the dataset with per-network prototypes and anti-prototypes resulting from our work to enable further research on this topic.</div></div>","PeriodicalId":49763,"journal":{"name":"Neural Networks","volume":"188 ","pages":"Article 107425"},"PeriodicalIF":6.0000,"publicationDate":"2025-04-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neural Networks","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0893608025003041","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Due to the emergence of numerous model architectures in recent years, researchers finally have access to models that are diverse enough to properly study them from the perspective of cognitive psychology theories, e.g. Prototype Theory. The theory assumes that the degree of membership in a basic-level category is graded. As a result, some concepts are perceived as more central (typical) than others. The most typical category is called a prototype. It can be perceived as the clearest example of a category, reflecting the redundancy structure of the category as a whole. Its inverse is called an anti-prototype. Reasonable perception of prototypes and anti-prototypes is important for accurate projection of the world structure onto the class space and more human-like world perception beyond simple memorization. That is why it is beneficial to study deep models from the perspective of prototype theory. To enable it, we propose 3 methods that return the prototypes and anti-prototypes perceived by deep networks for a specific basic-level category. Additionally, one of our methods allows to visualize the centrality of objects. The results on a wide range of 42 networks trained on ImageNet (Convolutional Networks, Vision Transformers, ConvNeXts and hybrid models) reveal that the networks share the typicality perception to a large extent and that this perception does not lie so far from the human one. We release the dataset with per-network prototypes and anti-prototypes resulting from our work to enable further research on this topic.
期刊介绍:
Neural Networks is a platform that aims to foster an international community of scholars and practitioners interested in neural networks, deep learning, and other approaches to artificial intelligence and machine learning. Our journal invites submissions covering various aspects of neural networks research, from computational neuroscience and cognitive modeling to mathematical analyses and engineering applications. By providing a forum for interdisciplinary discussions between biology and technology, we aim to encourage the development of biologically-inspired artificial intelligence.