Are Deep Neural Networks Adequate Behavioral Models of Human Visual Perception?

IF 5 2区医学 Q1 NEUROSCIENCES

Annual Review of Vision Science Pub Date : 2023-09-15 Epub Date: 2023-03-31 DOI:10.1146/annurev-vision-120522-031739

Felix A Wichmann, Robert Geirhos

{"title":"Are Deep Neural Networks Adequate Behavioral Models of Human Visual Perception?","authors":"Felix A Wichmann, Robert Geirhos","doi":"10.1146/annurev-vision-120522-031739","DOIUrl":null,"url":null,"abstract":"<p><p>Deep neural networks (DNNs) are machine learning algorithms that have revolutionized computer vision due to their remarkable successes in tasks like object classification and segmentation. The success of DNNs as computer vision algorithms has led to the suggestion that DNNs may also be good models of human visual perception. In this article, we review evidence regarding current DNNs as adequate behavioral models of human core object recognition. To this end, we argue that it is important to distinguish between statistical tools and computational models and to understand model quality as a multidimensional concept in which clarity about modeling goals is key. Reviewing a large number of psychophysical and computational explorations of core object recognition performance in humans and DNNs, we argue that DNNs are highly valuable scientific tools but that, as of today, DNNs should only be regarded as promising-but not yet adequate-computational models of human core object recognition behavior. On the way, we dispel several myths surrounding DNNs in vision science.</p>","PeriodicalId":48658,"journal":{"name":"Annual Review of Vision Science","volume":"9 ","pages":"501-524"},"PeriodicalIF":5.0000,"publicationDate":"2023-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Annual Review of Vision Science","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1146/annurev-vision-120522-031739","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2023/3/31 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"NEUROSCIENCES","Score":null,"Total":0}

引用次数: 6

Abstract

Deep neural networks (DNNs) are machine learning algorithms that have revolutionized computer vision due to their remarkable successes in tasks like object classification and segmentation. The success of DNNs as computer vision algorithms has led to the suggestion that DNNs may also be good models of human visual perception. In this article, we review evidence regarding current DNNs as adequate behavioral models of human core object recognition. To this end, we argue that it is important to distinguish between statistical tools and computational models and to understand model quality as a multidimensional concept in which clarity about modeling goals is key. Reviewing a large number of psychophysical and computational explorations of core object recognition performance in humans and DNNs, we argue that DNNs are highly valuable scientific tools but that, as of today, DNNs should only be regarded as promising-but not yet adequate-computational models of human core object recognition behavior. On the way, we dispel several myths surrounding DNNs in vision science.

查看原文本刊更多论文

深度神经网络是否适合人类视觉感知的行为模型？

深度神经网络（DNN）是一种机器学习算法，由于其在对象分类和分割等任务中的显著成功，它已经彻底改变了计算机视觉。DNN作为计算机视觉算法的成功表明，DNN也可能是人类视觉感知的良好模型。在这篇文章中，我们回顾了关于当前DNN作为人类核心对象识别的适当行为模型的证据。为此，我们认为，重要的是要区分统计工具和计算模型，并将模型质量理解为一个多维概念，其中建模目标的明确性是关键。回顾了大量关于人类核心对象识别性能和DNN的心理物理学和计算探索，我们认为DNN是非常有价值的科学工具，但到目前为止，DNN只能被视为人类核心对象辨识行为的有前途但还不够充分的计算模型。在这一过程中，我们消除了视觉科学中围绕DNN的几个神话。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Annual Review of Vision Science Medicine-Ophthalmology

CiteScore

11.10

自引率

1.70%

发文量

期刊介绍： The Annual Review of Vision Science reviews progress in the visual sciences, a cross-cutting set of disciplines which intersect psychology, neuroscience, computer science, cell biology and genetics, and clinical medicine. The journal covers a broad range of topics and techniques, including optics, retina, central visual processing, visual perception, eye movements, visual development, vision models, computer vision, and the mechanisms of visual disease, dysfunction, and sight restoration. The study of vision is central to progress in many areas of science, and this new journal will explore and expose the connections that link it to biology, behavior, computation, engineering, and medicine.