Harmonizing the object recognition strategies of deep neural networks with humans

Thomas Fel, Ivan Felipe, Drew A. Linsley, Thomas Serre
{"title":"Harmonizing the object recognition strategies of deep neural networks with humans","authors":"Thomas Fel, Ivan Felipe, Drew A. Linsley, Thomas Serre","doi":"10.48550/arXiv.2211.04533","DOIUrl":null,"url":null,"abstract":"The many successes of deep neural networks (DNNs) over the past decade have largely been driven by computational scale rather than insights from biological intelligence. Here, we explore if these trends have also carried concomitant improvements in explaining the visual strategies humans rely on for object recognition. We do this by comparing two related but distinct properties of visual strategies in humans and DNNs: where they believe important visual features are in images and how they use those features to categorize objects. Across 84 different DNNs trained on ImageNet and three independent datasets measuring the where and the how of human visual strategies for object recognition on those images, we find a systematic trade-off between DNN categorization accuracy and alignment with human visual strategies for object recognition. State-of-the-art DNNs are progressively becoming less aligned with humans as their accuracy improves. We rectify this growing issue with our neural harmonizer: a general-purpose training routine that both aligns DNN and human visual strategies and improves categorization accuracy. Our work represents the first demonstration that the scaling laws [1-3] that are guiding the design of DNNs today have also produced worse models of human vision. 
We release our code and data at https://serre-lab.github.io/Harmonization to help the field build more human-like DNNs.","PeriodicalId":72099,"journal":{"name":"Advances in neural information processing systems","volume":"35 1","pages":"9432-9446"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"26","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advances in neural information processing systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.48550/arXiv.2211.04533","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 26

Abstract

The many successes of deep neural networks (DNNs) over the past decade have largely been driven by computational scale rather than insights from biological intelligence. Here, we explore whether these trends have also carried concomitant improvements in explaining the visual strategies humans rely on for object recognition. We do this by comparing two related but distinct properties of visual strategies in humans and DNNs: where they believe important visual features are in images and how they use those features to categorize objects. Across 84 different DNNs trained on ImageNet and three independent datasets measuring the where and the how of human visual strategies for object recognition on those images, we find a systematic trade-off between DNN categorization accuracy and alignment with human visual strategies for object recognition. State-of-the-art DNNs are progressively becoming less aligned with humans as their accuracy improves. We rectify this growing issue with our neural harmonizer: a general-purpose training routine that both aligns DNN and human visual strategies and improves categorization accuracy. Our work represents the first demonstration that the scaling laws [1-3] that are guiding the design of DNNs today have also produced worse models of human vision. We release our code and data at https://serre-lab.github.io/Harmonization to help the field build more human-like DNNs.
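The neural harmonizer described above augments the usual classification objective with a term that pulls the model's feature-importance map toward a human importance map for the same image. The sketch below illustrates the general idea on a toy linear classifier with NumPy: cross-entropy plus a mean-squared alignment term between a gradient-based saliency proxy and a human map. The function and variable names (`harmonized_loss`, `human_map`, the gradient-times-input saliency, the `lam` weight) are illustrative assumptions for this sketch, not the paper's exact formulation; the released code at the URL above is the authoritative implementation.

```python
import numpy as np

def softmax(z):
    e = np.exp(z - z.max())
    return e / e.sum()

def normalize(m, eps=1e-8):
    # Rescale a map to [0, 1] so model and human maps are comparable.
    m = m - m.min()
    return m / (m.max() + eps)

def harmonized_loss(W, x, y, human_map, lam=1.0):
    """Cross-entropy plus an alignment term between the model's
    gradient-based saliency and a human importance map.

    W: (num_classes, num_features) weights of a toy linear model
    x: (num_features,) input; y: true class index
    human_map: (num_features,) human importance map for x
    lam: weight on the alignment term
    """
    logits = W @ x
    p = softmax(logits)
    ce = -np.log(p[y] + 1e-8)
    # For a linear model, d logits[y] / d x = W[y]; use
    # |gradient * input| as a simple saliency proxy.
    saliency = np.abs(W[y] * x)
    align = np.mean((normalize(saliency) - normalize(human_map)) ** 2)
    return ce + lam * align
```

In a real DNN the saliency would come from autodiff (e.g. input gradients of the class logit), and the paper's training routine applies the alignment at multiple spatial scales; this sketch only shows the shape of the combined objective.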