Deep convolutional neural networks are sensitive to face configuration.

IF 2 4区心理学 Q2 OPHTHALMOLOGY

Journal of Vision Pub Date : 2024-11-04 DOI:10.1167/jov.24.12.6

Virginia E Strehle, Natalie K Bendiksen, Alice J O'Toole

{"title":"Deep convolutional neural networks are sensitive to face configuration.","authors":"Virginia E Strehle, Natalie K Bendiksen, Alice J O'Toole","doi":"10.1167/jov.24.12.6","DOIUrl":null,"url":null,"abstract":"<p><p>Deep convolutional neural networks (DCNNs) are remarkably accurate models of human face recognition. However, less is known about whether these models generate face representations similar to those used by humans. Sensitivity to facial configuration has long been considered a marker of human perceptual expertise for faces. We tested whether DCNNs trained for face identification \"perceive\" alterations to facial features and their configuration. We also compared the extent to which representations changed as a function of the alteration type. Facial configuration was altered by changing the distance between the eyes or the distance between the nose and mouth. Facial features were altered by replacing the eyes or mouth with those of another face. Altered faces were processed by DCNNs (Ranjan et al., 2018; Szegedy et al., 2017) and the similarity of the generated representations was compared. Both DCNNs were sensitive to configural and feature changes-with changes to configuration altering the DCNN representations more than changes to face features. To determine whether the DCNNs' greater sensitivity to configuration was due to a priori differences in the images or characteristics of the DCNN processing, we compared the representation of features and configuration between the low-level, pixel-based representations and the DCNN-generated representations. Sensitivity to face configuration increased from the pixel-level image to the DCNN encoding, whereas the sensitivity to features did not change. The enhancement of configural information may be due to the utility of configuration for discriminating among similar faces combined with the within-category nature of face identification training.</p>","PeriodicalId":49955,"journal":{"name":"Journal of Vision","volume":"24 12","pages":"6"},"PeriodicalIF":2.0000,"publicationDate":"2024-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11542502/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Vision","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1167/jov.24.12.6","RegionNum":4,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"OPHTHALMOLOGY","Score":null,"Total":0}

引用次数: 0

Abstract

Deep convolutional neural networks (DCNNs) are remarkably accurate models of human face recognition. However, less is known about whether these models generate face representations similar to those used by humans. Sensitivity to facial configuration has long been considered a marker of human perceptual expertise for faces. We tested whether DCNNs trained for face identification "perceive" alterations to facial features and their configuration. We also compared the extent to which representations changed as a function of the alteration type. Facial configuration was altered by changing the distance between the eyes or the distance between the nose and mouth. Facial features were altered by replacing the eyes or mouth with those of another face. Altered faces were processed by DCNNs (Ranjan et al., 2018; Szegedy et al., 2017) and the similarity of the generated representations was compared. Both DCNNs were sensitive to configural and feature changes-with changes to configuration altering the DCNN representations more than changes to face features. To determine whether the DCNNs' greater sensitivity to configuration was due to a priori differences in the images or characteristics of the DCNN processing, we compared the representation of features and configuration between the low-level, pixel-based representations and the DCNN-generated representations. Sensitivity to face configuration increased from the pixel-level image to the DCNN encoding, whereas the sensitivity to features did not change. The enhancement of configural information may be due to the utility of configuration for discriminating among similar faces combined with the within-category nature of face identification training.

查看原文本刊更多论文

深度卷积神经网络对人脸结构非常敏感。

深度卷积神经网络（DCNN）是非常精确的人类人脸识别模型。然而，人们对这些模型是否能生成类似于人类使用的人脸表征却知之甚少。长期以来，对人脸构型的敏感性一直被认为是人类对人脸的感知能力的标志。我们测试了接受过人脸识别训练的 DCNN 是否能 "感知 "面部特征及其配置的变化。我们还比较了表征随改变类型而改变的程度。通过改变眼睛之间的距离或鼻子和嘴巴之间的距离来改变面部构造。改变面部特征的方法是将眼睛或嘴巴换成另一张脸的眼睛或嘴巴。改变后的人脸由 DCNNs（Ranjan 等人，2018 年；Szegedy 等人，2017 年）处理，并比较生成的表征的相似性。两种 DCNN 对构型和特征的变化都很敏感--构型的变化对 DCNN 表征的改变比人脸特征的变化更大。为了确定 DCNN 对构型更敏感是否是由于图像的先验差异或 DCNN 处理的特点，我们比较了基于像素的低级表征和 DCNN 生成的表征之间的特征和构型表征。从像素级图像到 DCNN 编码，对人脸构型的敏感度增加了，而对特征的敏感度没有变化。构型信息的增强可能是由于构型在区分相似人脸方面的作用，以及人脸识别训练的类别内性质。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Vision 医学-眼科学

CiteScore

2.90

自引率

5.60%

发文量

218

审稿时长

3-6 weeks

期刊介绍： Exploring all aspects of biological visual function, including spatial vision, perception, low vision, color vision and more, spanning the fields of neuroscience, psychology and psychophysics.