Inferring human vision in a human-like way: Key factors influencing the cognitive processing of level-1 visual perspective-taking

IF 4.9 | Zone 1 (Literature) | JCR Q1 (Communication)
Song Zhou, Huaqi Yang, Ming Ye, Ning Ding, Tao Liu
DOI: 10.1177/00936502241302569 (https://doi.org/10.1177/00936502241302569)
Journal: Communication Research, volume 54 1
Published: 2024-11-28
Publication type: Journal Article
Citations: 0

Abstract

The advancement of artificial intelligence (AI) has expanded the potential for human-machine communication and collaboration in complex contexts, requiring AI to exhibit human-like behavior in order to align with its human counterpart. Consequently, understanding human behavioral traits is advantageous for developing AI agents that resemble humans. This study investigated how individuals process visual information from others, with the aim of informing the future design of intelligent vision systems. Across four experiments, participants judged whether a given number corresponded to the number of balls, while the gaze direction of an avatar was manipulated by averting its eyes or altering its head orientation. The results indicate that participant response times were influenced regardless of the avatar’s gaze direction. Specifically, when the avatar was positioned with its back facing the balls, any disparity in participant performance across conditions was eliminated. These findings suggest that implicit level-1 visual perspective-taking may rely primarily not on gaze direction but on perceiving affordances within the environment. Such insights contribute to a deeper understanding of the cognitive mechanisms underlying level-1 visual perspective-taking and can serve as a theoretical foundation for advancing AI vision algorithms in human-machine communication and collaboration.
Source journal: Communication Research (Communication)
CiteScore: 17.10
Self-citation rate: 0.00%
Articles published: 20
About the journal: Empirical research in communication began in the 20th century, and there are more researchers pursuing answers to communication questions today than at any other time. The editorial goal of Communication Research is to offer a special opportunity for reflection and change in the new millennium. To qualify for publication, research should, first, be explicitly tied to some form of communication; second, be theoretically driven with results that inform theory; third, use the most rigorous empirical methods; and fourth, be directly linked to the most important problems and issues facing humankind. Criteria do not privilege any particular context; indeed, we believe that the key problems facing humankind occur in close relationships, groups, organizations, and cultures.