通过现实的（中期）旅程：评估人工智能生成图像的人类检测的准确性，冒名顶替者偏见和自动化偏见

IF 3 Q1 PSYCHOLOGY, MULTIDISCIPLINARY

Human Behavior and Emerging Technologies Pub Date : 2025-09-11 DOI:10.1155/hbe2/9977058

Mirko Casu, Luca Guarnera, Ignazio Zangara, Pasquale Caponnetto, Sebastiano Battiato

{"title":"通过现实的（中期）旅程：评估人工智能生成图像的人类检测的准确性，冒名顶替者偏见和自动化偏见","authors":"Mirko Casu, Luca Guarnera, Ignazio Zangara, Pasquale Caponnetto, Sebastiano Battiato","doi":"10.1155/hbe2/9977058","DOIUrl":null,"url":null,"abstract":"While the challenge of distinguishing AI-generated from real images is widely acknowledged, the specific cognitive biases that systematically shape human judgment in this domain remain poorly understood. It is particularly unclear how a general awareness of AI capabilities fosters novel biases, like a pervasive skepticism (“impostor bias”), and how this interacts with established phenomena like “automation bias”. This study addresses this gap by providing the first quantitative analysis of how these two biases operate across five distinct experimental variants designed to test the context-dependency of human perception. Through a mixed-methods study with 746 participants, we demonstrate that human authentication accuracy hovered around chance levels (ranging from 47.0% to 55.5%). However, our analysis provides robust evidence for the systematic operation of cognitive biases. We validate the presence of “impostor bias” through a consistent pattern of higher doubt for AI-generated images and confirm “automation bias” through significant opinion changes following algorithmic suggestions. Our findings reveal that these biases are not uniform across populations: gender was a consistent predictor of automation bias, with males in all five variants showing a significantly stronger and more consistent tendency (Cohen’s d = 0.254–0.683) to be influenced by algorithmic suggestions. In contrast, age and academic background had minimal and highly localized effects. Furthermore, we identified a significant interaction between experimental stimuli and performance over time, isolating a pronounced fatigue effect to a single questionnaire variant where accuracy progressively declined (by approximately 1.7% per trial). By integrating human feedback with Grad-CAM visualizations, we confirm a divergence between human holistic evaluation and the localized focus of machine learning models. These findings carry direct implications for policy, as discussed within the context of the European AI Act, and inform the design of human–AI systems and media literacy programs aimed at mitigating these critical cognitive vulnerabilities.","PeriodicalId":36408,"journal":{"name":"Human Behavior and Emerging Technologies","volume":"2025 1","pages":""},"PeriodicalIF":3.0000,"publicationDate":"2025-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1155/hbe2/9977058","citationCount":"0","resultStr":"{\"title\":\"A (Mid)journey Through Reality: Assessing Accuracy, Impostor Bias, and Automation Bias in Human Detection of AI-Generated Images\",\"authors\":\"Mirko Casu, Luca Guarnera, Ignazio Zangara, Pasquale Caponnetto, Sebastiano Battiato\",\"doi\":\"10.1155/hbe2/9977058\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"While the challenge of distinguishing AI-generated from real images is widely acknowledged, the specific cognitive biases that systematically shape human judgment in this domain remain poorly understood. It is particularly unclear how a general awareness of AI capabilities fosters novel biases, like a pervasive skepticism (“impostor bias”), and how this interacts with established phenomena like “automation bias”. This study addresses this gap by providing the first quantitative analysis of how these two biases operate across five distinct experimental variants designed to test the context-dependency of human perception. Through a mixed-methods study with 746 participants, we demonstrate that human authentication accuracy hovered around chance levels (ranging from 47.0% to 55.5%). However, our analysis provides robust evidence for the systematic operation of cognitive biases. We validate the presence of “impostor bias” through a consistent pattern of higher doubt for AI-generated images and confirm “automation bias” through significant opinion changes following algorithmic suggestions. Our findings reveal that these biases are not uniform across populations: gender was a consistent predictor of automation bias, with males in all five variants showing a significantly stronger and more consistent tendency (Cohen’s d = 0.254–0.683) to be influenced by algorithmic suggestions. In contrast, age and academic background had minimal and highly localized effects. Furthermore, we identified a significant interaction between experimental stimuli and performance over time, isolating a pronounced fatigue effect to a single questionnaire variant where accuracy progressively declined (by approximately 1.7% per trial). By integrating human feedback with Grad-CAM visualizations, we confirm a divergence between human holistic evaluation and the localized focus of machine learning models. These findings carry direct implications for policy, as discussed within the context of the European AI Act, and inform the design of human–AI systems and media literacy programs aimed at mitigating these critical cognitive vulnerabilities.\",\"PeriodicalId\":36408,\"journal\":{\"name\":\"Human Behavior and Emerging Technologies\",\"volume\":\"2025 1\",\"pages\":\"\"},\"PeriodicalIF\":3.0000,\"publicationDate\":\"2025-09-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1155/hbe2/9977058\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Human Behavior and Emerging Technologies\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1155/hbe2/9977058\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"PSYCHOLOGY, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Human Behavior and Emerging Technologies","FirstCategoryId":"1085","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1155/hbe2/9977058","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PSYCHOLOGY, MULTIDISCIPLINARY","Score":null,"Total":0}

引用次数: 0

摘要

虽然将人工智能生成的图像与真实图像区分开来的挑战得到了广泛认可，但在这一领域系统地塑造人类判断的特定认知偏见仍然知之甚少。尤其不清楚的是，对人工智能能力的普遍认知如何催生新的偏见，比如普遍的怀疑（“冒名顶替偏见”），以及这种偏见如何与“自动化偏见”等既定现象相互作用。本研究通过首次定量分析这两种偏差如何在五种不同的实验变体中运作来解决这一差距，这些实验变体旨在测试人类感知的情境依赖性。通过对746名参与者的混合方法研究，我们证明了人类身份验证的准确性徘徊在机会水平附近（范围从47.0%到55.5%）。然而，我们的分析为认知偏差的系统运作提供了有力的证据。我们通过对人工智能生成的图像的一致的更高怀疑模式来验证“冒名顶替者偏见”的存在，并通过算法建议后的重大意见变化来确认“自动化偏见”。我们的研究结果表明，这些偏差在人群中并不统一：性别是自动化偏差的一致预测因素，所有五种变体中的男性都表现出明显更强、更一致的倾向（Cohen的d = 0.254-0.683），受到算法建议的影响。相反，年龄和学术背景的影响很小，而且高度局限。此外，我们确定了实验刺激与表现之间随时间的显著相互作用，隔离了单一问卷变体的明显疲劳效应，其准确性逐渐下降（每次试验约下降1.7%）。通过将人类反馈与Grad-CAM可视化相结合，我们确认了人类整体评估与机器学习模型的局部焦点之间的分歧。正如在《欧洲人工智能法案》的背景下所讨论的那样，这些发现对政策有直接影响，并为旨在减轻这些关键认知漏洞的人类-人工智能系统和媒体扫盲计划的设计提供信息。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

A (Mid)journey Through Reality: Assessing Accuracy, Impostor Bias, and Automation Bias in Human Detection of AI-Generated Images

查看原文本刊更多论文

A (Mid)journey Through Reality: Assessing Accuracy, Impostor Bias, and Automation Bias in Human Detection of AI-Generated Images

While the challenge of distinguishing AI-generated from real images is widely acknowledged, the specific cognitive biases that systematically shape human judgment in this domain remain poorly understood. It is particularly unclear how a general awareness of AI capabilities fosters novel biases, like a pervasive skepticism (“impostor bias”), and how this interacts with established phenomena like “automation bias”. This study addresses this gap by providing the first quantitative analysis of how these two biases operate across five distinct experimental variants designed to test the context-dependency of human perception. Through a mixed-methods study with 746 participants, we demonstrate that human authentication accuracy hovered around chance levels (ranging from 47.0% to 55.5%). However, our analysis provides robust evidence for the systematic operation of cognitive biases. We validate the presence of “impostor bias” through a consistent pattern of higher doubt for AI-generated images and confirm “automation bias” through significant opinion changes following algorithmic suggestions. Our findings reveal that these biases are not uniform across populations: gender was a consistent predictor of automation bias, with males in all five variants showing a significantly stronger and more consistent tendency (Cohen’s d = 0.254–0.683) to be influenced by algorithmic suggestions. In contrast, age and academic background had minimal and highly localized effects. Furthermore, we identified a significant interaction between experimental stimuli and performance over time, isolating a pronounced fatigue effect to a single questionnaire variant where accuracy progressively declined (by approximately 1.7% per trial). By integrating human feedback with Grad-CAM visualizations, we confirm a divergence between human holistic evaluation and the localized focus of machine learning models. These findings carry direct implications for policy, as discussed within the context of the European AI Act, and inform the design of human–AI systems and media literacy programs aimed at mitigating these critical cognitive vulnerabilities.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Human Behavior and Emerging Technologies Social Sciences-Social Sciences (all)

CiteScore

17.20

自引率

8.70%

发文量

期刊介绍： Human Behavior and Emerging Technologies is an interdisciplinary journal dedicated to publishing high-impact research that enhances understanding of the complex interactions between diverse human behavior and emerging digital technologies.