Gender bias in generative artificial intelligence text-to-image depiction of medical students

IF 1.1 4区 医学 Q3 EDUCATION & EDUCATIONAL RESEARCH
Geoffrey Currie, Josie Currie, Sam Anderson, Johnathan Hewis
{"title":"Gender bias in generative artificial intelligence text-to-image depiction of medical students","authors":"Geoffrey Currie, Josie Currie, Sam Anderson, Johnathan Hewis","doi":"10.1177/00178969241274621","DOIUrl":null,"url":null,"abstract":"Introduction:In Australia, 54.3% of medical students are women yet they remain under-represented in stereotypical perspectives of medicine. While potentially transformative, generative artificial intelligence (genAI) has the potential for errors, misrepresentations and bias. GenAI text-to-image production could reinforce gender biases making it important to evaluate DALL-E 3 (the text-to-image genAI supported through ChatGPT) representations of Australian medical students.Method:In March 2024, DALL-E 3 was utilised via GPT-4 to generate a series of individual and group images of medical students, specifically Australian undergraduate medical students to eliminate potential confounders. Multiple iterations of images were generated using a variety of prompts. Collectively, 47 images were produced for evaluation of which 33 were individual characters and the remaining 14 images were comprised of multiple (5 to 67) characters. All images were independently analysed by three reviewers for apparent gender and skin tone. Consequently, 33 feature individuals were evaluated and a further 417 characters in groups were evaluated ( N = 448). Discrepancies in responses were resolved by consensus.Results:Collectively (individual and group images), 58.8% ( N = 258) of medical students were depicted as men, 39.9% ( N = 175) as women, 92.0% ( N = 404) with a light skin tone, 7.7% ( N = 34) with mid skin tone and 0% with dark skin tone. The gender distribution was a statistically significant variation from that of actual Australian medical students for individual images, for group images and for collective images. Among the images of individual medical students ( N = 25), DALL-E 3 generated 92% ( N = 23) as men and 100% were of light skin tone ( N = 25).Conclusion:This evaluation reveals the gender associated with genAI text-to-image generation using DALL-E 3 among Australian undergraduate medical students. Generated images included a disproportionately high proportion of white male medical students which is not representative of the diversity of medical students in Australia. The use of DALL-E 3 to produce depictions of medical students for education or promotion purposes should be done with caution.","PeriodicalId":47346,"journal":{"name":"Health Education Journal","volume":"1 1","pages":""},"PeriodicalIF":1.1000,"publicationDate":"2024-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Health Education Journal","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1177/00178969241274621","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"EDUCATION & EDUCATIONAL RESEARCH","Score":null,"Total":0}
引用次数: 0

Abstract

Introduction:In Australia, 54.3% of medical students are women yet they remain under-represented in stereotypical perspectives of medicine. While potentially transformative, generative artificial intelligence (genAI) has the potential for errors, misrepresentations and bias. GenAI text-to-image production could reinforce gender biases making it important to evaluate DALL-E 3 (the text-to-image genAI supported through ChatGPT) representations of Australian medical students.Method:In March 2024, DALL-E 3 was utilised via GPT-4 to generate a series of individual and group images of medical students, specifically Australian undergraduate medical students to eliminate potential confounders. Multiple iterations of images were generated using a variety of prompts. Collectively, 47 images were produced for evaluation of which 33 were individual characters and the remaining 14 images were comprised of multiple (5 to 67) characters. All images were independently analysed by three reviewers for apparent gender and skin tone. Consequently, 33 feature individuals were evaluated and a further 417 characters in groups were evaluated ( N = 448). Discrepancies in responses were resolved by consensus.Results:Collectively (individual and group images), 58.8% ( N = 258) of medical students were depicted as men, 39.9% ( N = 175) as women, 92.0% ( N = 404) with a light skin tone, 7.7% ( N = 34) with mid skin tone and 0% with dark skin tone. The gender distribution was a statistically significant variation from that of actual Australian medical students for individual images, for group images and for collective images. Among the images of individual medical students ( N = 25), DALL-E 3 generated 92% ( N = 23) as men and 100% were of light skin tone ( N = 25).Conclusion:This evaluation reveals the gender associated with genAI text-to-image generation using DALL-E 3 among Australian undergraduate medical students. Generated images included a disproportionately high proportion of white male medical students which is not representative of the diversity of medical students in Australia. The use of DALL-E 3 to produce depictions of medical students for education or promotion purposes should be done with caution.
生成式人工智能文本到图像描述医学生中的性别偏见
导言:在澳大利亚,54.3% 的医科学生是女性,但她们在医学刻板印象中的代表性仍然不足。生成式人工智能(genAI)虽然具有潜在的变革性,但也有可能出现错误、歪曲和偏见。GenAI 文本到图像的生成可能会强化性别偏见,因此评估 DALL-E 3(通过 ChatGPT 支持的文本到图像 GenAI)对澳大利亚医科学生的表现非常重要。方法:2024 年 3 月,DALL-E 3 通过 GPT-4 生成了一系列医科学生的个人和群体图像,特别是澳大利亚本科医科学生,以消除潜在的混杂因素。使用各种提示多次重复生成图像。总共生成了 47 幅用于评估的图像,其中 33 幅为单个人物,其余 14 幅由多个(5 至 67 个)人物组成。所有图像均由三名审查员独立分析,以确定明显的性别和肤色。因此,有 33 个特征个体接受了评估,另有 417 个人物群体接受了评估(N = 448)。结果:总体而言(个体和群体图像),58.8%(N = 258)的医学生为男性,39.9%(N = 175)为女性,92.0%(N = 404)为浅肤色,7.7%(N = 34)为中肤色,0%为深肤色。在统计上,单个图像、群体图像和集体图像的性别分布与实际澳大利亚医学生的性别分布有显著差异。在医学生个人图像(25 张)中,DALL-E 3 生成的图像中有 92% (23 张)为男性,100% 为浅肤色(25 张)。生成的图像中白人男性医学生的比例过高,这并不代表澳大利亚医学生的多样性。在使用 DALL-E 3 生成用于教育或宣传目的的医学生图像时应谨慎。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
2.30
自引率
0.00%
发文量
65
期刊介绍: Health Education Journal is a leading peer reviewed journal established in 1943. It carries original papers on health promotion and education research, policy development and good practice.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信