Benjamin Ertman, Bridget Xia, Mona Sloane, Tom Hartvigsen, Paul B Perrin
{"title":"Disability portrayals in artificial intelligence text-to-image generation: Influence of context and the medicalization of disability.","authors":"Benjamin Ertman, Bridget Xia, Mona Sloane, Tom Hartvigsen, Paul B Perrin","doi":"10.1037/rep0000634","DOIUrl":null,"url":null,"abstract":"<p><strong>Purpose/objective: </strong>Text-to-image (TTI) systems are artificial intelligence (AI) models that incorporate large amounts of data to produce high-resolution images. Although research has documented racial/ethnic and gender bias in TTI, little has examined disability bias. This study compared generated images of disabled people with no prompted setting to images of disabled individuals in health care settings.</p><p><strong>Research method/design: </strong>OpenAI's DALL-E-3 TTI generated 50 images for each of the following prompts: (a) \"person with a disability,\" (b) \"patient with a disability,\" (c) \"doctor with a disability,\" and (d) \"doctor with a disability and a patient without a disability.\" We calculated DALL-E's success in generating prompted images and coded disability type and demographics.</p><p><strong>Results: </strong>When prompted to create a \"person with a disability,\" DALL-E-3 was 100% successful, with a wide diversity of disabilities. When prompted to create a \"patient with a disability,\" DALL-E-3 was similarly 100% successful, although 70% of images portrayed an individual with a stereotypical physical disability. When prompted to create a \"doctor with a disability,\" DALL-E-3 did with 92% accuracy: 94% had a physical disability and 6% a sensory disability; no other disability types were portrayed. When prompted to create a \"doctor with a disability and a patient without a disability,\" in 64% of cases, DALL-E-3 generated images of doctors without disabilities, and 70% portrayed a disabled patient instead.</p><p><strong>Conclusions/implications: </strong>Disability diversity decreases dramatically when AI-generated images place disabled people in a medical environment. As TTI generation grows more ubiquitous, further work by model developers to mitigate representational harms is vital. (PsycInfo Database Record (c) 2025 APA, all rights reserved).</p>","PeriodicalId":47974,"journal":{"name":"Rehabilitation Psychology","volume":" ","pages":""},"PeriodicalIF":1.7000,"publicationDate":"2025-09-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Rehabilitation Psychology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1037/rep0000634","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"PSYCHOLOGY, CLINICAL","Score":null,"Total":0}
引用次数: 0
Abstract
Purpose/objective: Text-to-image (TTI) systems are artificial intelligence (AI) models that incorporate large amounts of data to produce high-resolution images. Although research has documented racial/ethnic and gender bias in TTI, little has examined disability bias. This study compared generated images of disabled people with no prompted setting to images of disabled individuals in health care settings.
Research method/design: OpenAI's DALL-E-3 TTI generated 50 images for each of the following prompts: (a) "person with a disability," (b) "patient with a disability," (c) "doctor with a disability," and (d) "doctor with a disability and a patient without a disability." We calculated DALL-E's success in generating prompted images and coded disability type and demographics.
Results: When prompted to create a "person with a disability," DALL-E-3 was 100% successful, with a wide diversity of disabilities. When prompted to create a "patient with a disability," DALL-E-3 was similarly 100% successful, although 70% of images portrayed an individual with a stereotypical physical disability. When prompted to create a "doctor with a disability," DALL-E-3 did with 92% accuracy: 94% had a physical disability and 6% a sensory disability; no other disability types were portrayed. When prompted to create a "doctor with a disability and a patient without a disability," in 64% of cases, DALL-E-3 generated images of doctors without disabilities, and 70% portrayed a disabled patient instead.
Conclusions/implications: Disability diversity decreases dramatically when AI-generated images place disabled people in a medical environment. As TTI generation grows more ubiquitous, further work by model developers to mitigate representational harms is vital. (PsycInfo Database Record (c) 2025 APA, all rights reserved).
期刊介绍:
Rehabilitation Psychology is a quarterly peer-reviewed journal that publishes articles in furtherance of the mission of Division 22 (Rehabilitation Psychology) of the American Psychological Association and to advance the science and practice of rehabilitation psychology. Rehabilitation psychologists consider the entire network of biological, psychological, social, environmental, and political factors that affect the functioning of persons with disabilities or chronic illness. Given the breadth of rehabilitation psychology, the journal"s scope is broadly defined.