Potential to perpetuate social biases in health care by Chinese large language models: a model evaluation study.

IF 4.1 2区 医学 Q1 PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH
Chenxi Liu, Jianing Zheng, Yushu Liu, Xi Wang, Yuting Zhang, Qiang Fu, Wenwen Yu, Ting Yu, Wang Jiang, Dan Wang, Chaojie Liu
{"title":"Potential to perpetuate social biases in health care by Chinese large language models: a model evaluation study.","authors":"Chenxi Liu, Jianing Zheng, Yushu Liu, Xi Wang, Yuting Zhang, Qiang Fu, Wenwen Yu, Ting Yu, Wang Jiang, Dan Wang, Chaojie Liu","doi":"10.1186/s12939-025-02581-5","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Large language models (LLMs) may perpetuate or amplify social biases toward patients. We systematically assessed potential biases of three popular Chinese LLMs in clinical application scenarios.</p><p><strong>Methods: </strong>We tested whether Qwen, Erine, and Baichuan encode social biases for patients of different sex, ethnicity, educational attainment, income level, and health insurance status. First, we prompted LLMs to generate clinical cases for medical education (n = 8,289) and compared the distribution of patient characteristics in LLM-generated cases with national distributions in China. Second, New England Journal of Medicine Healer clinical vignettes were used to prompt LLMs to generate differential diagnoses and treatment plans (n = 45,600), with variations analyzed based on sociodemographic characteristics. Third, we prompted LLMs to assess patient needs (n = 51,039) based on clinical cases, revealing any implicit biases toward patients with different characteristics.</p><p><strong>Results: </strong>The three LLMs showed social biases toward patients with different characteristics to varying degrees in medical education, diagnostic and treatment recommendation, and patient needs assessment. These biases were more frequent in relation to sex, ethnicity, income level, and health insurance status, compared to educational attainment. Overall, the three LLMs failed to appropriately model the sociodemographic diversity of medical conditions, consistently over-representing male, high-education and high-income populations. They also showed a higher referral rate, indicating potential refusal to treat patients, for minority ethnic groups and those without insurance or living with low incomes. The three LLMs were more likely to recommend pain medications for males, and considered patients with higher educational attainment, Han ethnicity, higher income, and those with health insurance as having healthier relationships with others.</p><p><strong>Interpretation: </strong>Our findings broaden the scopes of potential biases inherited in LLMs and highlight the urgent need for systematic and continuous assessments of social biases in LLMs in real-world clinical applications.</p>","PeriodicalId":13745,"journal":{"name":"International Journal for Equity in Health","volume":"24 1","pages":"206"},"PeriodicalIF":4.1000,"publicationDate":"2025-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12265265/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal for Equity in Health","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12939-025-02581-5","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Large language models (LLMs) may perpetuate or amplify social biases toward patients. We systematically assessed potential biases of three popular Chinese LLMs in clinical application scenarios.

Methods: We tested whether Qwen, Erine, and Baichuan encode social biases for patients of different sex, ethnicity, educational attainment, income level, and health insurance status. First, we prompted LLMs to generate clinical cases for medical education (n = 8,289) and compared the distribution of patient characteristics in LLM-generated cases with national distributions in China. Second, New England Journal of Medicine Healer clinical vignettes were used to prompt LLMs to generate differential diagnoses and treatment plans (n = 45,600), with variations analyzed based on sociodemographic characteristics. Third, we prompted LLMs to assess patient needs (n = 51,039) based on clinical cases, revealing any implicit biases toward patients with different characteristics.

Results: The three LLMs showed social biases toward patients with different characteristics to varying degrees in medical education, diagnostic and treatment recommendation, and patient needs assessment. These biases were more frequent in relation to sex, ethnicity, income level, and health insurance status, compared to educational attainment. Overall, the three LLMs failed to appropriately model the sociodemographic diversity of medical conditions, consistently over-representing male, high-education and high-income populations. They also showed a higher referral rate, indicating potential refusal to treat patients, for minority ethnic groups and those without insurance or living with low incomes. The three LLMs were more likely to recommend pain medications for males, and considered patients with higher educational attainment, Han ethnicity, higher income, and those with health insurance as having healthier relationships with others.

Interpretation: Our findings broaden the scopes of potential biases inherited in LLMs and highlight the urgent need for systematic and continuous assessments of social biases in LLMs in real-world clinical applications.

中文大语言模型对医疗保健社会偏见的潜在影响:一项模型评价研究。
背景:大型语言模型(llm)可能会延续或放大对患者的社会偏见。我们系统地评估了三个受欢迎的中国法学硕士在临床应用场景中的潜在偏倚。方法:对不同性别、种族、教育程度、收入水平和医疗保险状况的患者,Qwen、Erine和Baichuan是否编码了社会偏见。首先,我们促使法学硕士生成用于医学教育的临床病例(n = 8289),并将法学硕士生成病例的患者特征分布与中国全国分布进行比较。其次,新英格兰医学杂志临床小片段被用来促使法学硕士产生鉴别诊断和治疗计划(n = 45,600),并根据社会人口统计学特征分析差异。第三,我们提示法学硕士根据临床病例评估患者需求(n = 51,039),揭示对不同特征患者的隐性偏见。结果:3位法学硕士在医学教育、诊疗推荐、患者需求评估等方面对不同特征的患者存在不同程度的社会偏见。与受教育程度相比,这些偏见在性别、种族、收入水平和健康保险状况方面更为常见。总体而言,三位法学硕士未能适当地模拟医疗条件的社会人口多样性,始终过多地代表男性、高学历和高收入人群。他们还显示出更高的转诊率,这表明少数民族、没有保险或收入较低的患者可能会被拒绝治疗。三位法学硕士更倾向于为男性推荐止痛药,并认为受教育程度较高、汉族、收入较高和有医疗保险的患者与他人的关系更健康。解释:我们的研究结果扩大了法学硕士遗传的潜在偏见的范围,并强调了迫切需要在现实世界的临床应用中对法学硕士的社会偏见进行系统和持续的评估。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
7.80
自引率
4.20%
发文量
162
审稿时长
28 weeks
期刊介绍: International Journal for Equity in Health is an Open Access, peer-reviewed, online journal presenting evidence relevant to the search for, and attainment of, equity in health across and within countries. International Journal for Equity in Health aims to improve the understanding of issues that influence the health of populations. This includes the discussion of political, policy-related, economic, social and health services-related influences, particularly with regard to systematic differences in distributions of one or more aspects of health in population groups defined demographically, geographically, or socially.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信