{"title":"Learning unbiased risk prediction based algorithms in healthcare: A case study with primary care patients","authors":"Vibhuti Gupta , Julian Broughton , Ange Rukundo , Lubna J. Pinky","doi":"10.1016/j.imu.2025.101627","DOIUrl":null,"url":null,"abstract":"<div><div>The proliferation of Artificial Intelligence (AI) has revolutionized the healthcare domain with technological advancements in conventional diagnosis and treatment methods. These advancements lead to faster disease detection, and management and provide personalized healthcare solutions. However, most of the clinical AI methods developed and deployed in hospitals have algorithmic and data-driven biases due to insufficient representation of specific race, gender, and age group which leads to misdiagnosis, disparities, and unfair outcomes. Thus, it is crucial to thoroughly examine these biases and develop computational methods that can mitigate biases effectively. This paper critically analyzes this problem by exploring different types of data and algorithmic biases during both pre-processing and post-processing phases to uncover additional, previously unexplored biases in a widely used real-world healthcare dataset of primary care patients. Additionally, effective strategies are proposed to address gender, race, and age biases, ensuring that risk prediction outcomes are equitable and impartial. Through experiments with various machine learning algorithms leveraging the Fairlearn tool, we have identified biases in the dataset, compared the impact of these biases on the prediction performance, and proposed effective strategies to mitigate these biases. Our results demonstrate clear evidence of racial, gender-based, and age-related biases in the healthcare dataset used to guide resource allocation for patients and have profound impact on the prediction performance which leads to unfair outcomes. 
Thus, it is crucial to implement mechanisms to detect and address unintended biases to ensure a safe, reliable, and trustworthy AI system in healthcare.</div></div>","PeriodicalId":13953,"journal":{"name":"Informatics in Medicine Unlocked","volume":"54 ","pages":"Article 101627"},"PeriodicalIF":0.0000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Informatics in Medicine Unlocked","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2352914825000152","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Medicine","Score":null,"Total":0}
Citations: 0
Abstract
The proliferation of Artificial Intelligence (AI) has revolutionized the healthcare domain with technological advancements in conventional diagnosis and treatment methods. These advancements lead to faster disease detection and management, and provide personalized healthcare solutions. However, most clinical AI methods developed and deployed in hospitals carry algorithmic and data-driven biases due to insufficient representation of specific racial, gender, and age groups, which leads to misdiagnosis, disparities, and unfair outcomes. It is therefore crucial to thoroughly examine these biases and develop computational methods that can mitigate them effectively. This paper critically analyzes the problem by exploring different types of data and algorithmic biases during both the pre-processing and post-processing phases, uncovering additional, previously unexplored biases in a widely used real-world healthcare dataset of primary care patients. Additionally, effective strategies are proposed to address gender, race, and age biases, ensuring that risk prediction outcomes are equitable and impartial. Through experiments with various machine learning algorithms leveraging the Fairlearn tool, we identified biases in the dataset, compared the impact of these biases on prediction performance, and proposed effective strategies to mitigate them. Our results demonstrate clear evidence of racial, gender-based, and age-related biases in a healthcare dataset used to guide resource allocation for patients; these biases have a profound impact on prediction performance and lead to unfair outcomes. Thus, it is crucial to implement mechanisms that detect and address unintended biases to ensure safe, reliable, and trustworthy AI systems in healthcare.
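The kind of group-fairness audit the abstract describes can be illustrated with a minimal sketch. The snippet below uses hypothetical data (it is not the authors' code and does not use their dataset) to compute the demographic parity difference: the gap in positive-prediction rate between the most- and least-favored groups, one of the standard group-fairness metrics that tools like Fairlearn report for sensitive attributes such as race, gender, or age. A value of 0 means all groups are selected at the same rate; larger values indicate disparity.

```python
def selection_rate(y_pred, sensitive, group):
    """Fraction of positive predictions within one sensitive group."""
    preds = [p for p, s in zip(y_pred, sensitive) if s == group]
    return sum(preds) / len(preds)

def demographic_parity_difference(y_pred, sensitive):
    """Max minus min selection rate across sensitive groups."""
    rates = [selection_rate(y_pred, sensitive, g) for g in set(sensitive)]
    return max(rates) - min(rates)

# Hypothetical binary risk predictions for two demographic groups.
y_pred    = [1, 0, 1, 1, 0, 0, 1, 0]
sensitive = ["A", "A", "A", "A", "B", "B", "B", "B"]

# Group A is flagged high-risk at rate 0.75, group B at 0.25.
print(demographic_parity_difference(y_pred, sensitive))  # 0.5
```

Fairlearn exposes the same quantity through its metrics module, alongside per-group breakdowns of accuracy-style metrics; a nonzero gap here is the kind of evidence of racial, gender, or age bias the paper reports.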
Journal introduction:
Informatics in Medicine Unlocked (IMU) is an international gold open access journal covering a broad spectrum of topics within medical informatics, including (but not limited to) papers focusing on imaging, pathology, teledermatology, public health, ophthalmology, nursing, and translational medicine informatics. The full papers published in the journal are accessible to all who visit the website.