An intersectional framework for counterfactual fairness in risk prediction.

IF 2 3区数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Biostatistics Pub Date : 2024-07-01 DOI:10.1093/biostatistics/kxad021

Solvejg Wastvedt, Jared D Huling, Julian Wolfson

{"title":"An intersectional framework for counterfactual fairness in risk prediction.","authors":"Solvejg Wastvedt, Jared D Huling, Julian Wolfson","doi":"10.1093/biostatistics/kxad021","DOIUrl":null,"url":null,"abstract":"<p><p>Along with the increasing availability of health data has come the rise of data-driven models to inform decision making and policy. These models have the potential to benefit both patients and health care providers but can also exacerbate health inequities. Existing \"algorithmic fairness\" methods for measuring and correcting model bias fall short of what is needed for health policy in two key ways. First, methods typically focus on a single grouping along which discrimination may occur rather than considering multiple, intersecting groups. Second, in clinical applications, risk prediction is typically used to guide treatment, creating distinct statistical issues that invalidate most existing techniques. We present novel unfairness metrics that address both challenges. We also develop a complete framework of estimation and inference tools for our metrics, including the unfairness value (\"u-value\"), used to determine the relative extremity of unfairness, and standard errors and confidence intervals employing an alternative to the standard bootstrap. We demonstrate application of our framework to a COVID-19 risk prediction model deployed in a major Midwestern health system.</p>","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":" ","pages":"702-717"},"PeriodicalIF":2.0000,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biostatistics","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1093/biostatistics/kxad021","RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MATHEMATICAL & COMPUTATIONAL BIOLOGY","Score":null,"Total":0}

引用次数: 0

Abstract

Along with the increasing availability of health data has come the rise of data-driven models to inform decision making and policy. These models have the potential to benefit both patients and health care providers but can also exacerbate health inequities. Existing "algorithmic fairness" methods for measuring and correcting model bias fall short of what is needed for health policy in two key ways. First, methods typically focus on a single grouping along which discrimination may occur rather than considering multiple, intersecting groups. Second, in clinical applications, risk prediction is typically used to guide treatment, creating distinct statistical issues that invalidate most existing techniques. We present novel unfairness metrics that address both challenges. We also develop a complete framework of estimation and inference tools for our metrics, including the unfairness value ("u-value"), used to determine the relative extremity of unfairness, and standard errors and confidence intervals employing an alternative to the standard bootstrap. We demonstrate application of our framework to a COVID-19 risk prediction model deployed in a major Midwestern health system.

查看原文本刊更多论文

风险预测中反事实公平性的交叉框架。

随着健康数据的日益普及，为决策和政策提供信息的数据驱动模型也随之兴起。这些模型有可能使患者和医疗服务提供者受益，但也可能加剧健康不平等。现有的衡量和纠正模型偏差的 "算法公平性 "方法在两个关键方面无法满足卫生政策的需要。首先，这些方法通常只关注可能出现歧视的单一分组，而不是考虑多个交叉分组。其次，在临床应用中，风险预测通常用于指导治疗，这就产生了明显的统计问题，使大多数现有技术失效。我们提出了新的不公平度量方法来应对这两个挑战。我们还为我们的指标开发了一个完整的估算和推理工具框架，包括不公平值（"u 值"）（用于确定不公平的相对极值），以及标准误差和置信区间（采用标准自举法的替代方法）。我们展示了我们的框架在中西部一家大型医疗系统部署的 COVID-19 风险预测模型中的应用。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Biostatistics 生物-数学与计算生物学

CiteScore

5.10

自引率

4.80%

发文量

审稿时长

6-12 weeks

期刊介绍： Among the important scientific developments of the 20th century is the explosive growth in statistical reasoning and methods for application to studies of human health. Examples include developments in likelihood methods for inference, epidemiologic statistics, clinical trials, survival analysis, and statistical genetics. Substantive problems in public health and biomedical research have fueled the development of statistical methods, which in turn have improved our ability to draw valid inferences from data. The objective of Biostatistics is to advance statistical science and its application to problems of human health and disease, with the ultimate goal of advancing the public''s health.