A Causal Machine Learning Framework for Estimating the Impact of Cancer Diagnosis on Receipt of Advance Care Planning.

IF 3.2 2区 医学 Q2 HEALTH CARE SCIENCES & SERVICES
Aaron Baird, Yichen Cheng, Jason Lesandrini, Yusen Xia
{"title":"A Causal Machine Learning Framework for Estimating the Impact of Cancer Diagnosis on Receipt of Advance Care Planning.","authors":"Aaron Baird, Yichen Cheng, Jason Lesandrini, Yusen Xia","doi":"10.1111/1475-6773.70039","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>Develop a causal machine learning (causal ML) framework for estimating how a diagnosis (cancer in this study) affects the likelihood of receiving a specific health care service (advance care planning in this study) and associated heterogeneity.</p><p><strong>Study setting and design: </strong>Our proposed framework leverages the causal forest method, combined with a population-weighted resampling and averaging over estimations strategy, to estimate average treatment effects (ATEs) and conditional average treatment effects (CATEs). Post hoc, we used best linear projections to identify covariates associated with variation in the CATEs. We illustrate the framework by applying it to a stratified random sample of patients, where the strata are defined by the crosstabulation of cancer diagnosis (diagnosed vs. not diagnosed) and ACP receipt (documented vs. not documented).</p><p><strong>Data sources and analytic sample: </strong>We extracted deidentified patient data from October 2019 to October 2024 (n = 87,772) with explanatory variables in three categories: demographics, morbidity, and health care system utilization.</p><p><strong>Principal findings: </strong>In application of the causal ML framework, we found that patients diagnosed with cancer at this health care system to be at least 17.2% more likely to have documented ACP than similar patients not diagnosed with cancer. We also found significant heterogeneity. For instance, a one standard deviation increase in in-person outpatient visits was associated with an on-average increase in the CATE estimate (by 6.1 percentage points), while a one standard deviation increase in hospital admissions, inpatient days, and surgical duration in minutes was associated with an on-average decrease in the CATE estimate (by -1.3, -5.6, and -0.5 percentage points, respectively).</p><p><strong>Conclusions: </strong>The proposed causal ML framework enables estimation of the effect of a diagnosis on receiving a relevant health care service. In the cancer diagnosis context, it can identify patient groups less likely to receive ACP, thus informing service allocation strategies.</p>","PeriodicalId":55065,"journal":{"name":"Health Services Research","volume":" ","pages":"e70039"},"PeriodicalIF":3.2000,"publicationDate":"2025-09-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Health Services Research","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1111/1475-6773.70039","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}
引用次数: 0

Abstract

Objective: Develop a causal machine learning (causal ML) framework for estimating how a diagnosis (cancer in this study) affects the likelihood of receiving a specific health care service (advance care planning in this study) and associated heterogeneity.

Study setting and design: Our proposed framework leverages the causal forest method, combined with a population-weighted resampling and averaging over estimations strategy, to estimate average treatment effects (ATEs) and conditional average treatment effects (CATEs). Post hoc, we used best linear projections to identify covariates associated with variation in the CATEs. We illustrate the framework by applying it to a stratified random sample of patients, where the strata are defined by the crosstabulation of cancer diagnosis (diagnosed vs. not diagnosed) and ACP receipt (documented vs. not documented).

Data sources and analytic sample: We extracted deidentified patient data from October 2019 to October 2024 (n = 87,772) with explanatory variables in three categories: demographics, morbidity, and health care system utilization.

Principal findings: In application of the causal ML framework, we found that patients diagnosed with cancer at this health care system to be at least 17.2% more likely to have documented ACP than similar patients not diagnosed with cancer. We also found significant heterogeneity. For instance, a one standard deviation increase in in-person outpatient visits was associated with an on-average increase in the CATE estimate (by 6.1 percentage points), while a one standard deviation increase in hospital admissions, inpatient days, and surgical duration in minutes was associated with an on-average decrease in the CATE estimate (by -1.3, -5.6, and -0.5 percentage points, respectively).

Conclusions: The proposed causal ML framework enables estimation of the effect of a diagnosis on receiving a relevant health care service. In the cancer diagnosis context, it can identify patient groups less likely to receive ACP, thus informing service allocation strategies.

用于估计癌症诊断对接受预先护理计划的影响的因果机器学习框架。
目的:开发一个因果机器学习(因果ML)框架,用于估计诊断(本研究中的癌症)如何影响接受特定医疗服务(本研究中的提前护理计划)的可能性以及相关的异质性。研究设置和设计:我们提出的框架利用因果森林方法,结合人口加权重采样和平均估计策略,来估计平均治疗效果(ATEs)和条件平均治疗效果(CATEs)。事后,我们使用最佳线性预测来识别与CATEs变化相关的协变量。我们通过将其应用于分层随机患者样本来说明该框架,其中分层是通过癌症诊断(确诊与未确诊)和ACP接收(记录与未记录)的交叉稳定来定义的。数据来源和分析样本:我们提取了2019年10月至2024年10月的未识别患者数据(n = 87,772),解释变量分为三类:人口统计学、发病率和卫生保健系统利用率。主要发现:在因果ML框架的应用中,我们发现在该医疗保健系统中被诊断为癌症的患者比未被诊断为癌症的类似患者发生ACP的可能性至少高17.2%。我们还发现了显著的异质性。例如,每增加一个标准偏差的亲自门诊就诊与CATE估计的平均增加有关(6.1个百分点),而住院次数、住院天数和手术时间(以分钟为单位)每增加一个标准偏差与CATE估计的平均减少有关(分别减少-1.3、-5.6和-0.5个百分点)。结论:提出的因果ML框架能够估计诊断对接受相关卫生保健服务的影响。在癌症诊断环境中,它可以识别不太可能接受ACP的患者群体,从而为服务分配策略提供信息。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Health Services Research
Health Services Research 医学-卫生保健
CiteScore
4.80
自引率
5.90%
发文量
193
审稿时长
4-8 weeks
期刊介绍: Health Services Research (HSR) is a peer-reviewed scholarly journal that provides researchers and public and private policymakers with the latest research findings, methods, and concepts related to the financing, organization, delivery, evaluation, and outcomes of health services. Rated as one of the top journals in the fields of health policy and services and health care administration, HSR publishes outstanding articles reporting the findings of original investigations that expand knowledge and understanding of the wide-ranging field of health care and that will help to improve the health of individuals and communities.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信