用于估计癌症诊断对接受预先护理计划的影响的因果机器学习框架。

IF 3.2 2区医学 Q2 HEALTH CARE SCIENCES & SERVICES

Health Services Research Pub Date : 2025-09-06 DOI:10.1111/1475-6773.70039

Aaron Baird, Yichen Cheng, Jason Lesandrini, Yusen Xia

{"title":"用于估计癌症诊断对接受预先护理计划的影响的因果机器学习框架。","authors":"Aaron Baird, Yichen Cheng, Jason Lesandrini, Yusen Xia","doi":"10.1111/1475-6773.70039","DOIUrl":null,"url":null,"abstract":"Objective: Develop a causal machine learning (causal ML) framework for estimating how a diagnosis (cancer in this study) affects the likelihood of receiving a specific health care service (advance care planning in this study) and associated heterogeneity.Study setting and design: Our proposed framework leverages the causal forest method, combined with a population-weighted resampling and averaging over estimations strategy, to estimate average treatment effects (ATEs) and conditional average treatment effects (CATEs). Post hoc, we used best linear projections to identify covariates associated with variation in the CATEs. We illustrate the framework by applying it to a stratified random sample of patients, where the strata are defined by the crosstabulation of cancer diagnosis (diagnosed vs. not diagnosed) and ACP receipt (documented vs. not documented).Data sources and analytic sample: We extracted deidentified patient data from October 2019 to October 2024 (n = 87,772) with explanatory variables in three categories: demographics, morbidity, and health care system utilization.Principal findings: In application of the causal ML framework, we found that patients diagnosed with cancer at this health care system to be at least 17.2% more likely to have documented ACP than similar patients not diagnosed with cancer. We also found significant heterogeneity. For instance, a one standard deviation increase in in-person outpatient visits was associated with an on-average increase in the CATE estimate (by 6.1 percentage points), while a one standard deviation increase in hospital admissions, inpatient days, and surgical duration in minutes was associated with an on-average decrease in the CATE estimate (by -1.3, -5.6, and -0.5 percentage points, respectively).Conclusions: The proposed causal ML framework enables estimation of the effect of a diagnosis on receiving a relevant health care service. In the cancer diagnosis context, it can identify patient groups less likely to receive ACP, thus informing service allocation strategies.","PeriodicalId":55065,"journal":{"name":"Health Services Research","volume":" ","pages":"e70039"},"PeriodicalIF":3.2000,"publicationDate":"2025-09-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Causal Machine Learning Framework for Estimating the Impact of Cancer Diagnosis on Receipt of Advance Care Planning.\",\"authors\":\"Aaron Baird, Yichen Cheng, Jason Lesandrini, Yusen Xia\",\"doi\":\"10.1111/1475-6773.70039\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Objective: Develop a causal machine learning (causal ML) framework for estimating how a diagnosis (cancer in this study) affects the likelihood of receiving a specific health care service (advance care planning in this study) and associated heterogeneity.Study setting and design: Our proposed framework leverages the causal forest method, combined with a population-weighted resampling and averaging over estimations strategy, to estimate average treatment effects (ATEs) and conditional average treatment effects (CATEs). Post hoc, we used best linear projections to identify covariates associated with variation in the CATEs. We illustrate the framework by applying it to a stratified random sample of patients, where the strata are defined by the crosstabulation of cancer diagnosis (diagnosed vs. not diagnosed) and ACP receipt (documented vs. not documented).Data sources and analytic sample: We extracted deidentified patient data from October 2019 to October 2024 (n = 87,772) with explanatory variables in three categories: demographics, morbidity, and health care system utilization.Principal findings: In application of the causal ML framework, we found that patients diagnosed with cancer at this health care system to be at least 17.2% more likely to have documented ACP than similar patients not diagnosed with cancer. We also found significant heterogeneity. For instance, a one standard deviation increase in in-person outpatient visits was associated with an on-average increase in the CATE estimate (by 6.1 percentage points), while a one standard deviation increase in hospital admissions, inpatient days, and surgical duration in minutes was associated with an on-average decrease in the CATE estimate (by -1.3, -5.6, and -0.5 percentage points, respectively).Conclusions: The proposed causal ML framework enables estimation of the effect of a diagnosis on receiving a relevant health care service. In the cancer diagnosis context, it can identify patient groups less likely to receive ACP, thus informing service allocation strategies.\",\"PeriodicalId\":55065,\"journal\":{\"name\":\"Health Services Research\",\"volume\":\" \",\"pages\":\"e70039\"},\"PeriodicalIF\":3.2000,\"publicationDate\":\"2025-09-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Health Services Research\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1111/1475-6773.70039\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"HEALTH CARE SCIENCES & SERVICES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Health Services Research","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1111/1475-6773.70039","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}

引用次数: 0

摘要

目的：开发一个因果机器学习（因果ML）框架，用于估计诊断（本研究中的癌症）如何影响接受特定医疗服务（本研究中的提前护理计划）的可能性以及相关的异质性。研究设置和设计：我们提出的框架利用因果森林方法，结合人口加权重采样和平均估计策略，来估计平均治疗效果（ATEs）和条件平均治疗效果（CATEs）。事后，我们使用最佳线性预测来识别与CATEs变化相关的协变量。我们通过将其应用于分层随机患者样本来说明该框架，其中分层是通过癌症诊断（确诊与未确诊）和ACP接收（记录与未记录）的交叉稳定来定义的。数据来源和分析样本：我们提取了2019年10月至2024年10月的未识别患者数据（n = 87,772），解释变量分为三类：人口统计学、发病率和卫生保健系统利用率。主要发现：在因果ML框架的应用中，我们发现在该医疗保健系统中被诊断为癌症的患者比未被诊断为癌症的类似患者发生ACP的可能性至少高17.2%。我们还发现了显著的异质性。例如，每增加一个标准偏差的亲自门诊就诊与CATE估计的平均增加有关（6.1个百分点），而住院次数、住院天数和手术时间（以分钟为单位）每增加一个标准偏差与CATE估计的平均减少有关（分别减少-1.3、-5.6和-0.5个百分点）。结论：提出的因果ML框架能够估计诊断对接受相关卫生保健服务的影响。在癌症诊断环境中，它可以识别不太可能接受ACP的患者群体，从而为服务分配策略提供信息。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A Causal Machine Learning Framework for Estimating the Impact of Cancer Diagnosis on Receipt of Advance Care Planning.

Objective: Develop a causal machine learning (causal ML) framework for estimating how a diagnosis (cancer in this study) affects the likelihood of receiving a specific health care service (advance care planning in this study) and associated heterogeneity.

Study setting and design: Our proposed framework leverages the causal forest method, combined with a population-weighted resampling and averaging over estimations strategy, to estimate average treatment effects (ATEs) and conditional average treatment effects (CATEs). Post hoc, we used best linear projections to identify covariates associated with variation in the CATEs. We illustrate the framework by applying it to a stratified random sample of patients, where the strata are defined by the crosstabulation of cancer diagnosis (diagnosed vs. not diagnosed) and ACP receipt (documented vs. not documented).

Data sources and analytic sample: We extracted deidentified patient data from October 2019 to October 2024 (n = 87,772) with explanatory variables in three categories: demographics, morbidity, and health care system utilization.

Principal findings: In application of the causal ML framework, we found that patients diagnosed with cancer at this health care system to be at least 17.2% more likely to have documented ACP than similar patients not diagnosed with cancer. We also found significant heterogeneity. For instance, a one standard deviation increase in in-person outpatient visits was associated with an on-average increase in the CATE estimate (by 6.1 percentage points), while a one standard deviation increase in hospital admissions, inpatient days, and surgical duration in minutes was associated with an on-average decrease in the CATE estimate (by -1.3, -5.6, and -0.5 percentage points, respectively).

Conclusions: The proposed causal ML framework enables estimation of the effect of a diagnosis on receiving a relevant health care service. In the cancer diagnosis context, it can identify patient groups less likely to receive ACP, thus informing service allocation strategies.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Health Services Research 医学-卫生保健

CiteScore

4.80

自引率

5.90%

发文量

193

审稿时长

4-8 weeks

期刊介绍： Health Services Research (HSR) is a peer-reviewed scholarly journal that provides researchers and public and private policymakers with the latest research findings, methods, and concepts related to the financing, organization, delivery, evaluation, and outcomes of health services. Rated as one of the top journals in the fields of health policy and services and health care administration, HSR publishes outstanding articles reporting the findings of original investigations that expand knowledge and understanding of the wide-ranging field of health care and that will help to improve the health of individuals and communities.