Aaron Baird, Yichen Cheng, Jason Lesandrini, Yusen Xia
{"title":"用于估计癌症诊断对接受预先护理计划的影响的因果机器学习框架。","authors":"Aaron Baird, Yichen Cheng, Jason Lesandrini, Yusen Xia","doi":"10.1111/1475-6773.70039","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>Develop a causal machine learning (causal ML) framework for estimating how a diagnosis (cancer in this study) affects the likelihood of receiving a specific health care service (advance care planning in this study) and associated heterogeneity.</p><p><strong>Study setting and design: </strong>Our proposed framework leverages the causal forest method, combined with a population-weighted resampling and averaging over estimations strategy, to estimate average treatment effects (ATEs) and conditional average treatment effects (CATEs). Post hoc, we used best linear projections to identify covariates associated with variation in the CATEs. We illustrate the framework by applying it to a stratified random sample of patients, where the strata are defined by the crosstabulation of cancer diagnosis (diagnosed vs. not diagnosed) and ACP receipt (documented vs. not documented).</p><p><strong>Data sources and analytic sample: </strong>We extracted deidentified patient data from October 2019 to October 2024 (n = 87,772) with explanatory variables in three categories: demographics, morbidity, and health care system utilization.</p><p><strong>Principal findings: </strong>In application of the causal ML framework, we found that patients diagnosed with cancer at this health care system to be at least 17.2% more likely to have documented ACP than similar patients not diagnosed with cancer. We also found significant heterogeneity. For instance, a one standard deviation increase in in-person outpatient visits was associated with an on-average increase in the CATE estimate (by 6.1 percentage points), while a one standard deviation increase in hospital admissions, inpatient days, and surgical duration in minutes was associated with an on-average decrease in the CATE estimate (by -1.3, -5.6, and -0.5 percentage points, respectively).</p><p><strong>Conclusions: </strong>The proposed causal ML framework enables estimation of the effect of a diagnosis on receiving a relevant health care service. In the cancer diagnosis context, it can identify patient groups less likely to receive ACP, thus informing service allocation strategies.</p>","PeriodicalId":55065,"journal":{"name":"Health Services Research","volume":" ","pages":"e70039"},"PeriodicalIF":3.2000,"publicationDate":"2025-09-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Causal Machine Learning Framework for Estimating the Impact of Cancer Diagnosis on Receipt of Advance Care Planning.\",\"authors\":\"Aaron Baird, Yichen Cheng, Jason Lesandrini, Yusen Xia\",\"doi\":\"10.1111/1475-6773.70039\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Objective: </strong>Develop a causal machine learning (causal ML) framework for estimating how a diagnosis (cancer in this study) affects the likelihood of receiving a specific health care service (advance care planning in this study) and associated heterogeneity.</p><p><strong>Study setting and design: </strong>Our proposed framework leverages the causal forest method, combined with a population-weighted resampling and averaging over estimations strategy, to estimate average treatment effects (ATEs) and conditional average treatment effects (CATEs). Post hoc, we used best linear projections to identify covariates associated with variation in the CATEs. We illustrate the framework by applying it to a stratified random sample of patients, where the strata are defined by the crosstabulation of cancer diagnosis (diagnosed vs. not diagnosed) and ACP receipt (documented vs. not documented).</p><p><strong>Data sources and analytic sample: </strong>We extracted deidentified patient data from October 2019 to October 2024 (n = 87,772) with explanatory variables in three categories: demographics, morbidity, and health care system utilization.</p><p><strong>Principal findings: </strong>In application of the causal ML framework, we found that patients diagnosed with cancer at this health care system to be at least 17.2% more likely to have documented ACP than similar patients not diagnosed with cancer. We also found significant heterogeneity. For instance, a one standard deviation increase in in-person outpatient visits was associated with an on-average increase in the CATE estimate (by 6.1 percentage points), while a one standard deviation increase in hospital admissions, inpatient days, and surgical duration in minutes was associated with an on-average decrease in the CATE estimate (by -1.3, -5.6, and -0.5 percentage points, respectively).</p><p><strong>Conclusions: </strong>The proposed causal ML framework enables estimation of the effect of a diagnosis on receiving a relevant health care service. In the cancer diagnosis context, it can identify patient groups less likely to receive ACP, thus informing service allocation strategies.</p>\",\"PeriodicalId\":55065,\"journal\":{\"name\":\"Health Services Research\",\"volume\":\" \",\"pages\":\"e70039\"},\"PeriodicalIF\":3.2000,\"publicationDate\":\"2025-09-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Health Services Research\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1111/1475-6773.70039\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"HEALTH CARE SCIENCES & SERVICES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Health Services Research","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1111/1475-6773.70039","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}
A Causal Machine Learning Framework for Estimating the Impact of Cancer Diagnosis on Receipt of Advance Care Planning.
Objective: Develop a causal machine learning (causal ML) framework for estimating how a diagnosis (cancer in this study) affects the likelihood of receiving a specific health care service (advance care planning in this study) and associated heterogeneity.
Study setting and design: Our proposed framework leverages the causal forest method, combined with a population-weighted resampling and averaging over estimations strategy, to estimate average treatment effects (ATEs) and conditional average treatment effects (CATEs). Post hoc, we used best linear projections to identify covariates associated with variation in the CATEs. We illustrate the framework by applying it to a stratified random sample of patients, where the strata are defined by the crosstabulation of cancer diagnosis (diagnosed vs. not diagnosed) and ACP receipt (documented vs. not documented).
Data sources and analytic sample: We extracted deidentified patient data from October 2019 to October 2024 (n = 87,772) with explanatory variables in three categories: demographics, morbidity, and health care system utilization.
Principal findings: In application of the causal ML framework, we found that patients diagnosed with cancer at this health care system to be at least 17.2% more likely to have documented ACP than similar patients not diagnosed with cancer. We also found significant heterogeneity. For instance, a one standard deviation increase in in-person outpatient visits was associated with an on-average increase in the CATE estimate (by 6.1 percentage points), while a one standard deviation increase in hospital admissions, inpatient days, and surgical duration in minutes was associated with an on-average decrease in the CATE estimate (by -1.3, -5.6, and -0.5 percentage points, respectively).
Conclusions: The proposed causal ML framework enables estimation of the effect of a diagnosis on receiving a relevant health care service. In the cancer diagnosis context, it can identify patient groups less likely to receive ACP, thus informing service allocation strategies.
期刊介绍:
Health Services Research (HSR) is a peer-reviewed scholarly journal that provides researchers and public and private policymakers with the latest research findings, methods, and concepts related to the financing, organization, delivery, evaluation, and outcomes of health services. Rated as one of the top journals in the fields of health policy and services and health care administration, HSR publishes outstanding articles reporting the findings of original investigations that expand knowledge and understanding of the wide-ranging field of health care and that will help to improve the health of individuals and communities.