使用可解释的机器学习方法确定老年霍奇金淋巴瘤幸存者的医疗保健费用驱动因素。

IF 2.3 4区 医学 Q2 HEALTH CARE SCIENCES & SERVICES
Zasim Azhar Siddiqui, Yves Paul Mbous, Sabina Nduaguba, Traci LeMasters, Virginia G Scott, Jay S Patel, Usha Sambamoorthi
{"title":"使用可解释的机器学习方法确定老年霍奇金淋巴瘤幸存者的医疗保健费用驱动因素。","authors":"Zasim Azhar Siddiqui, Yves Paul Mbous, Sabina Nduaguba, Traci LeMasters, Virginia G Scott, Jay S Patel, Usha Sambamoorthi","doi":"10.18553/jmcp.2025.31.4.406","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>The cost of health care for patients with Hodgkin lymphoma (HL) is projected to rise, making it essential to understand expenditure drivers across different demographics, including the older adult population. Although older HL patients constitute a significant number of HL patients, the literature on health care expenditures in older HL patients is lacking. Predictive capabilities of machine learning (ML) methods enhance our ability to leverage a data-driven approach, which helps identify key predictors of expenditures and strategically plan future expenditures.</p><p><strong>Objective: </strong>To determine the leading predictors of health care expenditures among older HL survivors across prediagnosis, treatment, and posttreatment phases of care.</p><p><strong>Methods: </strong>The study uses a retrospective research design to identify the incident cases of HL diagnosed between 2009 and 2017 using Surveillance, Epidemiology, and End Results-Medicare data. Three phases of cancer care (prediagnosis, treatment, and posttreatment) were indexed around the diagnosis date, with each phase divided into 12 months of baseline and 12 months of follow-up. ML methods, including XGBoost, Random Forest, and Cross-Validated linear regressions, were used to identify the best regression model for predicting Medicare and out-of-pocket (OOP) health care expenditures. Interpretable ML SHapley Additive exPlanations method was used to identify the leading predictors of Medicare and OOP health care expenditures in each phase.</p><p><strong>Results: </strong>The study analyzed 1,242 patients in the prediagnosis phase, 902 in the treatment phase, and 873 in the posttreatment phase. XGBoost regression outperformed Random Forest and Cross-Validated linear regression models with overall performance in predicting Medicare expenditures, with R-squared (root mean square error) values of 0.42 (1.39), 0.43 (0.56), and 0.46 (0.90) across the 3 phases of care, respectively. Interpretable ML methods highlighted baseline expenditures, number of prescription medications, and cardiac dysrhythmia as the leading predictors for Medicare and OOP expenditures in the prediagnosis phase. Chemotherapy and immunotherapy and surgical treatment and immunotherapy were the leading predictors of expenditures in the treatment and posttreatment phases, respectively.</p><p><strong>Conclusions: </strong>As ML applications increase in predicting health care expenditure, researchers should consider implementing models in different phases of care to identify the changes in the predictors. Leading predictors of health care expenditures can be targeted for informed policy development to address financial hardship in HL survivors.</p>","PeriodicalId":16170,"journal":{"name":"Journal of managed care & specialty pharmacy","volume":"31 4","pages":"406-420"},"PeriodicalIF":2.3000,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11953870/pdf/","citationCount":"0","resultStr":"{\"title\":\"Determining health care cost drivers in older Hodgkin lymphoma survivors using interpretable machine learning methods.\",\"authors\":\"Zasim Azhar Siddiqui, Yves Paul Mbous, Sabina Nduaguba, Traci LeMasters, Virginia G Scott, Jay S Patel, Usha Sambamoorthi\",\"doi\":\"10.18553/jmcp.2025.31.4.406\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>The cost of health care for patients with Hodgkin lymphoma (HL) is projected to rise, making it essential to understand expenditure drivers across different demographics, including the older adult population. Although older HL patients constitute a significant number of HL patients, the literature on health care expenditures in older HL patients is lacking. Predictive capabilities of machine learning (ML) methods enhance our ability to leverage a data-driven approach, which helps identify key predictors of expenditures and strategically plan future expenditures.</p><p><strong>Objective: </strong>To determine the leading predictors of health care expenditures among older HL survivors across prediagnosis, treatment, and posttreatment phases of care.</p><p><strong>Methods: </strong>The study uses a retrospective research design to identify the incident cases of HL diagnosed between 2009 and 2017 using Surveillance, Epidemiology, and End Results-Medicare data. Three phases of cancer care (prediagnosis, treatment, and posttreatment) were indexed around the diagnosis date, with each phase divided into 12 months of baseline and 12 months of follow-up. ML methods, including XGBoost, Random Forest, and Cross-Validated linear regressions, were used to identify the best regression model for predicting Medicare and out-of-pocket (OOP) health care expenditures. Interpretable ML SHapley Additive exPlanations method was used to identify the leading predictors of Medicare and OOP health care expenditures in each phase.</p><p><strong>Results: </strong>The study analyzed 1,242 patients in the prediagnosis phase, 902 in the treatment phase, and 873 in the posttreatment phase. XGBoost regression outperformed Random Forest and Cross-Validated linear regression models with overall performance in predicting Medicare expenditures, with R-squared (root mean square error) values of 0.42 (1.39), 0.43 (0.56), and 0.46 (0.90) across the 3 phases of care, respectively. Interpretable ML methods highlighted baseline expenditures, number of prescription medications, and cardiac dysrhythmia as the leading predictors for Medicare and OOP expenditures in the prediagnosis phase. Chemotherapy and immunotherapy and surgical treatment and immunotherapy were the leading predictors of expenditures in the treatment and posttreatment phases, respectively.</p><p><strong>Conclusions: </strong>As ML applications increase in predicting health care expenditure, researchers should consider implementing models in different phases of care to identify the changes in the predictors. Leading predictors of health care expenditures can be targeted for informed policy development to address financial hardship in HL survivors.</p>\",\"PeriodicalId\":16170,\"journal\":{\"name\":\"Journal of managed care & specialty pharmacy\",\"volume\":\"31 4\",\"pages\":\"406-420\"},\"PeriodicalIF\":2.3000,\"publicationDate\":\"2025-04-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11953870/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of managed care & specialty pharmacy\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.18553/jmcp.2025.31.4.406\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"HEALTH CARE SCIENCES & SERVICES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of managed care & specialty pharmacy","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.18553/jmcp.2025.31.4.406","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}
引用次数: 0

摘要

背景:霍奇金淋巴瘤(HL)患者的医疗保健费用预计将上升,因此了解不同人口统计数据(包括老年人口)的支出驱动因素至关重要。尽管老年HL患者占HL患者的很大一部分,但缺乏老年HL患者医疗支出的文献。机器学习(ML)方法的预测能力增强了我们利用数据驱动方法的能力,这有助于确定支出的关键预测因素并战略性地规划未来支出。目的:确定老年HL幸存者在诊断前、治疗和治疗后护理阶段的医疗保健支出的主要预测因素。方法:本研究采用回顾性研究设计,利用监测、流行病学和最终结果-医疗保险数据,确定2009年至2017年诊断的HL病例。癌症护理的三个阶段(诊断前、治疗和治疗后)围绕诊断日期进行索引,每个阶段分为12个月的基线和12个月的随访。ML方法,包括XGBoost、随机森林和交叉验证线性回归,用于确定预测医疗保险和自付医疗费用(OOP)的最佳回归模型。采用可解释的ML SHapley加性解释法确定各阶段医疗保险和面向对象医疗保健支出的主要预测因素。结果:该研究分析了1242例诊断前期患者,902例治疗期患者,873例治疗后期患者。XGBoost回归在预测医疗保险支出方面的总体表现优于随机森林和交叉验证线性回归模型,在三个护理阶段的r平方(均方根误差)值分别为0.42(1.39)、0.43(0.56)和0.46(0.90)。可解释的ML方法强调了基线支出、处方药数量和心律失常作为诊断前阶段医疗保险和OOP支出的主要预测因素。化疗和免疫治疗以及手术治疗和免疫治疗分别是治疗和治疗后阶段支出的主要预测因素。结论:随着机器学习在预测医疗支出方面的应用越来越多,研究人员应该考虑在不同的医疗阶段实施模型,以确定预测因子的变化。卫生保健支出的主要预测指标可作为制定明智政策的目标,以解决HL幸存者的经济困难。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Determining health care cost drivers in older Hodgkin lymphoma survivors using interpretable machine learning methods.

Background: The cost of health care for patients with Hodgkin lymphoma (HL) is projected to rise, making it essential to understand expenditure drivers across different demographics, including the older adult population. Although older HL patients constitute a significant number of HL patients, the literature on health care expenditures in older HL patients is lacking. Predictive capabilities of machine learning (ML) methods enhance our ability to leverage a data-driven approach, which helps identify key predictors of expenditures and strategically plan future expenditures.

Objective: To determine the leading predictors of health care expenditures among older HL survivors across prediagnosis, treatment, and posttreatment phases of care.

Methods: The study uses a retrospective research design to identify the incident cases of HL diagnosed between 2009 and 2017 using Surveillance, Epidemiology, and End Results-Medicare data. Three phases of cancer care (prediagnosis, treatment, and posttreatment) were indexed around the diagnosis date, with each phase divided into 12 months of baseline and 12 months of follow-up. ML methods, including XGBoost, Random Forest, and Cross-Validated linear regressions, were used to identify the best regression model for predicting Medicare and out-of-pocket (OOP) health care expenditures. Interpretable ML SHapley Additive exPlanations method was used to identify the leading predictors of Medicare and OOP health care expenditures in each phase.

Results: The study analyzed 1,242 patients in the prediagnosis phase, 902 in the treatment phase, and 873 in the posttreatment phase. XGBoost regression outperformed Random Forest and Cross-Validated linear regression models with overall performance in predicting Medicare expenditures, with R-squared (root mean square error) values of 0.42 (1.39), 0.43 (0.56), and 0.46 (0.90) across the 3 phases of care, respectively. Interpretable ML methods highlighted baseline expenditures, number of prescription medications, and cardiac dysrhythmia as the leading predictors for Medicare and OOP expenditures in the prediagnosis phase. Chemotherapy and immunotherapy and surgical treatment and immunotherapy were the leading predictors of expenditures in the treatment and posttreatment phases, respectively.

Conclusions: As ML applications increase in predicting health care expenditure, researchers should consider implementing models in different phases of care to identify the changes in the predictors. Leading predictors of health care expenditures can be targeted for informed policy development to address financial hardship in HL survivors.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Journal of managed care & specialty pharmacy
Journal of managed care & specialty pharmacy Health Professions-Pharmacy
CiteScore
3.50
自引率
4.80%
发文量
131
期刊介绍: JMCP welcomes research studies conducted outside of the United States that are relevant to our readership. Our audience is primarily concerned with designing policies of formulary coverage, health benefit design, and pharmaceutical programs that are based on evidence from large populations of people. Studies of pharmacist interventions conducted outside the United States that have already been extensively studied within the United States and studies of small sample sizes in non-managed care environments outside of the United States (e.g., hospitals or community pharmacies) are generally of low interest to our readership. However, studies of health outcomes and costs assessed in large populations that provide evidence for formulary coverage, health benefit design, and pharmaceutical programs are of high interest to JMCP’s readership.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信