Machine Learning Risk Stratification for Older Breast Cancer Survivors: Clinical Care Implications.

IF 3.2 2区 医学 Q2 HEALTH CARE SCIENCES & SERVICES
Stephanie B Wheeler, Jason Rotter, Lisa P Spees, Caitlin B Biddell, Justin G Trogdon, Catherine M Alfano, Deborah K Mayer, Michaela A Dinan, Larissa Nekhlyudov, Sarah A Birken
{"title":"Machine Learning Risk Stratification for Older Breast Cancer Survivors: Clinical Care Implications.","authors":"Stephanie B Wheeler, Jason Rotter, Lisa P Spees, Caitlin B Biddell, Justin G Trogdon, Catherine M Alfano, Deborah K Mayer, Michaela A Dinan, Larissa Nekhlyudov, Sarah A Birken","doi":"10.1111/1475-6773.70005","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>To develop and validate a clinical risk prediction algorithm to identify breast cancer survivors at high risk for adverse outcomes.</p><p><strong>Study setting and design: </strong>Our national retrospective analysis used cross-validated random forest machine learning models to separately predict the risk of all-cause death, cancer-specific death, claims-derived risk of recurrence, and other adverse health outcomes within 3 and 5 years following treatment completion.</p><p><strong>Data sources and analytic sample: </strong>Our study used the Surveillance and Epidemiology End Results (SEER) registry-Consumer Assessment of Healthcare Providers and Systems (CAHPS) survey (SEER-CAHPS) linked data for survivors diagnosed between 2003 and 2011, with follow-up claims data to 2017.</p><p><strong>Principal findings: </strong>Within the 3-year follow-up period, 372/4516 survivors (mean age 75.1; 81.7% white) in the primary cohort (8.2%) died, 111 from cancer (2.5%), 665 (14.7%) experienced cancer recurrence, and 488 (10.8%) were hospitalized for adverse health outcomes. The algorithm's prediction resulted in 91.9% out-of-sample accuracy (the percent of observations classified correctly) and a 37.6% Cohen's Kappa (i.e., improvement over an uninformed model). Out-of-sample accuracy was 97.5% (44% improvement) for predicting cancer-specific death, 85% (26% improvement) for recurrence, and 89% (28% improvement) for other adverse health outcomes. Important predictors across outcomes included geographic region, age, frailty, comorbidity, time since diagnosis, and out-of-pocket cost responsibility.</p><p><strong>Conclusions: </strong>Machine learning models accurately predicted relevant adverse survivorship outcomes, driven primarily by non-cancer specific factors. Breast cancer survivors at high risk for adverse outcomes may benefit from more intensive care, whereas those at low risk may be more appropriately managed by primary care.</p>","PeriodicalId":55065,"journal":{"name":"Health Services Research","volume":" ","pages":"e70005"},"PeriodicalIF":3.2000,"publicationDate":"2025-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Health Services Research","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1111/1475-6773.70005","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}
引用次数: 0

Abstract

Objective: To develop and validate a clinical risk prediction algorithm to identify breast cancer survivors at high risk for adverse outcomes.

Study setting and design: Our national retrospective analysis used cross-validated random forest machine learning models to separately predict the risk of all-cause death, cancer-specific death, claims-derived risk of recurrence, and other adverse health outcomes within 3 and 5 years following treatment completion.

Data sources and analytic sample: Our study used the Surveillance and Epidemiology End Results (SEER) registry-Consumer Assessment of Healthcare Providers and Systems (CAHPS) survey (SEER-CAHPS) linked data for survivors diagnosed between 2003 and 2011, with follow-up claims data to 2017.

Principal findings: Within the 3-year follow-up period, 372/4516 survivors (mean age 75.1; 81.7% white) in the primary cohort (8.2%) died, 111 from cancer (2.5%), 665 (14.7%) experienced cancer recurrence, and 488 (10.8%) were hospitalized for adverse health outcomes. The algorithm's prediction resulted in 91.9% out-of-sample accuracy (the percent of observations classified correctly) and a 37.6% Cohen's Kappa (i.e., improvement over an uninformed model). Out-of-sample accuracy was 97.5% (44% improvement) for predicting cancer-specific death, 85% (26% improvement) for recurrence, and 89% (28% improvement) for other adverse health outcomes. Important predictors across outcomes included geographic region, age, frailty, comorbidity, time since diagnosis, and out-of-pocket cost responsibility.

Conclusions: Machine learning models accurately predicted relevant adverse survivorship outcomes, driven primarily by non-cancer specific factors. Breast cancer survivors at high risk for adverse outcomes may benefit from more intensive care, whereas those at low risk may be more appropriately managed by primary care.

老年乳腺癌幸存者的机器学习风险分层:临床护理意义。
目的:开发并验证一种临床风险预测算法,以识别高危不良结局的乳腺癌幸存者。研究设置和设计:我们的国家回顾性分析使用交叉验证的随机森林机器学习模型,分别预测治疗完成后3年和5年内的全因死亡风险、癌症特异性死亡风险、索赔衍生的复发风险和其他不良健康结果。数据来源和分析样本:我们的研究使用了监测和流行病学最终结果(SEER)登记-医疗保健提供者和系统的消费者评估(CAHPS)调查(SEER-CAHPS)与2003年至2011年诊断的幸存者相关的数据,以及到2017年的随访索赔数据。主要发现:在3年随访期间,372/4516名幸存者(平均年龄75.1岁;81.7%白人)死亡(8.2%),111人死于癌症(2.5%),665人(14.7%)经历癌症复发,488人(10.8%)因不良健康结果住院。该算法的预测结果达到了91.9%的样本外准确率(正确分类的观测值百分比)和37.6%的科恩Kappa(即比不知情的模型有所改进)。预测癌症特异性死亡的样本外准确度为97.5%(提高44%),预测复发的样本外准确度为85%(提高26%),预测其他不良健康结局的样本外准确度为89%(提高28%)。结果的重要预测因素包括地理区域、年龄、虚弱、合并症、诊断后的时间和自付费用。结论:机器学习模型准确地预测了相关的不良生存结果,主要由非癌症特异性因素驱动。不良后果高风险的乳腺癌幸存者可能受益于更多的重症监护,而低风险的乳腺癌幸存者可能更适合由初级保健管理。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Health Services Research
Health Services Research 医学-卫生保健
CiteScore
4.80
自引率
5.90%
发文量
193
审稿时长
4-8 weeks
期刊介绍: Health Services Research (HSR) is a peer-reviewed scholarly journal that provides researchers and public and private policymakers with the latest research findings, methods, and concepts related to the financing, organization, delivery, evaluation, and outcomes of health services. Rated as one of the top journals in the fields of health policy and services and health care administration, HSR publishes outstanding articles reporting the findings of original investigations that expand knowledge and understanding of the wide-ranging field of health care and that will help to improve the health of individuals and communities.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信