Global and Episode-Specific Prediction of Recurrent Events Using Longitudinal Health Informatics Data.

IF 3 1区 数学 Q1 STATISTICS & PROBABILITY
Yifei Sun, Sy Han Chiou, Chiung-Yu Huang
{"title":"Global and Episode-Specific Prediction of Recurrent Events Using Longitudinal Health Informatics Data.","authors":"Yifei Sun, Sy Han Chiou, Chiung-Yu Huang","doi":"10.1080/01621459.2025.2497569","DOIUrl":null,"url":null,"abstract":"<p><p>Accurate prediction of recurrent clinical events is crucial for effective management of chronic conditions such as cancer and cardiovascular disease. In recent years, longitudinal health informatics databases, which routinely collect data on repeated clinical events, have been increasingly utilized to construct risk prediction models. We introduce a novel nonparametric framework to predict recurrent events on a gap time scale using survival tree ensembles. Our framework incorporates two predictive modeling strategies: episode-specific model and global model. These models avoid strong assumptions on how future event risk depends on previous event history and other predictors, making them a promising alternative to Cox-type models. Additional complexities in tree-based prediction for recurrent events include induced informative censoring of gap times and inter-event correlations. We develop algorithms to address these issues through the use of inverse probability of censoring weighting and modified resampling procedures. Applied to SEER-Medicare data to predict repeated hospitalizations for breast cancer patients, our models showed superior performance. In particular, borrowing information across events via global models substantially improved prediction accuracy for later hospitalizations.</p>","PeriodicalId":17227,"journal":{"name":"Journal of the American Statistical Association","volume":" ","pages":""},"PeriodicalIF":3.0000,"publicationDate":"2025-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12363995/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of the American Statistical Association","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1080/01621459.2025.2497569","RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}
引用次数: 0

Abstract

Accurate prediction of recurrent clinical events is crucial for effective management of chronic conditions such as cancer and cardiovascular disease. In recent years, longitudinal health informatics databases, which routinely collect data on repeated clinical events, have been increasingly utilized to construct risk prediction models. We introduce a novel nonparametric framework to predict recurrent events on a gap time scale using survival tree ensembles. Our framework incorporates two predictive modeling strategies: episode-specific model and global model. These models avoid strong assumptions on how future event risk depends on previous event history and other predictors, making them a promising alternative to Cox-type models. Additional complexities in tree-based prediction for recurrent events include induced informative censoring of gap times and inter-event correlations. We develop algorithms to address these issues through the use of inverse probability of censoring weighting and modified resampling procedures. Applied to SEER-Medicare data to predict repeated hospitalizations for breast cancer patients, our models showed superior performance. In particular, borrowing information across events via global models substantially improved prediction accuracy for later hospitalizations.

利用纵向健康信息学数据对复发事件进行全球和特定事件预测。
准确预测复发性临床事件对于有效管理慢性疾病如癌症和心血管疾病至关重要。近年来,定期收集重复临床事件数据的纵向健康信息学数据库越来越多地用于构建风险预测模型。我们引入了一种新的非参数框架,利用生存树集成来预测间隔时间尺度上的重复事件。我们的框架包含两种预测建模策略:特定情景模型和全局模型。这些模型避免了对未来事件风险如何取决于先前事件历史和其他预测因素的强烈假设,使它们成为cox型模型的一个有希望的替代方案。基于树的重复事件预测的额外复杂性包括对间隔时间和事件间相关性的诱导信息审查。我们开发算法来解决这些问题,通过使用反概率的审查加权和修改的重采样程序。将SEER-Medicare数据应用于预测乳腺癌患者的重复住院,我们的模型显示出优越的性能。特别是,通过全球模型借用事件间的信息大大提高了对后期住院的预测准确性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
7.50
自引率
8.10%
发文量
168
审稿时长
12 months
期刊介绍: Established in 1888 and published quarterly in March, June, September, and December, the Journal of the American Statistical Association ( JASA ) has long been considered the premier journal of statistical science. Articles focus on statistical applications, theory, and methods in economic, social, physical, engineering, and health sciences. Important books contributing to statistical advancement are reviewed in JASA . JASA is indexed in Current Index to Statistics and MathSci Online and reviewed in Mathematical Reviews. JASA is abstracted by Access Company and is indexed and abstracted in the SRM Database of Social Research Methodology.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信