{"title":"Q-learning via deep learning-based Buckley-James method for non-linear censored data.","authors":"Jeongjin Lee, Jong-Min Kim","doi":"10.1007/s10985-026-09707-z","DOIUrl":null,"url":null,"abstract":"<p><p>In healthcare, personalized treatment strategies are vital for improving patient outcomes, especially under right-censored survival data. We propose Dynamic Deep Buckley-James Q-Learning, a novel counterfactual Q-learning algorithm that integrates deep learning with the Buckley-James method to simultaneously address censoring and nonlinear modeling challenges. By explicitly capturing complex, nonlinear interactions between covariates and treatment effects, the algorithm robustly estimates optimal dynamic treatment regimes. Leveraging a counterfactual framework, we define and estimate potential survival outcomes under hypothetical treatment sequences, enabling unbiased Q-function estimation even in the presence of time-dependent covariates and right censoring. The algorithm maximizes the expected imputed survival reward under these counterfactual scenarios. Simulation studies and real-world data analysis demonstrate its superior performance in predictive accuracy and treatment decision-making, offering a powerful framework for individualized care in complex clinical settings.</p>","PeriodicalId":49908,"journal":{"name":"Lifetime Data Analysis","volume":"32 2","pages":""},"PeriodicalIF":1.0000,"publicationDate":"2026-04-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Lifetime Data Analysis","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1007/s10985-026-09707-z","RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MATHEMATICS, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
In healthcare, personalized treatment strategies are vital for improving patient outcomes, especially under right-censored survival data. We propose Dynamic Deep Buckley-James Q-Learning, a novel counterfactual Q-learning algorithm that integrates deep learning with the Buckley-James method to simultaneously address censoring and nonlinear modeling challenges. By explicitly capturing complex, nonlinear interactions between covariates and treatment effects, the algorithm robustly estimates optimal dynamic treatment regimes. Leveraging a counterfactual framework, we define and estimate potential survival outcomes under hypothetical treatment sequences, enabling unbiased Q-function estimation even in the presence of time-dependent covariates and right censoring. The algorithm maximizes the expected imputed survival reward under these counterfactual scenarios. Simulation studies and real-world data analysis demonstrate its superior performance in predictive accuracy and treatment decision-making, offering a powerful framework for individualized care in complex clinical settings.
期刊介绍:
The objective of Lifetime Data Analysis is to advance and promote statistical science in the various applied fields that deal with lifetime data, including: Actuarial Science – Economics – Engineering Sciences – Environmental Sciences – Management Science – Medicine – Operations Research – Public Health – Social and Behavioral Sciences.