利用缺失数据的顺序多任务随机试验数据优化动态治疗方案。

IF 3.9 3区医学 Q1 HEALTH CARE SCIENCES & SERVICES

BMC Medical Research Methodology Pub Date : 2025-07-01 DOI:10.1186/s12874-025-02595-1

Jessica Xu, Anurika P De Silva, Katherine J Lee, Robert K Mahar, Julie A Simpson

{"title":"利用缺失数据的顺序多任务随机试验数据优化动态治疗方案。","authors":"Jessica Xu, Anurika P De Silva, Katherine J Lee, Robert K Mahar, Julie A Simpson","doi":"10.1186/s12874-025-02595-1","DOIUrl":null,"url":null,"abstract":"Dynamic treatment regimens are commonly used for patients with chronic or progressive medical conditions. Sequential multiple assignment randomised trials (SMARTs) are studies used to optimise dynamic treatment regimens by repeatedly randomising participants to treatments. Q-learning, a stage-wise regression-based method used to analyse SMARTs, uses backward induction to compare treatments administered as a sequence. Missing data is a common problem in randomised trials and can be complex in SMARTs given the sequential randomisation. Common methods for handling missing data such as complete case analysis (CCA) and multiple imputation (MI) have been widely explored in single-stage randomised trials, however, the only study that explored these methods in SMARTs did not consider Q-learning. We evaluated the performance of CCA and MI on the estimation of Q-learning parameters in a SMART. We simulated 1000 datasets of 500 participants, based on a SMART with two stages, under different missing data scenarios defined by missing directed acyclic graphs (m-DAGS), percentages of missing data (20%, 40%), stage 2 treatment effects, and strengths of association with missingness in stage 2 treatment, patient history and outcome. We also compared CCA and MI using retrospective data from a longitudinal smoking cessation SMART. When there was no treatment effect at either stage 1 or 2, we observed close to zero absolute bias in the stage 1 treatment effect and similar empirical standard errors for CCA and MI under all missing data scenarios. When all participants had a relatively large stage 2 treatment effect, we observed minimal bias from both CCA and MI, with slightly greater bias for MI. Empirical standard errors were higher for MI compared to CCA under all scenarios except for when data were missing not dependent on any variables. When the stage 2 treatment effect varied between participants and data were missing dependent on other variables (for example, stage 1 responder status missing dependent on stage 1 treatment and baseline variables), we observed greater bias for MI when estimating the stage 1 treatment effect, which increased with the percentage missingness, while the bias for CCA remained minimal. Resulting empirical standard errors were lower or similar for MI compared to CCA under all missing data scenarios. Results showed that for a two-stage SMART, MI failed to capture the differences between treatment effects when the stage 2 treatment effect varied between participants.","PeriodicalId":9114,"journal":{"name":"BMC Medical Research Methodology","volume":"25 1","pages":"162"},"PeriodicalIF":3.9000,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12211643/pdf/","citationCount":"0","resultStr":"{\"title\":\"Optimising dynamic treatment regimens using sequential multiple assignment randomised trials data with missing data.\",\"authors\":\"Jessica Xu, Anurika P De Silva, Katherine J Lee, Robert K Mahar, Julie A Simpson\",\"doi\":\"10.1186/s12874-025-02595-1\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Dynamic treatment regimens are commonly used for patients with chronic or progressive medical conditions. Sequential multiple assignment randomised trials (SMARTs) are studies used to optimise dynamic treatment regimens by repeatedly randomising participants to treatments. Q-learning, a stage-wise regression-based method used to analyse SMARTs, uses backward induction to compare treatments administered as a sequence. Missing data is a common problem in randomised trials and can be complex in SMARTs given the sequential randomisation. Common methods for handling missing data such as complete case analysis (CCA) and multiple imputation (MI) have been widely explored in single-stage randomised trials, however, the only study that explored these methods in SMARTs did not consider Q-learning. We evaluated the performance of CCA and MI on the estimation of Q-learning parameters in a SMART. We simulated 1000 datasets of 500 participants, based on a SMART with two stages, under different missing data scenarios defined by missing directed acyclic graphs (m-DAGS), percentages of missing data (20%, 40%), stage 2 treatment effects, and strengths of association with missingness in stage 2 treatment, patient history and outcome. We also compared CCA and MI using retrospective data from a longitudinal smoking cessation SMART. When there was no treatment effect at either stage 1 or 2, we observed close to zero absolute bias in the stage 1 treatment effect and similar empirical standard errors for CCA and MI under all missing data scenarios. When all participants had a relatively large stage 2 treatment effect, we observed minimal bias from both CCA and MI, with slightly greater bias for MI. Empirical standard errors were higher for MI compared to CCA under all scenarios except for when data were missing not dependent on any variables. When the stage 2 treatment effect varied between participants and data were missing dependent on other variables (for example, stage 1 responder status missing dependent on stage 1 treatment and baseline variables), we observed greater bias for MI when estimating the stage 1 treatment effect, which increased with the percentage missingness, while the bias for CCA remained minimal. Resulting empirical standard errors were lower or similar for MI compared to CCA under all missing data scenarios. Results showed that for a two-stage SMART, MI failed to capture the differences between treatment effects when the stage 2 treatment effect varied between participants.\",\"PeriodicalId\":9114,\"journal\":{\"name\":\"BMC Medical Research Methodology\",\"volume\":\"25 1\",\"pages\":\"162\"},\"PeriodicalIF\":3.9000,\"publicationDate\":\"2025-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12211643/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"BMC Medical Research Methodology\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1186/s12874-025-02595-1\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"HEALTH CARE SCIENCES & SERVICES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Medical Research Methodology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12874-025-02595-1","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}

引用次数: 0

摘要

动态治疗方案通常用于慢性或进行性疾病的患者。顺序多任务随机试验（SMARTs）是通过反复随机分配参与者进行治疗来优化动态治疗方案的研究。Q-learning是一种基于阶段回归的方法，用于分析smart，它使用逆向归纳法来比较作为一个序列进行的治疗。丢失数据是随机试验中常见的问题，并且在SMARTs中由于顺序随机化而变得复杂。处理缺失数据的常用方法，如完整病例分析（CCA）和多重imputation (MI)，已经在单阶段随机试验中被广泛探索，然而，唯一在smart中探索这些方法的研究没有考虑Q-learning。我们评估了CCA和MI在SMART中估计q -学习参数的性能。我们基于两个阶段的SMART，模拟了500名参与者的1000个数据集，在不同的缺失数据情景下，由缺失有向无环图（m-DAGS）、缺失数据百分比（20%、40%）、2期治疗效果以及与2期治疗、患者病史和结果缺失的关联强度定义。我们还使用来自纵向戒烟SMART的回顾性数据比较了CCA和MI。当在第一阶段或第二阶段没有治疗效果时，我们观察到第一阶段治疗效果的绝对偏倚接近于零，在所有缺失数据情景下，CCA和MI的经验标准误差相似。当所有参与者都有相对较大的第二阶段治疗效果时，我们观察到CCA和MI的偏差最小，MI的偏差略大。除了数据缺失不依赖于任何变量的情况外，在所有情况下，MI的经验标准误差都高于CCA。当2期治疗效果在参与者之间变化，并且数据缺失依赖于其他变量（例如，1期应答者状态缺失依赖于1期治疗和基线变量）时，我们观察到在估计1期治疗效果时，MI的偏倚更大，随着缺失百分比的增加而增加，而CCA的偏倚仍然很小。在所有缺失数据情景下，与CCA相比，MI的经验标准误差更低或相似。结果显示，对于两阶段SMART，当参与者之间的第二阶段治疗效果不同时，MI未能捕捉到治疗效果之间的差异。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Optimising dynamic treatment regimens using sequential multiple assignment randomised trials data with missing data.

Dynamic treatment regimens are commonly used for patients with chronic or progressive medical conditions. Sequential multiple assignment randomised trials (SMARTs) are studies used to optimise dynamic treatment regimens by repeatedly randomising participants to treatments. Q-learning, a stage-wise regression-based method used to analyse SMARTs, uses backward induction to compare treatments administered as a sequence. Missing data is a common problem in randomised trials and can be complex in SMARTs given the sequential randomisation. Common methods for handling missing data such as complete case analysis (CCA) and multiple imputation (MI) have been widely explored in single-stage randomised trials, however, the only study that explored these methods in SMARTs did not consider Q-learning. We evaluated the performance of CCA and MI on the estimation of Q-learning parameters in a SMART. We simulated 1000 datasets of 500 participants, based on a SMART with two stages, under different missing data scenarios defined by missing directed acyclic graphs (m-DAGS), percentages of missing data (20%, 40%), stage 2 treatment effects, and strengths of association with missingness in stage 2 treatment, patient history and outcome. We also compared CCA and MI using retrospective data from a longitudinal smoking cessation SMART. When there was no treatment effect at either stage 1 or 2, we observed close to zero absolute bias in the stage 1 treatment effect and similar empirical standard errors for CCA and MI under all missing data scenarios. When all participants had a relatively large stage 2 treatment effect, we observed minimal bias from both CCA and MI, with slightly greater bias for MI. Empirical standard errors were higher for MI compared to CCA under all scenarios except for when data were missing not dependent on any variables. When the stage 2 treatment effect varied between participants and data were missing dependent on other variables (for example, stage 1 responder status missing dependent on stage 1 treatment and baseline variables), we observed greater bias for MI when estimating the stage 1 treatment effect, which increased with the percentage missingness, while the bias for CCA remained minimal. Resulting empirical standard errors were lower or similar for MI compared to CCA under all missing data scenarios. Results showed that for a two-stage SMART, MI failed to capture the differences between treatment effects when the stage 2 treatment effect varied between participants.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

BMC Medical Research Methodology 医学-卫生保健

CiteScore

6.50

自引率

2.50%

发文量

298

审稿时长

3-8 weeks

期刊介绍： BMC Medical Research Methodology is an open access journal publishing original peer-reviewed research articles in methodological approaches to healthcare research. Articles on the methodology of epidemiological research, clinical trials and meta-analysis/systematic review are particularly encouraged, as are empirical studies of the associations between choice of methodology and study outcomes. BMC Medical Research Methodology does not aim to publish articles describing scientific methods or techniques: these should be directed to the BMC journal covering the relevant biomedical subject area.