Jessica Xu, Anurika P De Silva, Katherine J Lee, Robert K Mahar, Julie A Simpson
{"title":"利用缺失数据的顺序多任务随机试验数据优化动态治疗方案。","authors":"Jessica Xu, Anurika P De Silva, Katherine J Lee, Robert K Mahar, Julie A Simpson","doi":"10.1186/s12874-025-02595-1","DOIUrl":null,"url":null,"abstract":"<p><p>Dynamic treatment regimens are commonly used for patients with chronic or progressive medical conditions. Sequential multiple assignment randomised trials (SMARTs) are studies used to optimise dynamic treatment regimens by repeatedly randomising participants to treatments. Q-learning, a stage-wise regression-based method used to analyse SMARTs, uses backward induction to compare treatments administered as a sequence. Missing data is a common problem in randomised trials and can be complex in SMARTs given the sequential randomisation. Common methods for handling missing data such as complete case analysis (CCA) and multiple imputation (MI) have been widely explored in single-stage randomised trials, however, the only study that explored these methods in SMARTs did not consider Q-learning. We evaluated the performance of CCA and MI on the estimation of Q-learning parameters in a SMART. We simulated 1000 datasets of 500 participants, based on a SMART with two stages, under different missing data scenarios defined by missing directed acyclic graphs (m-DAGS), percentages of missing data (20%, 40%), stage 2 treatment effects, and strengths of association with missingness in stage 2 treatment, patient history and outcome. We also compared CCA and MI using retrospective data from a longitudinal smoking cessation SMART. When there was no treatment effect at either stage 1 or 2, we observed close to zero absolute bias in the stage 1 treatment effect and similar empirical standard errors for CCA and MI under all missing data scenarios. When all participants had a relatively large stage 2 treatment effect, we observed minimal bias from both CCA and MI, with slightly greater bias for MI. Empirical standard errors were higher for MI compared to CCA under all scenarios except for when data were missing not dependent on any variables. When the stage 2 treatment effect varied between participants and data were missing dependent on other variables (for example, stage 1 responder status missing dependent on stage 1 treatment and baseline variables), we observed greater bias for MI when estimating the stage 1 treatment effect, which increased with the percentage missingness, while the bias for CCA remained minimal. Resulting empirical standard errors were lower or similar for MI compared to CCA under all missing data scenarios. Results showed that for a two-stage SMART, MI failed to capture the differences between treatment effects when the stage 2 treatment effect varied between participants.</p>","PeriodicalId":9114,"journal":{"name":"BMC Medical Research Methodology","volume":"25 1","pages":"162"},"PeriodicalIF":3.9000,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12211643/pdf/","citationCount":"0","resultStr":"{\"title\":\"Optimising dynamic treatment regimens using sequential multiple assignment randomised trials data with missing data.\",\"authors\":\"Jessica Xu, Anurika P De Silva, Katherine J Lee, Robert K Mahar, Julie A Simpson\",\"doi\":\"10.1186/s12874-025-02595-1\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Dynamic treatment regimens are commonly used for patients with chronic or progressive medical conditions. Sequential multiple assignment randomised trials (SMARTs) are studies used to optimise dynamic treatment regimens by repeatedly randomising participants to treatments. Q-learning, a stage-wise regression-based method used to analyse SMARTs, uses backward induction to compare treatments administered as a sequence. Missing data is a common problem in randomised trials and can be complex in SMARTs given the sequential randomisation. Common methods for handling missing data such as complete case analysis (CCA) and multiple imputation (MI) have been widely explored in single-stage randomised trials, however, the only study that explored these methods in SMARTs did not consider Q-learning. We evaluated the performance of CCA and MI on the estimation of Q-learning parameters in a SMART. We simulated 1000 datasets of 500 participants, based on a SMART with two stages, under different missing data scenarios defined by missing directed acyclic graphs (m-DAGS), percentages of missing data (20%, 40%), stage 2 treatment effects, and strengths of association with missingness in stage 2 treatment, patient history and outcome. We also compared CCA and MI using retrospective data from a longitudinal smoking cessation SMART. When there was no treatment effect at either stage 1 or 2, we observed close to zero absolute bias in the stage 1 treatment effect and similar empirical standard errors for CCA and MI under all missing data scenarios. When all participants had a relatively large stage 2 treatment effect, we observed minimal bias from both CCA and MI, with slightly greater bias for MI. Empirical standard errors were higher for MI compared to CCA under all scenarios except for when data were missing not dependent on any variables. When the stage 2 treatment effect varied between participants and data were missing dependent on other variables (for example, stage 1 responder status missing dependent on stage 1 treatment and baseline variables), we observed greater bias for MI when estimating the stage 1 treatment effect, which increased with the percentage missingness, while the bias for CCA remained minimal. Resulting empirical standard errors were lower or similar for MI compared to CCA under all missing data scenarios. Results showed that for a two-stage SMART, MI failed to capture the differences between treatment effects when the stage 2 treatment effect varied between participants.</p>\",\"PeriodicalId\":9114,\"journal\":{\"name\":\"BMC Medical Research Methodology\",\"volume\":\"25 1\",\"pages\":\"162\"},\"PeriodicalIF\":3.9000,\"publicationDate\":\"2025-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12211643/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"BMC Medical Research Methodology\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1186/s12874-025-02595-1\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"HEALTH CARE SCIENCES & SERVICES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Medical Research Methodology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12874-025-02595-1","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}
Optimising dynamic treatment regimens using sequential multiple assignment randomised trials data with missing data.
Dynamic treatment regimens are commonly used for patients with chronic or progressive medical conditions. Sequential multiple assignment randomised trials (SMARTs) are studies used to optimise dynamic treatment regimens by repeatedly randomising participants to treatments. Q-learning, a stage-wise regression-based method used to analyse SMARTs, uses backward induction to compare treatments administered as a sequence. Missing data is a common problem in randomised trials and can be complex in SMARTs given the sequential randomisation. Common methods for handling missing data such as complete case analysis (CCA) and multiple imputation (MI) have been widely explored in single-stage randomised trials, however, the only study that explored these methods in SMARTs did not consider Q-learning. We evaluated the performance of CCA and MI on the estimation of Q-learning parameters in a SMART. We simulated 1000 datasets of 500 participants, based on a SMART with two stages, under different missing data scenarios defined by missing directed acyclic graphs (m-DAGS), percentages of missing data (20%, 40%), stage 2 treatment effects, and strengths of association with missingness in stage 2 treatment, patient history and outcome. We also compared CCA and MI using retrospective data from a longitudinal smoking cessation SMART. When there was no treatment effect at either stage 1 or 2, we observed close to zero absolute bias in the stage 1 treatment effect and similar empirical standard errors for CCA and MI under all missing data scenarios. When all participants had a relatively large stage 2 treatment effect, we observed minimal bias from both CCA and MI, with slightly greater bias for MI. Empirical standard errors were higher for MI compared to CCA under all scenarios except for when data were missing not dependent on any variables. When the stage 2 treatment effect varied between participants and data were missing dependent on other variables (for example, stage 1 responder status missing dependent on stage 1 treatment and baseline variables), we observed greater bias for MI when estimating the stage 1 treatment effect, which increased with the percentage missingness, while the bias for CCA remained minimal. Resulting empirical standard errors were lower or similar for MI compared to CCA under all missing data scenarios. Results showed that for a two-stage SMART, MI failed to capture the differences between treatment effects when the stage 2 treatment effect varied between participants.
期刊介绍:
BMC Medical Research Methodology is an open access journal publishing original peer-reviewed research articles in methodological approaches to healthcare research. Articles on the methodology of epidemiological research, clinical trials and meta-analysis/systematic review are particularly encouraged, as are empirical studies of the associations between choice of methodology and study outcomes. BMC Medical Research Methodology does not aim to publish articles describing scientific methods or techniques: these should be directed to the BMC journal covering the relevant biomedical subject area.