Analissa Avila, Beth A Glenn, Roshan Bastani, Catherine M Crespi
{"title":"Handling Missing Outcome Data in Cluster Randomized Trials With Both Individual- and Cluster-Level Dropout.","authors":"Analissa Avila, Beth A Glenn, Roshan Bastani, Catherine M Crespi","doi":"10.1002/sim.70259","DOIUrl":null,"url":null,"abstract":"<p><p>Missing outcome data are common in cluster randomized trials (CRTs), which can complicate inference. Further, the missingness can occur due to dropout of individuals, termed sporadically missing data, or dropout of clusters, termed systematically missing data, and these two types of missingness could have potentially different missing data mechanisms. We aimed to develop a well-performing and practical approach to handle inference in CRTs when outcome data may be both sporadically and systematically missing. To this end, we first examined the performance of several multilevel multiple imputation (MI) methods to handle sporadically and systematically missing CRT outcome data via a simulation study. Specifically, we examined performance under a multilevel covariate-dependent missingness assumption. Our findings indicated that several full conditional specification (FCS) methods designed for missingness in linear mixed models performed well under various scenarios, while an FCS approach using a two-stage estimator often performed poorly. We then developed methods for conducting sensitivity analysis to test the robustness of inferences under different missing at random (MAR) and missing not at random (MNAR) assumptions. The methods allow for different MNAR assumptions for cluster dropout and individual dropout to reflect that they may arise from different missing data mechanisms. We used graphical displays to visualize sensitivity analysis results. Our methods are illustrated using a real data application.</p>","PeriodicalId":21879,"journal":{"name":"Statistics in Medicine","volume":"44 20-22","pages":"e70259"},"PeriodicalIF":1.8000,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Statistics in Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1002/sim.70259","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MATHEMATICAL & COMPUTATIONAL BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Missing outcome data are common in cluster randomized trials (CRTs), which can complicate inference. Further, the missingness can occur due to dropout of individuals, termed sporadically missing data, or dropout of clusters, termed systematically missing data, and these two types of missingness could have potentially different missing data mechanisms. We aimed to develop a well-performing and practical approach to handle inference in CRTs when outcome data may be both sporadically and systematically missing. To this end, we first examined the performance of several multilevel multiple imputation (MI) methods to handle sporadically and systematically missing CRT outcome data via a simulation study. Specifically, we examined performance under a multilevel covariate-dependent missingness assumption. Our findings indicated that several full conditional specification (FCS) methods designed for missingness in linear mixed models performed well under various scenarios, while an FCS approach using a two-stage estimator often performed poorly. We then developed methods for conducting sensitivity analysis to test the robustness of inferences under different missing at random (MAR) and missing not at random (MNAR) assumptions. The methods allow for different MNAR assumptions for cluster dropout and individual dropout to reflect that they may arise from different missing data mechanisms. We used graphical displays to visualize sensitivity analysis results. Our methods are illustrated using a real data application.
期刊介绍:
The journal aims to influence practice in medicine and its associated sciences through the publication of papers on statistical and other quantitative methods. Papers will explain new methods and demonstrate their application, preferably through a substantive, real, motivating example or a comprehensive evaluation based on an illustrative example. Alternatively, papers will report on case-studies where creative use or technical generalizations of established methodology is directed towards a substantive application. Reviews of, and tutorials on, general topics relevant to the application of statistics to medicine will also be published. The main criteria for publication are appropriateness of the statistical methods to a particular medical problem and clarity of exposition. Papers with primarily mathematical content will be excluded. The journal aims to enhance communication between statisticians, clinicians and medical researchers.