Handling Missing Outcome Data in Cluster Randomized Trials With Both Individual- and Cluster-Level Dropout.

IF 1.8 4区医学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Statistics in Medicine Pub Date : 2025-09-01 DOI:10.1002/sim.70259

Analissa Avila, Beth A Glenn, Roshan Bastani, Catherine M Crespi

{"title":"Handling Missing Outcome Data in Cluster Randomized Trials With Both Individual- and Cluster-Level Dropout.","authors":"Analissa Avila, Beth A Glenn, Roshan Bastani, Catherine M Crespi","doi":"10.1002/sim.70259","DOIUrl":null,"url":null,"abstract":"<p><p>Missing outcome data are common in cluster randomized trials (CRTs), which can complicate inference. Further, the missingness can occur due to dropout of individuals, termed sporadically missing data, or dropout of clusters, termed systematically missing data, and these two types of missingness could have potentially different missing data mechanisms. We aimed to develop a well-performing and practical approach to handle inference in CRTs when outcome data may be both sporadically and systematically missing. To this end, we first examined the performance of several multilevel multiple imputation (MI) methods to handle sporadically and systematically missing CRT outcome data via a simulation study. Specifically, we examined performance under a multilevel covariate-dependent missingness assumption. Our findings indicated that several full conditional specification (FCS) methods designed for missingness in linear mixed models performed well under various scenarios, while an FCS approach using a two-stage estimator often performed poorly. We then developed methods for conducting sensitivity analysis to test the robustness of inferences under different missing at random (MAR) and missing not at random (MNAR) assumptions. The methods allow for different MNAR assumptions for cluster dropout and individual dropout to reflect that they may arise from different missing data mechanisms. We used graphical displays to visualize sensitivity analysis results. Our methods are illustrated using a real data application.</p>","PeriodicalId":21879,"journal":{"name":"Statistics in Medicine","volume":"44 20-22","pages":"e70259"},"PeriodicalIF":1.8000,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Statistics in Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1002/sim.70259","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MATHEMATICAL & COMPUTATIONAL BIOLOGY","Score":null,"Total":0}

引用次数: 0

Abstract

Missing outcome data are common in cluster randomized trials (CRTs), which can complicate inference. Further, the missingness can occur due to dropout of individuals, termed sporadically missing data, or dropout of clusters, termed systematically missing data, and these two types of missingness could have potentially different missing data mechanisms. We aimed to develop a well-performing and practical approach to handle inference in CRTs when outcome data may be both sporadically and systematically missing. To this end, we first examined the performance of several multilevel multiple imputation (MI) methods to handle sporadically and systematically missing CRT outcome data via a simulation study. Specifically, we examined performance under a multilevel covariate-dependent missingness assumption. Our findings indicated that several full conditional specification (FCS) methods designed for missingness in linear mixed models performed well under various scenarios, while an FCS approach using a two-stage estimator often performed poorly. We then developed methods for conducting sensitivity analysis to test the robustness of inferences under different missing at random (MAR) and missing not at random (MNAR) assumptions. The methods allow for different MNAR assumptions for cluster dropout and individual dropout to reflect that they may arise from different missing data mechanisms. We used graphical displays to visualize sensitivity analysis results. Our methods are illustrated using a real data application.

查看原文本刊更多论文

在个体和群体水平均退出的聚类随机试验中处理缺失结果数据。

结果数据缺失在聚类随机试验（crt）中很常见，这可能使推理复杂化。此外，缺失可能是由于个体的缺失（称为零星缺失数据）或集群的缺失（称为系统缺失数据）造成的，这两种类型的缺失可能具有潜在的不同缺失数据机制。我们的目标是开发一种性能良好且实用的方法来处理ct中的推断，当结果数据可能既零星又系统地缺失时。为此，我们首先通过模拟研究，检验了几种多层次多重输入（MI）方法处理偶发性和系统性缺失的CRT结果数据的性能。具体而言，我们在多水平协变量相关缺失假设下检查了性能。我们的研究结果表明，为线性混合模型中的缺失设计的几种完全条件规范（FCS）方法在各种情况下都表现良好，而使用两阶段估计器的FCS方法通常表现不佳。然后，我们开发了进行敏感性分析的方法，以检验不同随机缺失（MAR）和非随机缺失（MNAR）假设下推断的稳健性。这些方法允许对集群退出和个体退出进行不同的MNAR假设，以反映它们可能由不同的缺失数据机制引起。我们使用图形显示将灵敏度分析结果可视化。我们的方法是通过一个实际的数据应用来说明的。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Statistics in Medicine 医学-公共卫生、环境卫生与职业卫生

CiteScore

3.40

自引率

10.00%

发文量

334

审稿时长

2-4 weeks

期刊介绍： The journal aims to influence practice in medicine and its associated sciences through the publication of papers on statistical and other quantitative methods. Papers will explain new methods and demonstrate their application, preferably through a substantive, real, motivating example or a comprehensive evaluation based on an illustrative example. Alternatively, papers will report on case-studies where creative use or technical generalizations of established methodology is directed towards a substantive application. Reviews of, and tutorials on, general topics relevant to the application of statistics to medicine will also be published. The main criteria for publication are appropriateness of the statistical methods to a particular medical problem and clarity of exposition. Papers with primarily mathematical content will be excluded. The journal aims to enhance communication between statisticians, clinicians and medical researchers.