Causal inference in randomized trials with partial clustering.

IF 2.2 3区医学 Q3 MEDICINE, RESEARCH & EXPERIMENTAL

Clinical Trials Pub Date : 2025-10-01 Epub Date: 2025-05-02 DOI:10.1177/17407745251333779

Joshua R Nugent, Elijah Kakande, Gabriel Chamie, Jane Kabami, Asiphas Owaraganise, Diane V Havlir, Moses Kamya, Laura B Balzer

{"title":"Causal inference in randomized trials with partial clustering.","authors":"Joshua R Nugent, Elijah Kakande, Gabriel Chamie, Jane Kabami, Asiphas Owaraganise, Diane V Havlir, Moses Kamya, Laura B Balzer","doi":"10.1177/17407745251333779","DOIUrl":null,"url":null,"abstract":"Background: Participant dependence, if present, must be accounted for in the analysis of randomized trials. This dependence, also referred to as \"clustering,\" can occur in one or more trial arms. This dependence may predate randomization or arise after randomization. We examine three trial designs: one \"fully clustered\" (where all participants are dependent within clusters or groups) and two \"partially clustered\" (where some participants are dependent within clusters and some participants are completely independent of all others).Methods: For these three designs, we (1) use causal models to non-parametrically describe the data generating process and formalize the dependence in the observed data distribution; (2) develop a novel implementation of targeted minimum loss-based estimation for analysis; (3) evaluate the finite-sample performance of targeted minimum loss-based estimation and common alternatives via a simulation study; and (4) apply the methods to real-data from the SEARCH-IPT trial.Results: We show that the two randomization schemes resulting in partially clustered trials have the same dependence structure, enabling use of the same statistical methods for estimation and inference of causal effects. Our novel targeted minimum loss-based estimation approach leverages covariate adjustment and machine learning to improve precision and facilitates estimation of a large set of causal effects. In simulations, we demonstrate that targeted minimum loss-based estimation achieves comparable or markedly higher statistical power than common alternatives for these partially clustered designs. Finally, application of targeted minimum loss-based estimation to real data from the SEARCH-IPT trial resulted in 20%-57% efficiency gains, demonstrating the real-world consequences of our proposed approach.ConclusionsPartially clustered trial analysis can be made more efficient by implementing targeted minimum loss-based estimation, assuming care is taken to account for the dependent nature of the observed data.","PeriodicalId":10685,"journal":{"name":"Clinical Trials","volume":" ","pages":"547-558"},"PeriodicalIF":2.2000,"publicationDate":"2025-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12355196/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Clinical Trials","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1177/17407745251333779","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/5/2 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"MEDICINE, RESEARCH & EXPERIMENTAL","Score":null,"Total":0}

引用次数: 0

Abstract

Background: Participant dependence, if present, must be accounted for in the analysis of randomized trials. This dependence, also referred to as "clustering," can occur in one or more trial arms. This dependence may predate randomization or arise after randomization. We examine three trial designs: one "fully clustered" (where all participants are dependent within clusters or groups) and two "partially clustered" (where some participants are dependent within clusters and some participants are completely independent of all others).

Methods: For these three designs, we (1) use causal models to non-parametrically describe the data generating process and formalize the dependence in the observed data distribution; (2) develop a novel implementation of targeted minimum loss-based estimation for analysis; (3) evaluate the finite-sample performance of targeted minimum loss-based estimation and common alternatives via a simulation study; and (4) apply the methods to real-data from the SEARCH-IPT trial.

Results: We show that the two randomization schemes resulting in partially clustered trials have the same dependence structure, enabling use of the same statistical methods for estimation and inference of causal effects. Our novel targeted minimum loss-based estimation approach leverages covariate adjustment and machine learning to improve precision and facilitates estimation of a large set of causal effects. In simulations, we demonstrate that targeted minimum loss-based estimation achieves comparable or markedly higher statistical power than common alternatives for these partially clustered designs. Finally, application of targeted minimum loss-based estimation to real data from the SEARCH-IPT trial resulted in 20%-57% efficiency gains, demonstrating the real-world consequences of our proposed approach.ConclusionsPartially clustered trial analysis can be made more efficient by implementing targeted minimum loss-based estimation, assuming care is taken to account for the dependent nature of the observed data.

查看原文本刊更多论文

部分聚类随机试验中的因果推断。

背景：在随机试验的分析中，如果存在受试者依赖，必须考虑到。这种依赖性，也称为“聚类”，可能发生在一个或多个试验组中。这种依赖性可能发生在随机化之前或随机化之后。我们研究了三种试验设计：一种是“完全聚类”（所有参与者都依赖于集群或组），另一种是“部分聚类”（一些参与者依赖于集群，一些参与者完全独立于其他所有参与者）。方法：对于这三个设计，我们(1)使用因果模型来非参数地描述数据生成过程，并形式化观测数据分布中的相关性；(2)开发一种新的基于目标最小损失的分析估计方法；(3)通过仿真研究评估目标最小损失估计和常见替代方案的有限样本性能；(4)将该方法应用于SEARCH-IPT试验的实际数据。结果：我们表明，导致部分聚类试验的两种随机化方案具有相同的依赖结构，可以使用相同的统计方法来估计和推断因果效应。我们新颖的目标最小损失估计方法利用协变量调整和机器学习来提高精度，并促进对大量因果效应的估计。在模拟中，我们证明了针对这些部分聚类设计的基于最小损失的目标估计实现了与普通替代方案相当或显着更高的统计功率。最后，将基于最小损失的目标估计应用于SEARCH-IPT试验的实际数据，效率提高了20%-57%，证明了我们提出的方法在现实世界中的效果。结论部分聚类试验分析可以通过实施有针对性的基于最小损失的估计来提高效率，前提是要注意考虑到观察数据的依赖性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Clinical Trials 医学-医学：研究与实验

CiteScore

4.10

自引率

3.70%

发文量

审稿时长

6-12 weeks

期刊介绍： Clinical Trials is dedicated to advancing knowledge on the design and conduct of clinical trials related research methodologies. Covering the design, conduct, analysis, synthesis and evaluation of key methodologies, the journal remains on the cusp of the latest topics, including ethics, regulation and policy impact.