Commentary on ``Nonparametric identification is not enough, but randomized controlled trials are'': Statistical considerations for generating reliable evidence across a spectrum of studies that increasingly involve real-world elements.
{"title":"Commentary on ``Nonparametric identification is not enough, but randomized controlled trials are'': Statistical considerations for generating reliable evidence across a spectrum of studies that increasingly involve real-world elements.","authors":"Rachael Phillips, Mark van der Laan","doi":"10.1353/obs.2025.a956842","DOIUrl":null,"url":null,"abstract":"<p><p>Judea Pearl, quoted in Pearl and Mackenzie (2008), stated that \"once we have understood why [randomized controlled trials] RCTs work, there is no need to put them on a pedestal and treat them as the gold standard of causal analysis, which all other methods should emulate.\" In Aronow et al. (2024), this claim is refuted, drawing on results of Robins and Ritov (1997). The argument is made that statistical estimation and inference tend to be fundamentally more difficult in observational studies than in randomized controlled trials, even when all confounders are observed and measured without error. We congratulate the authors for raising this highly timely, interesting discussion and welcome this opportunity to join this important debate. In this commentary, we focus on what it takes to generate reliable evidence across a spectrum of studies that increasingly involve real-world elements and less control over design. A related question is whether, along this spectrum of studies, the reliability of evidence generated by a statistical analysis decreases. We claim that this is not the case, but that the challenge for the appropriate statistical method increases, requiring sophisticated and careful execution.</p>","PeriodicalId":74335,"journal":{"name":"Observational studies","volume":"11 1","pages":"61-76"},"PeriodicalIF":0.0000,"publicationDate":"2025-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12139718/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Observational studies","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1353/obs.2025.a956842","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Judea Pearl, quoted in Pearl and Mackenzie (2008), stated that "once we have understood why [randomized controlled trials] RCTs work, there is no need to put them on a pedestal and treat them as the gold standard of causal analysis, which all other methods should emulate." In Aronow et al. (2024), this claim is refuted, drawing on results of Robins and Ritov (1997). The argument is made that statistical estimation and inference tend to be fundamentally more difficult in observational studies than in randomized controlled trials, even when all confounders are observed and measured without error. We congratulate the authors for raising this highly timely, interesting discussion and welcome this opportunity to join this important debate. In this commentary, we focus on what it takes to generate reliable evidence across a spectrum of studies that increasingly involve real-world elements and less control over design. A related question is whether, along this spectrum of studies, the reliability of evidence generated by a statistical analysis decreases. We claim that this is not the case, but that the challenge for the appropriate statistical method increases, requiring sophisticated and careful execution.
Pearl and Mackenzie(2008)引用朱迪亚·珀尔(Judea Pearl)的话说:“一旦我们理解了随机对照试验(rrcts)有效的原因,就没有必要把它们奉为因果分析的黄金标准,所有其他方法都应该效仿。”在Aronow et al.(2024)中,这一说法被反驳,借鉴了Robins和Ritov(1997)的结果。有人认为,在观察性研究中,统计估计和推断往往从根本上比在随机对照试验中更困难,即使所有的混杂因素都被准确地观察和测量。我们祝贺作者提出了这一非常及时、有趣的讨论,并欢迎有机会参加这一重要的辩论。在这篇评论中,我们关注的是如何在一系列研究中产生可靠的证据,这些研究越来越多地涉及现实世界的元素,对设计的控制越来越少。一个相关的问题是,在这一系列研究中,统计分析产生的证据的可靠性是否会降低。我们声称,情况并非如此,而是对适当统计方法的挑战增加了,需要复杂和仔细的执行。