On the importance of modeling the invisible world of underlying effect sizes

Q2 Psychology

Social Psychological Bulletin Pub Date : 2023-11-17 DOI:10.32872/spb.9981

Brent M Wilson, J. Wixted

{"title":"On the importance of modeling the invisible world of underlying effect sizes","authors":"Brent M Wilson, J. Wixted","doi":"10.32872/spb.9981","DOIUrl":null,"url":null,"abstract":"The headline findings from the Open Science Collaboration (2015)―namely, that 36% of original experiments replicated at p < .05, with the overall replication effect sizes being half as large as the original effects―cannot be meaningfully interpreted without a formal model. A simple model-based approach might ask: what would the state of original science be and what would replication results show if original experiments tested true effects half the time (prior odds = 1), true effects had a medium effect size (Cohen’s δ = 0.50), and power to detect true effects was 50%? Assuming no questionable research practices, 91% of p < .05 findings in the original literature would be true positives. However, only 58% of original p < .05 findings would be expected to replicate using the Open Science Collaboration approach, and the replication effects overall would be only ~60% as large as the original effects. A minor variant of this model yields an expected replication rate of only 45%, with overall replication effect sizes dropping by half. If the state of original science is as grim as a non-model-based (i.e., intuitive) interpretation of the Open Science Collaboration data suggests, should it be this easy to largely account for those findings using a model in which 91% of statistically significant findings in the original science literature are true positives? Claims that the findings reported by the Open Science Collaboration indicate a replication crisis should not be based solely on intuition but should instead be accompanied by a specific model that supports that interpretation.","PeriodicalId":32922,"journal":{"name":"Social Psychological Bulletin","volume":"12 4","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Social Psychological Bulletin","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.32872/spb.9981","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Psychology","Score":null,"Total":0}

引用次数: 0

Abstract

The headline findings from the Open Science Collaboration (2015)―namely, that 36% of original experiments replicated at p < .05, with the overall replication effect sizes being half as large as the original effects―cannot be meaningfully interpreted without a formal model. A simple model-based approach might ask: what would the state of original science be and what would replication results show if original experiments tested true effects half the time (prior odds = 1), true effects had a medium effect size (Cohen’s δ = 0.50), and power to detect true effects was 50%? Assuming no questionable research practices, 91% of p < .05 findings in the original literature would be true positives. However, only 58% of original p < .05 findings would be expected to replicate using the Open Science Collaboration approach, and the replication effects overall would be only ~60% as large as the original effects. A minor variant of this model yields an expected replication rate of only 45%, with overall replication effect sizes dropping by half. If the state of original science is as grim as a non-model-based (i.e., intuitive) interpretation of the Open Science Collaboration data suggests, should it be this easy to largely account for those findings using a model in which 91% of statistically significant findings in the original science literature are true positives? Claims that the findings reported by the Open Science Collaboration indicate a replication crisis should not be based solely on intuition but should instead be accompanied by a specific model that supports that interpretation.

查看原文本刊更多论文

对基本效应大小的隐形世界进行建模的重要性

开放科学合作组织（2015）的主要发现--即 36% 的原始实验的复制结果 p < .05，总体复制效应大小是原始效应的一半--如果没有一个正式的模型，是无法进行有意义的解释的。一个简单的基于模型的方法可能会问：如果原始实验有一半的时间测试了真实效应（先验几率 = 1），真实效应具有中等效应大小（Cohen's δ = 0.50），检测真实效应的功率为 50%，那么原始科学的状况会是怎样的？假设没有可疑的研究实践，那么原始文献中 91% 的 p < .05 结果将是真正的阳性结果。然而，使用开放科学合作方法，预计只有 58% 的 p < .05 的原始研究结果会被复制，复制效果总体上只有原始效果的 60%。该模型的一个小变体得出的预期复制率仅为 45%，总体复制效应大小下降了一半。如果原创科学的现状就像对开放科学合作组织数据的非模型（即直观）解释所显示的那样严峻，那么使用一个模型来解释这些发现是否就这么容易呢？在这个模型中，原创科学文献中91%具有统计意义的发现都是真阳性？关于开放科学合作组织报告的研究结果表明存在复制危机的说法不应该仅仅基于直觉，而应该有一个支持这种解释的具体模型。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊