认真对待参数假设:在单因素方差分析中使用韦尔奇f检验而不是经典f检验的争论

IF 1.6 4区心理学 Q3 PSYCHOLOGY, SOCIAL

International Review of Social Psychology Pub Date : 2019-08-01 DOI:10.5334/IRSP.198

Marie Delacre, C. Leys, Youri L. Mora, D. Lakens

{"title":"认真对待参数假设:在单因素方差分析中使用韦尔奇f检验而不是经典f检验的争论","authors":"Marie Delacre, C. Leys, Youri L. Mora, D. Lakens","doi":"10.5334/IRSP.198","DOIUrl":null,"url":null,"abstract":"Student’s t-test and classical F-test ANOVA rely on the assumptions that two or more samples are independent, and that independent and identically distributed residuals are normal and have equal variances between groups. We focus on the assumptions of normality and equality of variances, and argue that these assumptions are often unrealistic in the field of psychology. We underline the current lack of attention to these assumptions through an analysis of researchers’ practices. Through Monte Carlo simulations, we illustrate the consequences of performing the classic parametric F-test for ANOVA when the test assumptions are not met on the Type I error rate and statistical power. Under realistic deviations from the assumption of equal variances, the classic F-test can yield severely biased results and lead to invalid statistical inferences. We examine two common alternatives to the F-test, namely the Welch’s ANOVA (W-test) and the Brown-Forsythe test (F*-test). Our simulations show that under a range of realistic scenarios, the W-test is a better alternative and we therefore recommend using the W-test by default when comparing means. We provide a detailed example explaining how to perform the W-test in SPSS and R. We summarize our conclusions in practical recommendations that researchers can use to improve their statistical practices.","PeriodicalId":45461,"journal":{"name":"International Review of Social Psychology","volume":" ","pages":""},"PeriodicalIF":1.6000,"publicationDate":"2019-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"129","resultStr":"{\"title\":\"Taking Parametric Assumptions Seriously: Arguments for the Use of Welch’s F-test instead of the Classical F-test in One-Way ANOVA\",\"authors\":\"Marie Delacre, C. Leys, Youri L. Mora, D. Lakens\",\"doi\":\"10.5334/IRSP.198\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Student’s t-test and classical F-test ANOVA rely on the assumptions that two or more samples are independent, and that independent and identically distributed residuals are normal and have equal variances between groups. We focus on the assumptions of normality and equality of variances, and argue that these assumptions are often unrealistic in the field of psychology. We underline the current lack of attention to these assumptions through an analysis of researchers’ practices. Through Monte Carlo simulations, we illustrate the consequences of performing the classic parametric F-test for ANOVA when the test assumptions are not met on the Type I error rate and statistical power. Under realistic deviations from the assumption of equal variances, the classic F-test can yield severely biased results and lead to invalid statistical inferences. We examine two common alternatives to the F-test, namely the Welch’s ANOVA (W-test) and the Brown-Forsythe test (F*-test). Our simulations show that under a range of realistic scenarios, the W-test is a better alternative and we therefore recommend using the W-test by default when comparing means. We provide a detailed example explaining how to perform the W-test in SPSS and R. We summarize our conclusions in practical recommendations that researchers can use to improve their statistical practices.\",\"PeriodicalId\":45461,\"journal\":{\"name\":\"International Review of Social Psychology\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":1.6000,\"publicationDate\":\"2019-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"129\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Review of Social Psychology\",\"FirstCategoryId\":\"102\",\"ListUrlMain\":\"https://doi.org/10.5334/IRSP.198\",\"RegionNum\":4,\"RegionCategory\":\"心理学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"PSYCHOLOGY, SOCIAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Review of Social Psychology","FirstCategoryId":"102","ListUrlMain":"https://doi.org/10.5334/IRSP.198","RegionNum":4,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"PSYCHOLOGY, SOCIAL","Score":null,"Total":0}

引用次数: 129

摘要

学生t检验和经典f检验ANOVA依赖于两个或多个样本是独立的假设，并且独立和同分布的残差是正态的，并且组间方差相等。我们关注正态性和方差相等的假设，并认为这些假设在心理学领域往往是不现实的。通过对研究人员实践的分析，我们强调目前缺乏对这些假设的关注。通过蒙特卡罗模拟，我们说明了在I型错误率和统计功率不满足检验假设时，对ANOVA进行经典参数f检验的后果。在实际偏离方差相等假设的情况下，经典的f检验会产生严重的偏倚结果，导致无效的统计推断。我们研究了F检验的两种常见替代方法，即韦尔奇方差分析(w检验)和布朗-福赛检验(F*检验)。我们的模拟表明，在一系列现实场景下，w检验是一个更好的选择，因此我们建议在比较均值时默认使用w检验。我们提供了一个详细的例子来解释如何在SPSS和r中执行w检验。我们将我们的结论总结为研究人员可以使用的实用建议，以改进他们的统计实践。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Taking Parametric Assumptions Seriously: Arguments for the Use of Welch’s F-test instead of the Classical F-test in One-Way ANOVA

Student’s t-test and classical F-test ANOVA rely on the assumptions that two or more samples are independent, and that independent and identically distributed residuals are normal and have equal variances between groups. We focus on the assumptions of normality and equality of variances, and argue that these assumptions are often unrealistic in the field of psychology. We underline the current lack of attention to these assumptions through an analysis of researchers’ practices. Through Monte Carlo simulations, we illustrate the consequences of performing the classic parametric F-test for ANOVA when the test assumptions are not met on the Type I error rate and statistical power. Under realistic deviations from the assumption of equal variances, the classic F-test can yield severely biased results and lead to invalid statistical inferences. We examine two common alternatives to the F-test, namely the Welch’s ANOVA (W-test) and the Brown-Forsythe test (F*-test). Our simulations show that under a range of realistic scenarios, the W-test is a better alternative and we therefore recommend using the W-test by default when comparing means. We provide a detailed example explaining how to perform the W-test in SPSS and R. We summarize our conclusions in practical recommendations that researchers can use to improve their statistical practices.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

International Review of Social Psychology PSYCHOLOGY, SOCIAL-

CiteScore

5.20

自引率

8.00%

发文量

审稿时长

16 weeks

期刊介绍： The International Review of Social Psychology (IRSP) is supported by the Association pour la Diffusion de la Recherche Internationale en Psychologie Sociale (A.D.R.I.P.S.). The International Review of Social Psychology publishes empirical research and theoretical notes in all areas of social psychology. Articles are written preferably in English but can also be written in French. The journal was created to reflect research advances in a field where theoretical and fundamental questions inevitably convey social significance and implications. It emphasizes scientific quality of its publications in every area of social psychology. Any kind of research can be considered, as long as the results significantly enhance the understanding of a general social psychological phenomenon and the methodology is appropriate.