Power analysis to detect misfit in SEMs with many items: Resolving unrecognized problems, relating old and new approaches, and "matching" power analysis approach to data analysis approach.
Authors: Amy Liang, Sonya K. Sterba
Journal: Psychological Methods (Q1, Psychology, Multidisciplinary; Impact Factor 7.6)
DOI: 10.1037/met0000684
Publication date: 2024-12-12
Publication type: Journal Article
Citations: 0
Abstract
It is unappreciated that there are four different approaches to power analysis for detecting misspecification by testing overall fit of structural equation models (SEMs) and, moreover, that common approaches can yield radically diverging results for SEMs with many items (high p). Here we newly relate these four approaches. Analytical power analysis methods using theoretical null and theoretical alternative distributions (Approach 1) have a long history, are widespread, and are often contrasted with "the" Monte Carlo method, which is an oversimplification. Actually, three Monte Carlo methods can be distinguished; all use an empirical alternative distribution but differ in whether the null distribution is theoretical (Approach 2), empirical (Approach 3), or, as we newly propose and demonstrate the need for, adjusted empirical (Approach 4). Because these four approaches can yield radically diverging power results under high p (as demonstrated here), researchers need to "match" their a priori SEM power analysis approach to their later SEM data analysis approach for testing overall fit, once data are collected. Disturbingly, the most common power analysis approach for a global test-of-fit is mismatched with the most common data analysis approach for a global test-of-fit in SEM. Because of this mismatch, researchers' anticipated versus actual/obtained power can differ substantially. We explain how and why to "match" across the power-analysis and data-analysis phases of a study and provide software to facilitate doing so. As extensions, we explain how to relate and implement all four approaches to power analysis (a) for testing overall fit using χ² versus the root-mean-square error of approximation and (b) for testing overall fit versus testing a target parameter/effect. (PsycInfo Database Record (c) 2024 APA, all rights reserved).
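The core distinction among the Monte Carlo approaches (empirical alternative distribution, with a theoretical versus empirical null distribution) can be illustrated with a minimal sketch. This is not the authors' software or an actual SEM fit test; it uses a simple one-parameter Wald test of H0: mean = 0 as a hypothetical stand-in for a global test-of-fit statistic, so that the same simulated alternative distribution can be compared against a theoretical χ²(1) critical value (Approach 2) versus an empirically simulated null critical value (Approach 3):

```python
import random
import statistics

def wald_stat(sample):
    # Wald statistic for H0: mean = 0; asymptotically chi-square(1) under H0
    n = len(sample)
    xbar = statistics.fmean(sample)
    return n * xbar * xbar / statistics.variance(sample)

def simulate_stats(mu, n, reps, rng):
    # Empirical distribution of the test statistic at population mean mu
    return [wald_stat([rng.gauss(mu, 1.0) for _ in range(n)])
            for _ in range(reps)]

def empirical_quantile(xs, q):
    xs = sorted(xs)
    return xs[min(len(xs) - 1, int(q * len(xs)))]

rng = random.Random(12345)
reps, n, alpha = 2000, 50, 0.05

null_stats = simulate_stats(0.0, n, reps, rng)  # empirical null distribution
alt_stats = simulate_stats(0.4, n, reps, rng)   # empirical alternative distribution

crit_theoretical = 3.8415  # chi-square(1) 95th percentile (theoretical null)
crit_empirical = empirical_quantile(null_stats, 1 - alpha)

# Approach 2: empirical alternative vs. theoretical null critical value
power_approach2 = sum(t > crit_theoretical for t in alt_stats) / reps
# Approach 3: empirical alternative vs. empirical null critical value
power_approach3 = sum(t > crit_empirical for t in alt_stats) / reps
```

When the theoretical null distribution approximates the empirical one well (as in this toy case), the two power estimates are close; the article's point is that under high p in SEM the two null distributions can diverge sharply, so the two approaches can return very different power for the same design.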
Journal overview:
Psychological Methods is devoted to the development and dissemination of methods for collecting, analyzing, understanding, and interpreting psychological data. Its purpose is the dissemination of innovations in research design, measurement, methodology, and quantitative and qualitative analysis to the psychological community; its further purpose is to promote effective communication about related substantive and methodological issues. The audience is expected to be diverse and to include those who develop new procedures, those who are responsible for undergraduate and graduate training in design, measurement, and statistics, as well as those who employ those procedures in research.