Revisiting Savalei’s (2011) Research on Remediating Zero-Frequency Cells in Estimating Polychoric Correlations: A Data Distribution Perspective

IF 2.5 2区 心理学 Q2 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS
Tong-Rong Yang, Li-Jen Weng
{"title":"Revisiting Savalei’s (2011) Research on Remediating Zero-Frequency Cells in Estimating Polychoric Correlations: A Data Distribution Perspective","authors":"Tong-Rong Yang, Li-Jen Weng","doi":"10.1080/10705511.2023.2220919","DOIUrl":null,"url":null,"abstract":"<p><b>Abstract</b></p><p>In Savalei’s (<span>2011<span aria-label=\"reference\" role=\"dialog\"><span aria-label=\"Close reference popup\" role=\"button\" tabindex=\"0\"></span><span></span> <span>Savalei, <span>V.</span></span> (<span>2011</span>). <span>What to do about zero frequency cells when estimating polychoric correlations</span>. <i>Structural Equation Modeling</i>, <i>18</i>, <span>253</span>–<span>273</span>. <span>https://doi.org/10.1080/10705511.2011.557339</span><span><span>[Taylor &amp; Francis Online], [Web of Science ®]</span> <span>, [Google Scholar]</span></span></span></span>) simulation that evaluated the performance of polychoric correlation estimates in small samples, two methods for treating zero-frequency cells, adding 0.5 (ADD) and doing nothing (NONE), were compared. Savalei tentatively suggested using ADD for binary data and NONE for data with three or more categories. Yet, Savalei’s suggestion could be explained by the skewness of the data distribution being severe for binary data and slight for three-category data. To rule out this alternative explanation, we extended Savalei’s design by incorporating the degree of skewness into our simulation. With slightly skewed data, NONE is recommended due to its high-quality estimates. With severely skewed data, only ADD is recommended for binary data when the skewness of two variables is the same-signed and the underlying correlation is expected to be strong. Methods for improving the polychoric correlation estimates with severely skewed data merit further study.</p>","PeriodicalId":21964,"journal":{"name":"Structural Equation Modeling: A Multidisciplinary Journal","volume":"24 7","pages":""},"PeriodicalIF":2.5000,"publicationDate":"2023-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Structural Equation Modeling: A Multidisciplinary Journal","FirstCategoryId":"102","ListUrlMain":"https://doi.org/10.1080/10705511.2023.2220919","RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MATHEMATICS, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0

Abstract

In Savalei’s (2011 Savalei, V. (2011). What to do about zero frequency cells when estimating polychoric correlations. Structural Equation Modeling, 18, 253273. https://doi.org/10.1080/10705511.2011.557339[Taylor & Francis Online], [Web of Science ®] , [Google Scholar]) simulation that evaluated the performance of polychoric correlation estimates in small samples, two methods for treating zero-frequency cells, adding 0.5 (ADD) and doing nothing (NONE), were compared. Savalei tentatively suggested using ADD for binary data and NONE for data with three or more categories. Yet, Savalei’s suggestion could be explained by the skewness of the data distribution being severe for binary data and slight for three-category data. To rule out this alternative explanation, we extended Savalei’s design by incorporating the degree of skewness into our simulation. With slightly skewed data, NONE is recommended due to its high-quality estimates. With severely skewed data, only ADD is recommended for binary data when the skewness of two variables is the same-signed and the underlying correlation is expected to be strong. Methods for improving the polychoric correlation estimates with severely skewed data merit further study.

回顾Savalei(2011)在估计多共时相关性中修复零频率单元的研究:一个数据分布的视角
[摘要]in Savalei 's(2011)。在估计多频相关性时如何处理零频率单元。力学与工程学报,18(3):593 - 593。https://doi.org/10.1080/10705511.2011.557339[泰勒,Francis Online], [Web of Science®],[谷歌Scholar])模拟评估小样本中多频相关估计的性能,比较了两种处理零频率细胞的方法,添加0.5 (ADD)和不做(NONE)。Savalei初步建议对二进制数据使用ADD,对具有三个或更多类别的数据使用NONE。然而,Savalei的建议可以用数据分布的偏性对二进制数据来说很严重,而对三类数据来说则轻微来解释。为了排除这种可能的解释,我们扩展了Savalei的设计,在模拟中加入了偏度。对于稍微偏斜的数据,建议使用NONE,因为它具有高质量的估计。对于严重偏斜的数据,当两个变量的偏度是同号的并且期望潜在的相关性很强时,只建议对二进制数据使用ADD。改进严重偏斜数据的多重相关估计的方法值得进一步研究。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
8.70
自引率
11.70%
发文量
71
审稿时长
>12 weeks
期刊介绍: Structural Equation Modeling: A Multidisciplinary Journal publishes refereed scholarly work from all academic disciplines interested in structural equation modeling. These disciplines include, but are not limited to, psychology, medicine, sociology, education, political science, economics, management, and business/marketing. Theoretical articles address new developments; applied articles deal with innovative structural equation modeling applications; the Teacher’s Corner provides instructional modules on aspects of structural equation modeling; book and software reviews examine new modeling information and techniques; and advertising alerts readers to new products. Comments on technical or substantive issues addressed in articles or reviews published in the journal are encouraged; comments are reviewed, and authors of the original works are invited to respond.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信