要不要归罪于人?在不完整数据集上检验多元正态性:重访BHEP检验。

IF 1.1 4区 数学 Q2 STATISTICS & PROBABILITY
Journal of Applied Statistics Pub Date : 2024-12-09 eCollection Date: 2025-01-01 DOI:10.1080/02664763.2024.2438798
Danijel G Aleksić, Bojana Milošević
{"title":"要不要归罪于人?在不完整数据集上检验多元正态性:重访BHEP检验。","authors":"Danijel G Aleksić, Bojana Milošević","doi":"10.1080/02664763.2024.2438798","DOIUrl":null,"url":null,"abstract":"<p><p>In this paper, we focus on testing multivariate normality using the BHEP test with data that are missing completely at random. Our objective is twofold: first, to gain insight into the asymptotic behavior of the BHEP test statistics under two widely used approaches for handling missing data, namely complete-case analysis and imputation, and second, to compare the power performance of the test statistic under these approaches. Since complete-case approach removes all elements of the sample with at least one missing component, it might lead to the loss of information. On the other hand, we note that performing the test on imputed data as if they were complete, Type I error becomes severely distorted. To address these issues, we propose an appropriate bootstrap algorithm for approximating <i>p</i>-values. Extensive simulation studies demonstrate that both mean and median approaches exhibit greater power compared to testing with complete-case analysis, and open some questions for further research. The proposed methodology is illustrated with real-data examples.</p>","PeriodicalId":15239,"journal":{"name":"Journal of Applied Statistics","volume":"52 9","pages":"1742-1759"},"PeriodicalIF":1.1000,"publicationDate":"2024-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12217108/pdf/","citationCount":"0","resultStr":"{\"title\":\"To impute or not? Testing multivariate normality on incomplete dataset: revisiting the BHEP test.\",\"authors\":\"Danijel G Aleksić, Bojana Milošević\",\"doi\":\"10.1080/02664763.2024.2438798\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>In this paper, we focus on testing multivariate normality using the BHEP test with data that are missing completely at random. Our objective is twofold: first, to gain insight into the asymptotic behavior of the BHEP test statistics under two widely used approaches for handling missing data, namely complete-case analysis and imputation, and second, to compare the power performance of the test statistic under these approaches. Since complete-case approach removes all elements of the sample with at least one missing component, it might lead to the loss of information. On the other hand, we note that performing the test on imputed data as if they were complete, Type I error becomes severely distorted. To address these issues, we propose an appropriate bootstrap algorithm for approximating <i>p</i>-values. Extensive simulation studies demonstrate that both mean and median approaches exhibit greater power compared to testing with complete-case analysis, and open some questions for further research. The proposed methodology is illustrated with real-data examples.</p>\",\"PeriodicalId\":15239,\"journal\":{\"name\":\"Journal of Applied Statistics\",\"volume\":\"52 9\",\"pages\":\"1742-1759\"},\"PeriodicalIF\":1.1000,\"publicationDate\":\"2024-12-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12217108/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Applied Statistics\",\"FirstCategoryId\":\"100\",\"ListUrlMain\":\"https://doi.org/10.1080/02664763.2024.2438798\",\"RegionNum\":4,\"RegionCategory\":\"数学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q2\",\"JCRName\":\"STATISTICS & PROBABILITY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Applied Statistics","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1080/02664763.2024.2438798","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}
引用次数: 0

摘要

在本文中,我们着重于使用完全随机缺失数据的BHEP检验来测试多元正态性。我们的目标有两个:首先,深入了解BHEP检验统计量在处理缺失数据的两种广泛使用的方法下的渐近行为,即完全案例分析和imputation,其次,比较这些方法下检验统计量的功率性能。由于完全案例方法删除了样本中至少缺少一个组件的所有元素,因此可能会导致信息丢失。另一方面,我们注意到,在输入数据上执行测试,就好像它们是完整的一样,I型错误变得严重扭曲。为了解决这些问题,我们提出了一个适当的自举算法来逼近p值。广泛的模拟研究表明,与全案例分析相比,均值和中位数方法都表现出更大的功效,并为进一步研究打开了一些问题。用实际数据实例说明了所提出的方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
To impute or not? Testing multivariate normality on incomplete dataset: revisiting the BHEP test.

In this paper, we focus on testing multivariate normality using the BHEP test with data that are missing completely at random. Our objective is twofold: first, to gain insight into the asymptotic behavior of the BHEP test statistics under two widely used approaches for handling missing data, namely complete-case analysis and imputation, and second, to compare the power performance of the test statistic under these approaches. Since complete-case approach removes all elements of the sample with at least one missing component, it might lead to the loss of information. On the other hand, we note that performing the test on imputed data as if they were complete, Type I error becomes severely distorted. To address these issues, we propose an appropriate bootstrap algorithm for approximating p-values. Extensive simulation studies demonstrate that both mean and median approaches exhibit greater power compared to testing with complete-case analysis, and open some questions for further research. The proposed methodology is illustrated with real-data examples.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Journal of Applied Statistics
Journal of Applied Statistics 数学-统计学与概率论
CiteScore
3.40
自引率
0.00%
发文量
126
审稿时长
6 months
期刊介绍: Journal of Applied Statistics provides a forum for communication between both applied statisticians and users of applied statistical techniques across a wide range of disciplines. These areas include business, computing, economics, ecology, education, management, medicine, operational research and sociology, but papers from other areas are also considered. The editorial policy is to publish rigorous but clear and accessible papers on applied techniques. Purely theoretical papers are avoided but those on theoretical developments which clearly demonstrate significant applied potential are welcomed. Each paper is submitted to at least two independent referees.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信