{"title":"To impute or not? Testing multivariate normality on incomplete dataset: revisiting the BHEP test.","authors":"Danijel G Aleksić, Bojana Milošević","doi":"10.1080/02664763.2024.2438798","DOIUrl":null,"url":null,"abstract":"<p><p>In this paper, we focus on testing multivariate normality using the BHEP test with data that are missing completely at random. Our objective is twofold: first, to gain insight into the asymptotic behavior of the BHEP test statistics under two widely used approaches for handling missing data, namely complete-case analysis and imputation, and second, to compare the power performance of the test statistic under these approaches. Since complete-case approach removes all elements of the sample with at least one missing component, it might lead to the loss of information. On the other hand, we note that performing the test on imputed data as if they were complete, Type I error becomes severely distorted. To address these issues, we propose an appropriate bootstrap algorithm for approximating <i>p</i>-values. Extensive simulation studies demonstrate that both mean and median approaches exhibit greater power compared to testing with complete-case analysis, and open some questions for further research. The proposed methodology is illustrated with real-data examples.</p>","PeriodicalId":15239,"journal":{"name":"Journal of Applied Statistics","volume":"52 9","pages":"1742-1759"},"PeriodicalIF":1.1000,"publicationDate":"2024-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12217108/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Applied Statistics","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1080/02664763.2024.2438798","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}
引用次数: 0
Abstract
In this paper, we focus on testing multivariate normality using the BHEP test with data that are missing completely at random. Our objective is twofold: first, to gain insight into the asymptotic behavior of the BHEP test statistics under two widely used approaches for handling missing data, namely complete-case analysis and imputation, and second, to compare the power performance of the test statistic under these approaches. Since complete-case approach removes all elements of the sample with at least one missing component, it might lead to the loss of information. On the other hand, we note that performing the test on imputed data as if they were complete, Type I error becomes severely distorted. To address these issues, we propose an appropriate bootstrap algorithm for approximating p-values. Extensive simulation studies demonstrate that both mean and median approaches exhibit greater power compared to testing with complete-case analysis, and open some questions for further research. The proposed methodology is illustrated with real-data examples.
期刊介绍:
Journal of Applied Statistics provides a forum for communication between both applied statisticians and users of applied statistical techniques across a wide range of disciplines. These areas include business, computing, economics, ecology, education, management, medicine, operational research and sociology, but papers from other areas are also considered. The editorial policy is to publish rigorous but clear and accessible papers on applied techniques. Purely theoretical papers are avoided but those on theoretical developments which clearly demonstrate significant applied potential are welcomed. Each paper is submitted to at least two independent referees.