{"title":"完全随机缺失数据的非退化u统计量,用于检验独立性","authors":"Danijel Aleksić, Marija Cuparić, Bojana Milošević","doi":"10.1002/sta4.634","DOIUrl":null,"url":null,"abstract":"Although the era of digitalization has enabled access to large quantities of data, due to their insufficient structuring, some data are often missing, and sometimes, the percentage of missing data is significant compared to the entire sample. On the other hand, most of the statistical methodology is designed for complete data. Here, we explore the asymptotic properties of non-degenerate <i>U</i>-statistics when the data are missing completely at random and a complete-case approach is utilized. The obtained results are applied to the estimator of Kendall's <math altimg=\"urn:x-wiley:sta4:media:sta4634:sta4634-math-0001\" display=\"inline\" location=\"graphic/sta4634-math-0001.png\">\n<mi>t</mi>\n<mi>a</mi>\n<mi>u</mi></math> used for testing independence. In this context, the median-based imputation approach is also considered, and asymptotic properties are explored. In addition, both complete-case and median imputation approaches are compared in an extensive Monte Carlo study.","PeriodicalId":56159,"journal":{"name":"Stat","volume":"368 1","pages":""},"PeriodicalIF":0.7000,"publicationDate":"2023-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Non-degenerate U-statistics for data missing completely at random with application to testing independence\",\"authors\":\"Danijel Aleksić, Marija Cuparić, Bojana Milošević\",\"doi\":\"10.1002/sta4.634\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Although the era of digitalization has enabled access to large quantities of data, due to their insufficient structuring, some data are often missing, and sometimes, the percentage of missing data is significant compared to the entire sample. On the other hand, most of the statistical methodology is designed for complete data. Here, we explore the asymptotic properties of non-degenerate <i>U</i>-statistics when the data are missing completely at random and a complete-case approach is utilized. The obtained results are applied to the estimator of Kendall's <math altimg=\\\"urn:x-wiley:sta4:media:sta4634:sta4634-math-0001\\\" display=\\\"inline\\\" location=\\\"graphic/sta4634-math-0001.png\\\">\\n<mi>t</mi>\\n<mi>a</mi>\\n<mi>u</mi></math> used for testing independence. In this context, the median-based imputation approach is also considered, and asymptotic properties are explored. In addition, both complete-case and median imputation approaches are compared in an extensive Monte Carlo study.\",\"PeriodicalId\":56159,\"journal\":{\"name\":\"Stat\",\"volume\":\"368 1\",\"pages\":\"\"},\"PeriodicalIF\":0.7000,\"publicationDate\":\"2023-11-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Stat\",\"FirstCategoryId\":\"100\",\"ListUrlMain\":\"https://doi.org/10.1002/sta4.634\",\"RegionNum\":4,\"RegionCategory\":\"数学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"STATISTICS & PROBABILITY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Stat","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1002/sta4.634","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}
Non-degenerate U-statistics for data missing completely at random with application to testing independence
Although the era of digitalization has enabled access to large quantities of data, due to their insufficient structuring, some data are often missing, and sometimes, the percentage of missing data is significant compared to the entire sample. On the other hand, most of the statistical methodology is designed for complete data. Here, we explore the asymptotic properties of non-degenerate U-statistics when the data are missing completely at random and a complete-case approach is utilized. The obtained results are applied to the estimator of Kendall's used for testing independence. In this context, the median-based imputation approach is also considered, and asymptotic properties are explored. In addition, both complete-case and median imputation approaches are compared in an extensive Monte Carlo study.
StatDecision Sciences-Statistics, Probability and Uncertainty
CiteScore
1.10
自引率
0.00%
发文量
85
期刊介绍:
Stat is an innovative electronic journal for the rapid publication of novel and topical research results, publishing compact articles of the highest quality in all areas of statistical endeavour. Its purpose is to provide a means of rapid sharing of important new theoretical, methodological and applied research. Stat is a joint venture between the International Statistical Institute and Wiley-Blackwell.
Stat is characterised by:
• Speed - a high-quality review process that aims to reach a decision within 20 days of submission.
• Concision - a maximum article length of 10 pages of text, not including references.
• Supporting materials - inclusion of electronic supporting materials including graphs, video, software, data and images.
• Scope - addresses all areas of statistics and interdisciplinary areas.
Stat is a scientific journal for the international community of statisticians and researchers and practitioners in allied quantitative disciplines.