Non-degenerate U-statistics for data missing completely at random with application to testing independence

IF 0.7 4区 数学 Q3 STATISTICS & PROBABILITY
Stat Pub Date : 2023-11-27 DOI:10.1002/sta4.634
Danijel Aleksić, Marija Cuparić, Bojana Milošević
{"title":"Non-degenerate U-statistics for data missing completely at random with application to testing independence","authors":"Danijel Aleksić, Marija Cuparić, Bojana Milošević","doi":"10.1002/sta4.634","DOIUrl":null,"url":null,"abstract":"Although the era of digitalization has enabled access to large quantities of data, due to their insufficient structuring, some data are often missing, and sometimes, the percentage of missing data is significant compared to the entire sample. On the other hand, most of the statistical methodology is designed for complete data. Here, we explore the asymptotic properties of non-degenerate <i>U</i>-statistics when the data are missing completely at random and a complete-case approach is utilized. The obtained results are applied to the estimator of Kendall's <math altimg=\"urn:x-wiley:sta4:media:sta4634:sta4634-math-0001\" display=\"inline\" location=\"graphic/sta4634-math-0001.png\">\n<mi>t</mi>\n<mi>a</mi>\n<mi>u</mi></math> used for testing independence. In this context, the median-based imputation approach is also considered, and asymptotic properties are explored. In addition, both complete-case and median imputation approaches are compared in an extensive Monte Carlo study.","PeriodicalId":56159,"journal":{"name":"Stat","volume":"368 1","pages":""},"PeriodicalIF":0.7000,"publicationDate":"2023-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Stat","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1002/sta4.634","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}
引用次数: 1

Abstract

Although the era of digitalization has enabled access to large quantities of data, due to their insufficient structuring, some data are often missing, and sometimes, the percentage of missing data is significant compared to the entire sample. On the other hand, most of the statistical methodology is designed for complete data. Here, we explore the asymptotic properties of non-degenerate U-statistics when the data are missing completely at random and a complete-case approach is utilized. The obtained results are applied to the estimator of Kendall's t a u used for testing independence. In this context, the median-based imputation approach is also considered, and asymptotic properties are explored. In addition, both complete-case and median imputation approaches are compared in an extensive Monte Carlo study.
完全随机缺失数据的非退化u统计量,用于检验独立性
虽然数字化时代使大量数据得以访问,但由于其结构化不足,一些数据经常缺失,有时缺失数据的百分比与整个样本相比显着。另一方面,大多数统计方法是为完整的数据而设计的。本文研究了当数据完全随机缺失时非退化u统计量的渐近性质,并采用完全情况方法。将所得结果应用于Kendall's tau的估计量,用于检验独立性。在这种情况下,也考虑了基于中位数的imputation方法,并探讨了渐近性质。此外,在一个广泛的蒙特卡罗研究中,对完全情况和中位数方法进行了比较。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Stat
Stat Decision Sciences-Statistics, Probability and Uncertainty
CiteScore
1.10
自引率
0.00%
发文量
85
期刊介绍: Stat is an innovative electronic journal for the rapid publication of novel and topical research results, publishing compact articles of the highest quality in all areas of statistical endeavour. Its purpose is to provide a means of rapid sharing of important new theoretical, methodological and applied research. Stat is a joint venture between the International Statistical Institute and Wiley-Blackwell. Stat is characterised by: • Speed - a high-quality review process that aims to reach a decision within 20 days of submission. • Concision - a maximum article length of 10 pages of text, not including references. • Supporting materials - inclusion of electronic supporting materials including graphs, video, software, data and images. • Scope - addresses all areas of statistics and interdisciplinary areas. Stat is a scientific journal for the international community of statisticians and researchers and practitioners in allied quantitative disciplines.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信