New two‐sample test utilizing interpoint distance discrepancy
Dong Xu
DOI: 10.1002/sta4.712 (https://doi.org/10.1002/sta4.712)
Published: 2024-07-22 (Journal Article)
Citation count: 0
Abstract
In this paper, we propose a novel two‐sample test for multivariate sample spaces. The test statistic is the mean of the absolute differences between average interpoint distances. A permutation procedure establishes the critical value of the test. In comprehensive simulation studies, we compare the proposed test with the K‐nearest neighbour test and the energy test. The results show that the proposed test has advantages over the other two tests, particularly in high‐dimensional sample spaces. This advantage is further validated by an application to UCR time series datasets.
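The abstract does not spell out the statistic, so the following Python sketch illustrates only the general idea of a permutation test built on interpoint distances. It assumes one plausible reading: for every point in the pooled sample, take the absolute difference between its average distance to the first sample and its average distance to the second, then average over all points. The function names `interpoint_statistic` and `permutation_test`, the Euclidean metric, and the default of 999 permutations are illustrative assumptions, not the paper's definitions.

```python
import numpy as np


def interpoint_statistic(dist, labels):
    """Mean over pooled points of |avg distance to X - avg distance to Y|.

    ASSUMPTION: this is one reading of the abstract's wording, not
    necessarily the statistic defined in the paper.
    dist   : (n, n) symmetric matrix of pairwise distances, zero diagonal
    labels : (n,) integer array, 0 for sample X and 1 for sample Y
    Each sample needs at least two points.
    """
    in_x = labels == 0
    in_y = ~in_x
    # A point's zero distance to itself is excluded from its own-sample
    # average, so the divisor is one smaller for the point's own sample.
    n_x = in_x.sum() - in_x.astype(int)
    n_y = in_y.sum() - in_y.astype(int)
    avg_to_x = dist[:, in_x].sum(axis=1) / n_x
    avg_to_y = dist[:, in_y].sum(axis=1) / n_y
    return float(np.mean(np.abs(avg_to_x - avg_to_y)))


def permutation_test(x, y, n_perm=999, seed=0):
    """Return (observed statistic, permutation p-value) for samples x and y.

    x, y : arrays of shape (n_x, d) and (n_y, d)
    """
    rng = np.random.default_rng(seed)
    pooled = np.vstack([x, y])
    # Pairwise Euclidean distances, computed once and reused for every
    # permutation (only the labels are shuffled).
    diff = pooled[:, None, :] - pooled[None, :, :]
    dist = np.sqrt((diff ** 2).sum(axis=-1))
    labels = np.concatenate([np.zeros(len(x), dtype=int),
                             np.ones(len(y), dtype=int)])
    observed = interpoint_statistic(dist, labels)
    perm_stats = np.array([
        interpoint_statistic(dist, rng.permutation(labels))
        for _ in range(n_perm)
    ])
    # The "+1" terms count the observed labelling as one permutation,
    # which keeps the p-value strictly positive.
    p_value = (1 + np.sum(perm_stats >= observed)) / (n_perm + 1)
    return observed, float(p_value)


if __name__ == "__main__":
    rng = np.random.default_rng(42)
    x = rng.normal(0.0, 1.0, size=(30, 50))  # 30 points in 50 dimensions
    y = rng.normal(0.5, 1.0, size=(30, 50))  # same shape, shifted mean
    stat, p = permutation_test(x, y)
    print(f"statistic = {stat:.3f}, p-value = {p:.3f}")
```

Shuffling the labels rather than the data lets the distance matrix be computed a single time. Rejecting when the p-value is at most the chosen level is equivalent to comparing the observed statistic with a permutation critical value.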