A hyperbolic divergence based nonparametric test for two-sample multivariate distributions

Pub Date : 2022-11-26 DOI:10.1002/cjs.11736
Roulin Wang, Wei Fan, Xueqin Wang
{"title":"A hyperbolic divergence based nonparametric test for two-sample multivariate distributions","authors":"Roulin Wang,&nbsp;Wei Fan,&nbsp;Xueqin Wang","doi":"10.1002/cjs.11736","DOIUrl":null,"url":null,"abstract":"<p>Two-sample hypothesis testing, as a fundamental problem in statistical inference, seeks to detect the difference between two probability measures and has numerous real-world applications. Current test procedures for multivariate two-sample problems typically rely on angles and lengths in a Euclidean space, or lengths in a unit hypersphere after representing data with the spherical model. This article introduces a hyperbolic divergence based on hyperbolic lengths in hyperbolic geometry, as well as a subsequent nonparametric approach to testing the multivariate two-sample problem. We investigate the properties of our test procedure and discover that our hyperbolic divergence statistic is strongly consistent and consistent against all other alternatives; we also demonstrate that its limit distribution is an infinite mixture of <math>\n <msup>\n <mrow>\n <mi>χ</mi>\n </mrow>\n <mrow>\n <mn>2</mn>\n </mrow>\n </msup></math> distributions under the null hypothesis and a normal distribution under the alternative hypothesis. To calculate the <math>\n <mrow>\n <mi>P</mi>\n </mrow></math>-value, we employ the permutation method. Furthermore, in numerical studies, we compare our method with several nonparametric procedures under various distributional assumptions and alternatives. We discover that our test procedure has some advantages when the distributions' complex correlation structures differ. Finally, we examine one real data set to show how our method can be used to test two-sample heterogeneity.</p>","PeriodicalId":0,"journal":{"name":"","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"","FirstCategoryId":"100","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/cjs.11736","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Two-sample hypothesis testing, as a fundamental problem in statistical inference, seeks to detect the difference between two probability measures and has numerous real-world applications. Current test procedures for multivariate two-sample problems typically rely on angles and lengths in a Euclidean space, or lengths in a unit hypersphere after representing data with the spherical model. This article introduces a hyperbolic divergence based on hyperbolic lengths in hyperbolic geometry, as well as a subsequent nonparametric approach to testing the multivariate two-sample problem. We investigate the properties of our test procedure and discover that our hyperbolic divergence statistic is strongly consistent and consistent against all other alternatives; we also demonstrate that its limit distribution is an infinite mixture of χ 2 distributions under the null hypothesis and a normal distribution under the alternative hypothesis. To calculate the P -value, we employ the permutation method. Furthermore, in numerical studies, we compare our method with several nonparametric procedures under various distributional assumptions and alternatives. We discover that our test procedure has some advantages when the distributions' complex correlation structures differ. Finally, we examine one real data set to show how our method can be used to test two-sample heterogeneity.

分享
查看原文
两样本多元分布的基于双曲散度的非参数检验
双样本假设检验作为统计推断中的一个基本问题,旨在检测两个概率度量之间的差异,并在现实世界中有许多应用。当前多变量双样本问题的测试程序通常依赖于欧几里得空间中的角度和长度,或者在用球面模型表示数据后的单位超球中的长度。本文介绍了双曲几何中基于双曲长度的双曲散度,以及随后检验多变量双样本问题的非参数方法。我们研究了我们的检验程序的性质,并发现我们的双曲散度统计量对所有其他选择都是强一致和一致的;我们还证明了它的极限分布是零假设下的χ 2分布和备择假设下的正态分布的无限混合。为了计算P值,我们采用置换法。此外,在数值研究中,我们将该方法与不同分布假设和选择下的几种非参数过程进行了比较。我们发现,当分布的复杂相关结构不同时,我们的测试方法具有一定的优势。最后,我们检查了一个真实的数据集,以显示如何使用我们的方法来测试双样本异质性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信