Measures of multivariate skewness and kurtosis in high-dimensional framework

Q4 Mathematics
Kazuyuki Koizumi, Takuma Sumikawa, T. Pavlenko
{"title":"Measures of multivariate skewness and kurtosis in high-dimensional framework","authors":"Kazuyuki Koizumi, Takuma Sumikawa, T. Pavlenko","doi":"10.55937/sut/1424858950","DOIUrl":null,"url":null,"abstract":"Skewness and kurtosis characteristics of a multivariate p-dimensional distribution introduced by Mardia (1970) have been used in various testing procedures and demonstrated attractive asymptotic properties in large sample settings. However these characteristics are not designed for high-dimensional problems where the dimensionality, p can largely exceeds the sample size, N. Such type of high-dimensional data are commonly encountered in modern statistical applications. This the suggests that new measures of skewness and kurtosis that can accommodate high-dimensional settings must be derived and carefully studied. In this paper, we show that, by exploiting the dependence structure, new expressions for skewness and kurtosis are introduced as an extension of the corresponding Mardia’s measures, which uses the potential advantages that the block-diagonal covariance structure has to offer in high dimensions. Asymptotic properties of newly derived measures are investigated and the cumulant based characterizations are presented along with of applications to a mixture of multivariate normal distributions and multivariate Laplace distribution, for which the explicit expressions of skewness and kurto-sis are obtained. Test statistics based on the new measures of skewness and kurtosis are proposed for testing a distribution shape, and their limit distributions are established in the asymptotic framework where N → ∞ and p is fixed but large, including p > N. For the dependence structure learning, the gLasso based technique is explored followed by AIC step which we propose for optimization of the gLasso candidate model. Performance accuracy of the test procedures based on our estimators of skewness and kurtosis are evaluated using Monte Carlo simulations and the validity of the suggested approach is shown for a number of cases when p > N.","PeriodicalId":38708,"journal":{"name":"SUT Journal of Mathematics","volume":"1 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2014-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"19","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"SUT Journal of Mathematics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.55937/sut/1424858950","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Mathematics","Score":null,"Total":0}
引用次数: 19

Abstract

Skewness and kurtosis characteristics of a multivariate p-dimensional distribution introduced by Mardia (1970) have been used in various testing procedures and demonstrated attractive asymptotic properties in large sample settings. However these characteristics are not designed for high-dimensional problems where the dimensionality, p can largely exceeds the sample size, N. Such type of high-dimensional data are commonly encountered in modern statistical applications. This the suggests that new measures of skewness and kurtosis that can accommodate high-dimensional settings must be derived and carefully studied. In this paper, we show that, by exploiting the dependence structure, new expressions for skewness and kurtosis are introduced as an extension of the corresponding Mardia’s measures, which uses the potential advantages that the block-diagonal covariance structure has to offer in high dimensions. Asymptotic properties of newly derived measures are investigated and the cumulant based characterizations are presented along with of applications to a mixture of multivariate normal distributions and multivariate Laplace distribution, for which the explicit expressions of skewness and kurto-sis are obtained. Test statistics based on the new measures of skewness and kurtosis are proposed for testing a distribution shape, and their limit distributions are established in the asymptotic framework where N → ∞ and p is fixed but large, including p > N. For the dependence structure learning, the gLasso based technique is explored followed by AIC step which we propose for optimization of the gLasso candidate model. Performance accuracy of the test procedures based on our estimators of skewness and kurtosis are evaluated using Monte Carlo simulations and the validity of the suggested approach is shown for a number of cases when p > N.
高维框架中多元偏度和峰度的度量
Mardia(1970)引入的多元p维分布的偏度和峰度特征已被用于各种测试程序,并在大样本设置中显示出有吸引力的渐近特性。然而,这些特征不是为高维问题而设计的,其中维数p可能大大超过样本量n。这种类型的高维数据在现代统计应用中经常遇到。这表明,必须推导并仔细研究能够适应高维环境的偏度和峰度的新度量。在本文中,我们表明,通过利用依赖结构,引入了新的偏度和峰度表达式,作为相应的Mardia度量的扩展,它利用了块对角协方差结构在高维中所具有的潜在优势。研究了新导出测度的渐近性质,给出了基于累积量的表征,并应用于多元正态分布和多元拉普拉斯分布的混合分布,得到了偏度和峰度的显式表达式。提出了基于偏度和峭度新度量的检验统计量用于检验分布形状,并在N→∞和p固定但较大(包括p > N)的渐近框架中建立了它们的极限分布。对于依赖结构学习,探索了基于gLasso的技术,然后提出了AIC步骤来优化gLasso候选模型。使用蒙特卡罗模拟评估了基于我们的偏度和峰度估计的测试程序的性能准确性,并在p > N的许多情况下显示了所建议方法的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
SUT Journal of Mathematics
SUT Journal of Mathematics Mathematics-Mathematics (all)
CiteScore
0.30
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信