Model-based statistical depth for matrix data

IF 0.7 4区数学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Statistics and Its Interface Pub Date : 2024-02-01 DOI:10.4310/23-sii829

Yue Mu, Guanyu Hu, Wei Wu

{"title":"Model-based statistical depth for matrix data","authors":"Yue Mu, Guanyu Hu, Wei Wu","doi":"10.4310/23-sii829","DOIUrl":null,"url":null,"abstract":"The field of matrix data learning has witnessed significant advancements in recent years, encompassing diverse datasets such as medical images, social networks, and personalized recommendation systems. These advancements have found widespread application in various domains, including medicine, biology, public health, engineering, finance, economics, sports analytics, and environmental sciences. While extensive research has been conducted on estimation, inference, prediction, and computation for matrix data, the ranking problem has not received adequate attention. Statistical depth, a measure providing a centeroutward rank for different data types, has been introduced in the past few decades. However, its exploration has been limited due to the complexity of the second and higher orderstatistics. In this paper, we propose an approach to rank matrix data by employing a model-based depth framework. Our methodology involves estimating the eigen-decomposition of a 4th-order covariance tensor. To enable this process using conventional matrix operations, we specify the tensor product operator between matrices and 4th-order tensors. Furthermore, we introduce a Kronecker product form on the covariance to enhance the robustness and efficiency of the estimation process, effectively reducing the number of parameters in the model. Based on this new framework, we develop an efficient algorithm to estimate the model-based statistical depth. To validate the effectiveness of our proposed method, we conduct simulations and apply it to two real-world applications: field goal attempts of NBA players and global temperature anomalies.","PeriodicalId":51230,"journal":{"name":"Statistics and Its Interface","volume":"281 1","pages":""},"PeriodicalIF":0.7000,"publicationDate":"2024-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Statistics and Its Interface","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.4310/23-sii829","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"MATHEMATICAL & COMPUTATIONAL BIOLOGY","Score":null,"Total":0}

引用次数: 0

Abstract

The field of matrix data learning has witnessed significant advancements in recent years, encompassing diverse datasets such as medical images, social networks, and personalized recommendation systems. These advancements have found widespread application in various domains, including medicine, biology, public health, engineering, finance, economics, sports analytics, and environmental sciences. While extensive research has been conducted on estimation, inference, prediction, and computation for matrix data, the ranking problem has not received adequate attention. Statistical depth, a measure providing a centeroutward rank for different data types, has been introduced in the past few decades. However, its exploration has been limited due to the complexity of the second and higher orderstatistics. In this paper, we propose an approach to rank matrix data by employing a model-based depth framework. Our methodology involves estimating the eigen-decomposition of a 4th-order covariance tensor. To enable this process using conventional matrix operations, we specify the tensor product operator between matrices and 4th-order tensors. Furthermore, we introduce a Kronecker product form on the covariance to enhance the robustness and efficiency of the estimation process, effectively reducing the number of parameters in the model. Based on this new framework, we develop an efficient algorithm to estimate the model-based statistical depth. To validate the effectiveness of our proposed method, we conduct simulations and apply it to two real-world applications: field goal attempts of NBA players and global temperature anomalies.

查看原文本刊更多论文

基于模型的矩阵数据统计深度

近年来，矩阵数据学习领域取得了重大进展，涵盖了医疗图像、社交网络和个性化推荐系统等各种数据集。这些进步在医学、生物学、公共卫生、工程学、金融学、经济学、体育分析和环境科学等各个领域得到了广泛应用。虽然对矩阵数据的估计、推理、预测和计算进行了广泛的研究，但排序问题却没有得到足够的重视。统计深度是一种为不同数据类型提供向心排序的测量方法，在过去几十年中已经被引入。然而，由于二阶和高阶统计的复杂性，对它的探索一直受到限制。在本文中，我们提出了一种通过采用基于模型的深度框架对矩阵数据进行排序的方法。我们的方法涉及估计四阶协方差张量的特征分解。为了使用传统的矩阵运算实现这一过程，我们指定了矩阵和四阶张量之间的张量乘积算子。此外，我们还引入了协方差的 Kronecker 积形式，以提高估计过程的稳健性和效率，从而有效减少模型中的参数数量。基于这一新框架，我们开发了一种高效算法来估计基于模型的统计深度。为了验证我们提出的方法的有效性，我们进行了模拟，并将其应用于两个现实世界的应用中：NBA 球员的射门尝试和全球温度异常。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Statistics and Its Interface MATHEMATICAL & COMPUTATIONAL BIOLOGY-MATHEMATICS, INTERDISCIPLINARY APPLICATIONS

CiteScore

0.90

自引率

12.50%

发文量

审稿时长

6 months

期刊介绍： Exploring the interface between the field of statistics and other disciplines, including but not limited to: biomedical sciences, geosciences, computer sciences, engineering, and social and behavioral sciences. Publishes high-quality articles in broad areas of statistical science, emphasizing substantive problems, sound statistical models and methods, clear and efficient computational algorithms, and insightful discussions of the motivating problems.