{"title":"Equality tests of covariance matrices under a low-dimensional factor structure","authors":"Masashi Hyodo , Takahiro Nishiyama , Hiroki Watanabe , Tomoyuki Nakagawa , Kouji Tahata","doi":"10.1016/j.jmva.2024.105397","DOIUrl":"10.1016/j.jmva.2024.105397","url":null,"abstract":"<div><div>We propose an equality test to compare two covariance matrices in a high-dimensional framework while accommodating a low-dimensional latent factor model. We show that null limiting distributions of the test statistics follow a weighted mixture of chi-square distributions under a high-dimensional asymptotic regime combined with weak technical conditions. This distribution depends on the noise covariance matrix and the number of latent factors. Because latent factors are often unknown, we employ an estimation that builds on recent advances in random matrix theory. A numerical study demonstrates the asymptotic power of the proposed test and confirms its favorable analytical properties compared to existing procedures.</div></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"206 ","pages":"Article 105397"},"PeriodicalIF":1.4,"publicationDate":"2024-12-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143138000","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Jakub Woźny , Piotr Jaworski , Damian Jelito , Marcin Pitera , Agnieszka Wyłomańska
{"title":"Gaussian dependence structure pairwise goodness-of-fit testing based on conditional covariance and the 20/60/20 rule","authors":"Jakub Woźny , Piotr Jaworski , Damian Jelito , Marcin Pitera , Agnieszka Wyłomańska","doi":"10.1016/j.jmva.2024.105396","DOIUrl":"10.1016/j.jmva.2024.105396","url":null,"abstract":"<div><div>We present a novel data-oriented statistical framework that assesses the presumed Gaussian dependence structure in a pairwise setting. This refers to both multivariate normality and normal copula goodness-of-fit testing. The proposed test clusters the data according to the 20/60/20 rule and confronts conditional covariance (or correlation) estimates on the obtained subsets. The corresponding test statistic has a natural practical interpretation, desirable statistical properties, and asymptotic pivotal distribution under the multivariate normality assumption. We illustrate the usefulness of the introduced framework using extensive power simulation studies and show that our approach outperforms popular benchmark alternatives. Also, we apply the proposed methodology to exemplary commodity and equity market data.</div></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"206 ","pages":"Article 105396"},"PeriodicalIF":1.4,"publicationDate":"2024-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143138044","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Alteration detection of tensor dependence structure via sparsity-exploited reranking algorithm","authors":"Li Ma , Shenghao Qin , Yin Xia","doi":"10.1016/j.jmva.2024.105395","DOIUrl":"10.1016/j.jmva.2024.105395","url":null,"abstract":"<div><div>Tensor-valued data arise frequently from a wide variety of scientific applications, and many among them can be translated into an alteration detection problem of tensor dependence structures. In this article, we formulate the problem under the popularly adopted tensor-normal distributions and aim at two-sample correlation/partial correlation comparisons of tensor-valued observations. Through decorrelation and centralization, a separable covariance structure is employed to pool sample information from different tensor modes to enhance the power of the test. Additionally, we propose a novel Sparsity-Exploited Reranking Algorithm (SERA) to further improve the multiple testing efficiency. Such efficiency gain is achieved by incorporating a carefully constructed auxiliary tensor sequence to rerank the <span><math><mi>p</mi></math></span>-values. Besides the tensor framework, SERA is also generally applicable to a wide range of two-sample large-scale inference problems with sparsity structures, and is of independent interest. The asymptotic properties of the proposed test are derived and the algorithm is shown to control the false discovery at the pre-specified level. We demonstrate the efficacy of the proposed method through intensive simulations and two scientific applications.</div></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"206 ","pages":"Article 105395"},"PeriodicalIF":1.4,"publicationDate":"2024-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143138043","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"New multivariate Gini’s indices","authors":"Marco Capaldo , Jorge Navarro","doi":"10.1016/j.jmva.2024.105394","DOIUrl":"10.1016/j.jmva.2024.105394","url":null,"abstract":"<div><div>The Gini’s mean difference was defined as the expected absolute difference between a random variable and its independent copy. The corresponding normalized version, namely Gini’s index, denotes two times the area between the egalitarian line and the Lorenz curve. Both are dispersion indices because they quantify how far a random variable and its independent copy are. Aiming to measure dispersion in the multivariate case, we define and study new Gini’s indices. For the bivariate case we provide several results and we point out that they are “dependence-dispersion” indices. Covariance representations are exhibited, with an interpretation also in terms of conditional distributions. Further results, bounds and illustrative examples are discussed too. Multivariate extensions are defined, aiming to apply both indices in more general settings. Then, we define efficiency Gini’s indices for any semi-coherent system and we discuss about their interpretation. Empirical versions are considered as well in order to apply multivariate Gini’s indices to data.</div></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"206 ","pages":"Article 105394"},"PeriodicalIF":1.4,"publicationDate":"2024-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143137983","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Keunbaik Lee , Jongwoo Choi , Eun Jin Jang , Dipak Dey
{"title":"Multivariate robust linear models for multivariate longitudinal data","authors":"Keunbaik Lee , Jongwoo Choi , Eun Jin Jang , Dipak Dey","doi":"10.1016/j.jmva.2024.105392","DOIUrl":"10.1016/j.jmva.2024.105392","url":null,"abstract":"<div><div>Linear models commonly used in longitudinal data analysis often assume a multivariate normal distribution. This assumption, however, can lead to biased mean parameter estimates in the presence of outliers. To address this, alternative linear models based on multivariate t distributions have been developed. In this paper, we review the commonly used multivariate distributions applicable to multivariate longitudinal data and introduce multivariate Laplace linear models (MLLMs) that are designed to handle outliers effectively. These models incorporate a scale matrix that is autoregressive, heteroscedastic, and positive definite, using modified Cholesky and hypersphere decompositions. We conduct simulation studies and apply these models to a real data example, comparing the performance of MLLMs with multivariate normal linear models (MNLMs) and multivariate t linear models (MTLMs), and providing insights on when each model is most appropriate.</div></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"206 ","pages":"Article 105392"},"PeriodicalIF":1.4,"publicationDate":"2024-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142744156","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A general approach for testing independence in Hilbert spaces","authors":"Daniel Gaigall , Shunyao Wu , Hua Liang","doi":"10.1016/j.jmva.2024.105384","DOIUrl":"10.1016/j.jmva.2024.105384","url":null,"abstract":"<div><div>We generalize the projection correlation idea for testing independence of random vectors which is known as a powerful method in multivariate analysis. A universal Hilbert space approach makes the new testing procedures useful in various cases and ensures the applicability to high or even infinite dimensional data. We prove that the new tests keep the significance level under the null hypothesis of independence exactly and can detect any alternative of dependence in the limit, in particular in settings where the dimensions of the observations is infinite or tend to infinity simultaneously with the sample size. Simulations demonstrate that the generalization does not impair the good performance of the approach and confirm our theoretical findings. Furthermore, we describe the implementation of the new approach and present a real data example for illustration.</div></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"206 ","pages":"Article 105384"},"PeriodicalIF":1.4,"publicationDate":"2024-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142723652","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Sparse functional varying-coefficient mixture regression","authors":"Qingzhi Zhong , Xinyuan Song","doi":"10.1016/j.jmva.2024.105383","DOIUrl":"10.1016/j.jmva.2024.105383","url":null,"abstract":"<div><div>The functional varying-coefficient model (FVCM) provides a simple yet efficient method for function on scalar regression. However, classical FVCM typically assumes that varying associations between functional responses and scalar covariates are identical for all subjects and nonzero in the entire domain of functional measures. This study considers sparse functional varying-coefficient mixture regression, which allows heterogeneous regression associations and dependency structure among multiple functional responses and accommodates functional sparsity in varying coefficient functions. Moreover, we devise a computationally efficient EM algorithm with a double-sparse penalty for estimation. We show that the proposed estimator is consistent, can uncover sparse subregions, and simultaneously select the number of clusters with probability tending to one. Simulation studies and an application to the Alzheimer’s Disease Neuroimaging Initiative study confirm that the proposed method yields more interpretable results and a much lower classification error than existing methods.</div></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"206 ","pages":"Article 105383"},"PeriodicalIF":1.4,"publicationDate":"2024-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142704784","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Maximum likelihood estimation of elliptical tail","authors":"Moosup Kim , Sangyeol Lee","doi":"10.1016/j.jmva.2024.105382","DOIUrl":"10.1016/j.jmva.2024.105382","url":null,"abstract":"<div><div>This study is focused on the efficient estimation of the elliptical tail. Initially, we derive the density function of the spectral measure of an elliptical distribution concerning a dominating measure on the unit sphere, which consequently leads to the density function of the elliptical tail. Subsequently, we propose a maximum likelihood estimation based on the derived density function class. The resulting maximum likelihood estimator (MLE) is proven to be consistent and asymptotically normal. Moreover, it is demonstrated that the MLE is asymptotically efficient, with the added advantage that its asymptotic covariance matrix can be feasibly estimated at a low computational cost. A simulation study and real data analysis are conducted to illustrate the efficacy of the proposed method.</div></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"205 ","pages":"Article 105382"},"PeriodicalIF":1.4,"publicationDate":"2024-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142658552","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Lucas Reding , Andrés F. López-Lopera , François Bachoc
{"title":"Covariance parameter estimation of Gaussian processes with approximated functional inputs","authors":"Lucas Reding , Andrés F. López-Lopera , François Bachoc","doi":"10.1016/j.jmva.2024.105380","DOIUrl":"10.1016/j.jmva.2024.105380","url":null,"abstract":"<div><div>We consider the problem of covariance parameter estimation for Gaussian processes with functional inputs. Our study addresses scenarios where exact functional inputs are available and where only approximate versions of these functions are accessible. From an increasing-domain asymptotics perspective, we first establish the asymptotic consistency and normality of the maximum likelihood estimator for the exact inputs. Then, by accounting for approximation errors, we certify the robustness of practical implementations that rely on conventional sampling methods or projections onto a functional basis. Loosely speaking, both consistency and normality continue to hold when the approximation error becomes negligible, a condition often met as the number of samples or basis functions becomes large. To ensure broad applicability, our asymptotic analysis is conducted for any Hilbert space of inputs. Our findings are illustrated through analytical examples, including the case of non-randomly perturbed grids, as well as several numerical illustrations.</div></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"205 ","pages":"Article 105380"},"PeriodicalIF":1.4,"publicationDate":"2024-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142592553","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Cristian Castiglione , Eleonora Arnone , Mauro Bernardi , Alessio Farcomeni , Laura M. Sangalli
{"title":"PDE-regularised spatial quantile regression","authors":"Cristian Castiglione , Eleonora Arnone , Mauro Bernardi , Alessio Farcomeni , Laura M. Sangalli","doi":"10.1016/j.jmva.2024.105381","DOIUrl":"10.1016/j.jmva.2024.105381","url":null,"abstract":"<div><div>We consider the problem of estimating the conditional quantiles of an unknown distribution from data gathered on a spatial domain. We propose a spatial quantile regression model with differential regularisation. The penalisation involves a partial differential equation defined over the considered spatial domain, that can display a complex geometry. Such regularisation permits, on one hand, to model complex anisotropy and non-stationarity patterns, possibly on the basis of problem-specific knowledge, and, on the other hand, to comply with the complex conformation of the spatial domain. We define an innovative functional Expectation–Maximisation algorithm, to estimate the unknown quantile surface. We moreover describe a suitable discretisation of the estimation problem, and investigate the theoretical properties of the resulting estimator. The performance of the proposed method is assessed by simulation studies, comparing with state-of-the-art techniques for spatial quantile regression. Finally, the considered model is applied to two real data analyses, the first concerning rainfall measurements in Switzerland and the second concerning sea surface conductivity data in the Gulf of Mexico.</div></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"205 ","pages":"Article 105381"},"PeriodicalIF":1.4,"publicationDate":"2024-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142572576","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}