{"title":"Two-sample test for high-dimensional covariance matrices: A normal-reference approach","authors":"Jingyi Wang , Tianming Zhu , Jin-Ting Zhang","doi":"10.1016/j.jmva.2024.105354","DOIUrl":"10.1016/j.jmva.2024.105354","url":null,"abstract":"<div><p>Testing the equality of the covariance matrices of two high-dimensional samples is a fundamental inference problem in statistics. Several tests have been proposed but they are either too liberal or too conservative when the required assumptions are not satisfied which attests that they are not always applicable in real data analysis. To overcome this difficulty, a normal-reference test is proposed and studied in this paper. It is shown that under some regularity conditions and the null hypothesis, the proposed test statistic and a chi-squared-type mixture have the same limiting distribution. It is then justified to approximate the null distribution of the proposed test statistic using that of the chi-squared-type mixture. The distribution of the chi-squared-type mixture can be well approximated using a three-cumulant matched chi-squared-approximation with its approximation parameters consistently estimated from the data. The asymptotic power of the proposed test under a local alternative is also established. Simulation studies and a real data example demonstrate that the proposed test works well in general scenarios and outperforms the existing competitors substantially in terms of size control.</p></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":null,"pages":null},"PeriodicalIF":1.4,"publicationDate":"2024-07-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141871513","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"An approximation to peak detection power using Gaussian random field theory","authors":"Yu Zhao , Dan Cheng , Armin Schwartzman","doi":"10.1016/j.jmva.2024.105346","DOIUrl":"10.1016/j.jmva.2024.105346","url":null,"abstract":"<div><p>We study power approximation formulas for peak detection using Gaussian random field theory. The approximation, based on the expected number of local maxima above the threshold <span><math><mi>u</mi></math></span>, <span><math><mrow><mi>E</mi><mrow><mo>[</mo><msub><mrow><mi>M</mi></mrow><mrow><mi>u</mi></mrow></msub><mo>]</mo></mrow></mrow></math></span>, is proved to work well under three asymptotic scenarios: small domain, large threshold, and sharp signal. An adjusted version of <span><math><mrow><mi>E</mi><mrow><mo>[</mo><msub><mrow><mi>M</mi></mrow><mrow><mi>u</mi></mrow></msub><mo>]</mo></mrow></mrow></math></span> is also proposed to improve accuracy when the expected number of local maxima <span><math><mrow><mi>E</mi><mrow><mo>[</mo><msub><mrow><mi>M</mi></mrow><mrow><mo>−</mo><mi>∞</mi></mrow></msub><mo>]</mo></mrow></mrow></math></span> exceeds 1. Cheng and Schwartzman (2018) developed explicit formulas for <span><math><mrow><mi>E</mi><mrow><mo>[</mo><msub><mrow><mi>M</mi></mrow><mrow><mi>u</mi></mrow></msub><mo>]</mo></mrow></mrow></math></span> of smooth isotropic Gaussian random fields with zero mean. In this paper, these formulas are extended to allow for rotational symmetric mean functions, making them applicable not only for power calculations but also for other areas of application that involve non-centered Gaussian random fields. We also apply our formulas to 2D and 3D simulated datasets, and the 3D data is induced by a group analysis of fMRI data from the Human Connectome Project to measure performance in a realistic setting.</p></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":null,"pages":null},"PeriodicalIF":1.4,"publicationDate":"2024-07-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141871667","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Double penalized variable selection for high-dimensional partial linear mixed effects models","authors":"Yiping Yang, Chuanqin Luo, Weiming Yang","doi":"10.1016/j.jmva.2024.105345","DOIUrl":"10.1016/j.jmva.2024.105345","url":null,"abstract":"<div><p>In this study, we address the selection of both fixed and random effects in partial linear mixed effects models. By combining B-spline and QR decomposition techniques, we propose a double-penalized likelihood procedure for both estimating and selecting these effects. Furthermore, we introduce an orthogonality-based method to estimate the non-parametric component, ensuring that the fixed and random effects are separated without any mutual interference. The asymptotic properties of the resulting estimators are investigated under mild conditions. Simulation studies are conducted to evaluate the finite sample performance of the proposed method. Finally, we demonstrate the practical applicability of our methodology by analyzing a real data.</p></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":null,"pages":null},"PeriodicalIF":1.4,"publicationDate":"2024-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141630322","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Stochastic hyperplane-based ranks and their use in multivariate portmanteau tests","authors":"Šárka Hudecová , Miroslav Šiman","doi":"10.1016/j.jmva.2024.105344","DOIUrl":"10.1016/j.jmva.2024.105344","url":null,"abstract":"<div><p>The article proposes and justifies an optimal rank-based portmanteau test of multivariate elliptical strict white noise against multivariate serial dependence. It is based on new stochastic hyperplane-based ranks that are simpler and easier to compute than other usable hyperplane-based competitors and still share with them many good properties such as their distribution-free nature, affine invariance, efficiency, robustness and weak moment assumptions. The finite-sample performance of the portmanteau test is illustrated empirically in a small Monte Carlo simulation study.</p></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":null,"pages":null},"PeriodicalIF":1.4,"publicationDate":"2024-06-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141785479","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Composite expectile estimation in partial functional linear regression model","authors":"Ping Yu , Xinyuan Song , Jiang Du","doi":"10.1016/j.jmva.2024.105343","DOIUrl":"https://doi.org/10.1016/j.jmva.2024.105343","url":null,"abstract":"<div><p>Recent research and substantive studies have shown growing interest in expectile regression (ER) procedures. Similar to quantile regression, ER with respect to different expectile levels can provide a comprehensive picture of the conditional distribution of a response variable given predictors. This study proposes three composite-type ER estimators to improve estimation accuracy. The proposed ER estimators include the composite estimator, which minimizes the composite expectile objective function across expectiles; the weighted expectile average estimator, which takes the weighted average of expectile-specific estimators; and the weighted composite estimator, which minimizes the weighted composite expectile objective function across expectiles. Under certain regularity conditions, we derive the convergence rate of the slope function, obtain the mean squared prediction error, and establish the asymptotic normality of the slope vector. Simulations are conducted to assess the empirical performances of various estimators. An application to the analysis of capital bike share data is presented. The numerical evidence endorses our theoretical results and confirm the superiority of the composite-type ER estimators to the conventional least squares and single ER estimators.</p></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":null,"pages":null},"PeriodicalIF":1.4,"publicationDate":"2024-06-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141485003","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
El Mehdi Issouani , Patrice Bertail , Emmanuelle Gautherat
{"title":"Exponential bounds for regularized Hotelling’s T2 statistic in high dimension","authors":"El Mehdi Issouani , Patrice Bertail , Emmanuelle Gautherat","doi":"10.1016/j.jmva.2024.105342","DOIUrl":"10.1016/j.jmva.2024.105342","url":null,"abstract":"<div><p>We obtain exponential inequalities for regularized Hotelling’s <span><math><msubsup><mrow><mi>T</mi></mrow><mrow><mi>n</mi></mrow><mrow><mn>2</mn></mrow></msubsup></math></span> statistics, that take into account the potential high dimensional aspects of the problem. We explore the finite sample properties of the tail of these statistics by deriving exponential bounds for symmetric distributions and also for general distributions under weak moment assumptions (we never assume exponential moments). For this, we use a penalized estimator of the covariance matrix and propose an optimal choice for the penalty coefficient.</p></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":null,"pages":null},"PeriodicalIF":1.4,"publicationDate":"2024-06-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0047259X24000496/pdfft?md5=cd918fc00e938bad85311ad3c899e4a8&pid=1-s2.0-S0047259X24000496-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141392223","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Fisher’s pioneering work on discriminant analysis and its impact on Artificial Intelligence","authors":"Kanti V. Mardia","doi":"10.1016/j.jmva.2024.105341","DOIUrl":"10.1016/j.jmva.2024.105341","url":null,"abstract":"<div><p>Sir Ronald Aylmer Fisher opened many new areas in Multivariate Analysis, and the one which we will consider is discriminant analysis. Several papers by Fisher and others followed from his seminal paper in 1936 where he coined the name discrimination function. Historically, his four papers on discriminant analysis during 1936–1940 connect to the contemporaneous pioneering work of Hotelling and Mahalanobis. We revisit the famous iris data which Fisher used in his 1936 paper and in particular, test the hypothesis of multivariate normality for the data which he assumed. Fisher constructed his genetic discriminant motivated by this application and we provide a deeper insight into this construction; however, this construction has not been well understood as far as we know. We also indicate how the subject has developed along with the computer revolution, noting newer methods to carry out discriminant analysis, such as kernel classifiers, classification trees, support vector machines, neural networks, and deep learning. Overall, with computational power, the whole subject of Multivariate Analysis has changed its emphasis but the impact of this Fisher’s pioneering work continues as an integral part of supervised learning in Artificial Intelligence (AI).</p></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":null,"pages":null},"PeriodicalIF":1.6,"publicationDate":"2024-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141403667","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Max-convolution processes with random shape indicator kernels","authors":"Pavel Krupskii , Raphaël Huser","doi":"10.1016/j.jmva.2024.105340","DOIUrl":"https://doi.org/10.1016/j.jmva.2024.105340","url":null,"abstract":"<div><p>In this paper, we introduce a new class of models for spatial data obtained from max-convolution processes based on indicator kernels with random shape. We show that these models have appealing dependence properties including tail dependence at short distances and independence at long distances. We further consider max-convolutions between such processes and processes with tail independence, in order to separately control the bulk and tail dependence behaviors, and to increase flexibility of the model at longer distances, in particular, to capture intermediate tail dependence. We show how parameters can be estimated using a weighted pairwise likelihood approach, and we conduct an extensive simulation study to show that the proposed inference approach is feasible in relatively high dimensions and it yields accurate parameter estimates in most cases. We apply the proposed methodology to analyze daily temperature maxima measured at 100 monitoring stations in the state of Oklahoma, US. Our results indicate that our proposed model provides a good fit to the data, and that it captures both the bulk and the tail dependence structures accurately.</p></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":null,"pages":null},"PeriodicalIF":1.6,"publicationDate":"2024-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0047259X24000472/pdfft?md5=6e148f4b405bc0c38b2fef0ced10dc6b&pid=1-s2.0-S0047259X24000472-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141313857","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Low-rank tensor regression for selection of grouped variables","authors":"Yang Chen, Ziyan Luo, Lingchen Kong","doi":"10.1016/j.jmva.2024.105339","DOIUrl":"https://doi.org/10.1016/j.jmva.2024.105339","url":null,"abstract":"<div><p>Low-rank tensor regression (LRTR) problems are widely studied in statistics and machine learning, in which the regressors are generally grouped by clustering strongly correlated variables or variables corresponding to different levels of the same predictive factor in many practical applications. By virtue of the idea of group selection in the classical linear regression framework, we propose an LRTR method for adaptive selection of grouped variables in this article, which is formulated as a group SLOPE penalized low-rank, orthogonally decomposable tensor optimization problem. Moreover, we introduce the notion of tensor group false discovery rate (TgFDR) to measure the group selection performance. The proposed regression method provably controls TgFDR and achieves the asymptotically minimax estimate under the assumption that variable groups are orthogonal to each other. Finally, an alternating minimization algorithm is developed for efficient problem resolution. We demonstrate the performance of our proposed method in group selection and low-rank estimation through simulation studies and real dataset analysis.</p></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":null,"pages":null},"PeriodicalIF":1.6,"publicationDate":"2024-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141323989","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Bias correction for kernel density estimation with spherical data","authors":"Yasuhito Tsuruta","doi":"10.1016/j.jmva.2024.105338","DOIUrl":"10.1016/j.jmva.2024.105338","url":null,"abstract":"<div><p>Kernel density estimations with spherical data can flexibly estimate the shape of an underlying density, including rotationally symmetric, skewed, and multimodal distributions. Standard estimators are generally based on rotationally symmetric kernel functions such as the von Mises kernel function. Unfortunately, their mean integrated squared error does not have root-<span><math><mi>n</mi></math></span> consistency and increasing the dimension slows its convergence rate. Therefore, this study aims to improve its accuracy by correcting this bias. It proposes bias correction methods by applying the generalized jackknifing method that can be generated from the von Mises kernel function. We also obtain the asymptotic mean integrated squared errors of the proposed estimators. We find that the convergence rates of the proposed estimators are higher than those of previous estimators. Further, a numerical experiment shows that the proposed estimators perform better than the von Mises kernel density estimators in finite samples in scenarios that are mixtures of von Mises densities.</p></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":null,"pages":null},"PeriodicalIF":1.6,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141281345","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}