{"title":"A High-Dimensional Two-Sample Test for Non-Gaussian Data under a Strongly Spiked Eigenvalue Model","authors":"Aki Ishii","doi":"10.14490/JJSS.47.273","DOIUrl":null,"url":null,"abstract":"In this paper, we discuss two-sample tests for high-dimension, non-Gaussian data. We suppose that two classes have a strongly spiked eigenvalue model. First, we investigate the noise space for high-dimension, non-Gaussian data. A two-sample test is proposed by using the cross-data-matrix (CDM) methodology and its power is derived under some regularity conditions when the dimension is very large. We discuss the validity of assumptions. We check the performance of the proposed two-sample test procedure by simulations. Finally, we demonstrate the proposed two-sample test in actual data analyses.","PeriodicalId":326924,"journal":{"name":"Journal of the Japan Statistical Society. Japanese issue","volume":"30 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-12-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of the Japan Statistical Society. Japanese issue","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.14490/JJSS.47.273","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6
Abstract
In this paper, we discuss two-sample tests for high-dimension, non-Gaussian data. We suppose that two classes have a strongly spiked eigenvalue model. First, we investigate the noise space for high-dimension, non-Gaussian data. A two-sample test is proposed by using the cross-data-matrix (CDM) methodology and its power is derived under some regularity conditions when the dimension is very large. We discuss the validity of assumptions. We check the performance of the proposed two-sample test procedure by simulations. Finally, we demonstrate the proposed two-sample test in actual data analyses.