Annals of Statistics最新文献_第9页

The multi-armed bandit problem: An efficient nonparametric solution 多武装土匪问题：一个有效的非参数解

IF 4.5 1区数学

Annals of Statistics Pub Date : 2020-02-01 DOI: 10.1214/19-aos1809

H. Chan

引用次数: 13

Penalized generalized empirical likelihood with a diverging number of general estimating equations for censored data 截尾数据广义估计方程具有发散数的惩罚广义经验似然

IF 4.5 1区数学

Annals of Statistics Pub Date : 2020-02-01 DOI: 10.1214/19-aos1870

Nian-Sheng Tang, Xiaodong Yan, Xingqiu Zhao

{"title":"Penalized generalized empirical likelihood with a diverging number of general estimating equations for censored data","authors":"Nian-Sheng Tang, Xiaodong Yan, Xingqiu Zhao","doi":"10.1214/19-aos1870","DOIUrl":"https://doi.org/10.1214/19-aos1870","url":null,"abstract":"This article considers simultaneous variable selection and parameter estimation as well as hypothesis testing in censored regression models with unspecified parametric likelihood. For the problem, we utilize certain growing dimensional general estimating equations and propose a penalized generalized empirical likelihood using the folded concave penalties. We first construct general estimating equations attaining the semiparametric efficiency bound with censored regression data and then establish the consistency and oracle properties of the penalized generalized empirical likelihood estimators. Furthermore, we show that the penalized generalized empirical likelihood ratio test statistic has an asymptotic standard central chi-squared distribution. The conditions of local and restricted global optimality of weighted penalized generalized empirical likelihood estimators are also discussed. We present an two-layer iterative algorithm for efficient implementation, and rigorously investigate its convergence property. The good performance of the proposed methods are demonstrated by extensive simulation studies and a real data example is provided for illustration.","PeriodicalId":8032,"journal":{"name":"Annals of Statistics","volume":"48 1","pages":"607-627"},"PeriodicalIF":4.5,"publicationDate":"2020-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44061735","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3

Sparse SIR: Optimal rates and adaptive estimation 稀疏SIR:最优速率和自适应估计

IF 4.5 1区数学

Annals of Statistics Pub Date : 2020-02-01 DOI: 10.1214/18-aos1791

Kai Tan, Lei Shi, Zhou Yu

引用次数: 17

CONSISTENT SELECTION OF THE NUMBER OF CHANGE-POINTS VIA SAMPLE-SPLITTING. 通过样本分裂一致地选择改变点的数量。

IF 4.5 1区数学

Annals of Statistics Pub Date : 2020-02-01 Epub Date: 2020-02-17 DOI: 10.1214/19-aos1814

Changliang Zou, Guanghui Wang, Runze Li

{"title":"CONSISTENT SELECTION OF THE NUMBER OF CHANGE-POINTS VIA SAMPLE-SPLITTING.","authors":"Changliang Zou, Guanghui Wang, Runze Li","doi":"10.1214/19-aos1814","DOIUrl":"https://doi.org/10.1214/19-aos1814","url":null,"abstract":"In multiple change-point analysis, one of the major challenges is to estimate the number of change-points. Most existing approaches attempt to minimize a Schwarz information criterion which balances a term quantifying model fit with a penalization term accounting for model complexity that increases with the number of change-points and limits overfitting. However, different penalization terms are required to adapt to different contexts of multiple change-point problems and the optimal penalization magnitude usually varies from the model and error distribution. We propose a data-driven selection criterion that is applicable to most kinds of popular change-point detection methods, including binary segmentation and optimal partitioning algorithms. The key idea is to select the number of change-points that minimizes the squared prediction error, which measures the fit of a specified model for a new sample. We develop a cross-validation estimation scheme based on an order-preserved sample-splitting strategy, and establish its asymptotic selection consistency under some mild conditions. Effectiveness of the proposed selection criterion is demonstrated on a variety of numerical experiments and real-data examples.","PeriodicalId":8032,"journal":{"name":"Annals of Statistics","volume":"48 1","pages":"413-439"},"PeriodicalIF":4.5,"publicationDate":"2020-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7397423/pdf/nihms-1022718.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"38232848","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 28

Envelope-based sparse partial least squares 基于包络的稀疏偏最小二乘法

IF 4.5 1区数学

Annals of Statistics Pub Date : 2020-02-01 DOI: 10.1214/18-aos1796

G. Zhu, Zhihua Su

引用次数: 23

MODEL ASSISTED VARIABLE CLUSTERING: MINIMAX-OPTIMAL RECOVERY AND ALGORITHMS. 模型辅助变量聚类:最小最大最优恢复和算法。

IF 4.5 1区数学

Annals of Statistics Pub Date : 2020-02-01 Epub Date: 2020-02-17 DOI: 10.1214/18-aos1794

Florentina Bunea, Christophe Giraud, Xi Luo, Martin Royer, Nicolas Verzelen

{"title":"MODEL ASSISTED VARIABLE CLUSTERING: MINIMAX-OPTIMAL RECOVERY AND ALGORITHMS.","authors":"Florentina Bunea, Christophe Giraud, Xi Luo, Martin Royer, Nicolas Verzelen","doi":"10.1214/18-aos1794","DOIUrl":"https://doi.org/10.1214/18-aos1794","url":null,"abstract":"The problem of variable clustering is that of estimating groups of similar components of a p-dimensional vector X = (X 1, … , X p ) from n independent copies of X. There exists a large number of algorithms that return data-dependent groups of variables, but their interpretation is limited to the algorithm that produced them. An alternative is model-based clustering, in which one begins by defining population level clusters relative to a model that embeds notions of similarity. Algorithms tailored to such models yield estimated clusters with a clear statistical interpretation. We take this view here and introduce the class of G-block covariance models as a background model for variable clustering. In such models, two variables in a cluster are deemed similar if they have similar associations will all other variables. This can arise, for instance, when groups of variables are noise corrupted versions of the same latent factor. We quantify the difficulty of clustering data generated from a G-block covariance model in terms of cluster proximity, measured with respect to two related, but different, cluster separation metrics. We derive minimax cluster separation thresholds, which are the metric values below which no algorithm can recover the model-defined clusters exactly, and show that they are different for the two metrics. We therefore develop two algorithms, COD and PECOK, tailored to G-block covariance models, and study their minimax-optimality with respect to each metric. Of independent interest is the fact that the analysis of the PECOK algorithm, which is based on a corrected convex relaxation of the popular K-means algorithm, provides the first statistical analysis of such algorithms for variable clustering. Additionally, we compare our methods with another popular clustering method, spectral clustering. Extensive simulation studies, as well as our data analyses, confirm the applicability of our approach.","PeriodicalId":8032,"journal":{"name":"Annals of Statistics","volume":" ","pages":"111-137"},"PeriodicalIF":4.5,"publicationDate":"2020-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9286061/pdf/nihms-1765231.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"40532443","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 23

Detecting relevant changes in the mean of nonstationary processes—A mass excess approach 检测非平稳过程均值的相关变化——一种质量过剩方法

IF 4.5 1区数学

Annals of Statistics Pub Date : 2019-12-01 DOI: 10.1214/19-aos1811

H. Dette, Weichi Wu

引用次数: 25

Joint convergence of sample autocovariance matrices when $p/nto 0$ with application $p/n为0$时样本自协方差矩阵的联合收敛性及其应用

IF 4.5 1区数学

Annals of Statistics Pub Date : 2019-12-01 DOI: 10.1214/18-aos1785

M. Bhattacharjee, A. Bose

引用次数: 6

TEST FOR HIGH DIMENSIONAL CORRELATION MATRICES. 高维相关矩阵的测试。