The Annals of Statistics: latest articles, page 6

Conformal prediction beyond exchangeability

The Annals of Statistics Pub Date : 2022-02-27 DOI: 10.1214/23-aos2276

R. Barber, E. Candès, Aaditya Ramdas, R. Tibshirani

Abstract: Conformal prediction is a popular, modern technique for providing valid predictive inference for arbitrary machine learning models. Its validity relies on the assumptions of exchangeability of the data, and symmetry of the given model fitting algorithm as a function of the data. However, exchangeability is often violated when predictive models are deployed in practice. For example, if the data distribution drifts over time, then the data points are no longer exchangeable; moreover, in such settings, we might want to use a nonsymmetric algorithm that treats recent observations as more relevant. This paper generalizes conformal prediction to deal with both aspects: we employ weighted quantiles to introduce robustness against distribution drift, and design a new randomization technique to allow for algorithms that do not treat data points symmetrically. Our new methods are provably robust, with substantially less loss of coverage when exchangeability is violated due to distribution drift or other challenging features of real data, while also achieving the same coverage guarantees as existing conformal prediction methods if the data points are in fact exchangeable. We demonstrate the practical utility of these new tools with simulations and real-data experiments on electricity and election forecasting.
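
The weighted-quantile idea mentioned in this abstract can be illustrated with a minimal Python sketch. It is not the authors' code: the function name `weighted_conformal_quantile`, the example weights, and the convention of reserving one unit of weight at +infinity for the unseen test point are assumptions made here for illustration. The sketch takes calibration residuals, a nonnegative weight for each, and a miscoverage level `alpha`, and returns the half-width one might add to a point prediction.

```python
import math


def weighted_conformal_quantile(residuals, weights, alpha):
    """Return the (1 - alpha) quantile of a weighted residual distribution.

    Assumption (illustrative): one extra unit of weight is placed on
    +infinity to stand in for the unseen test point, so the weights are
    normalized by sum(weights) + 1.
    """
    if len(residuals) != len(weights):
        raise ValueError("residuals and weights must have the same length")
    if any(w < 0 for w in weights):
        raise ValueError("weights must be nonnegative")

    total = sum(weights) + 1.0  # the +1 is the mass reserved at +infinity
    cumulative = 0.0
    # Walk through the residuals in increasing order, accumulating mass.
    for residual, weight in sorted(zip(residuals, weights)):
        cumulative += weight / total
        if cumulative >= 1.0 - alpha:
            return residual
    # The finite residuals never reach the target level, so the
    # quantile sits on the point mass at +infinity.
    return math.inf


# Hypothetical usage: older residuals get small weights, recent ones large.
residuals = [1.0, 5.0, 2.0]
weights = [0.1, 0.1, 1.0]
half_width = weighted_conformal_quantile(residuals, weights, alpha=0.6)
print(half_width)
```

With equal weights this reduces to the usual split-conformal quantile of the residuals; weights that decay with the age of an observation give recent data more influence, which is the kind of robustness to drift the abstract describes.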

Citations: 83

A general characterization of optimal tie-breaker designs

The Annals of Statistics Pub Date : 2022-02-25 DOI: 10.1214/23-aos2275

Harrison H. Li, A. Owen

Abstract: Tie-breaker designs trade off a statistical design objective with short-term gain from preferentially assigning a binary treatment to those with high values of a running variable $x$. The design objective is any continuous function of the expected information matrix in a two-line regression model, and short-term gain is expressed as the covariance between the running variable and the treatment indicator. We investigate how to specify design functions indicating treatment probabilities as a function of $x$ to optimize these competing objectives, under external constraints on the number of subjects receiving treatment. Our results include sharp existence and uniqueness guarantees, while accommodating the ethically appealing requirement that treatment probabilities are non-decreasing in $x$. Under such a constraint, there always exists an optimal design function that is constant below and above a single discontinuity. When the running variable distribution is not symmetric or the fraction of subjects receiving the treatment is not $1/2$, our optimal designs improve upon a $D$-optimality objective without sacrificing short-term gain, compared to the three level tie-breaker designs of Owen and Varian (2020) that fix treatment probabilities at $0$, $1/2$, and $1$. We illustrate our optimal designs with data from Head Start, an early childhood government intervention program.

Citations: 3

Optimal high-dimensional and nonparametric distributed testing under communication constraints

The Annals of Statistics Pub Date : 2022-02-02 DOI: 10.1214/23-aos2269

Botond Szabó, Lasse Vuursteen, H. Zanten

Citations: 2

Minimax nonparametric estimation of pure quantum states

The Annals of Statistics Pub Date : 2022-02-01 DOI: 10.1214/21-aos2115

Samriddha Lahiry, M. Nussbaum

Citations: 0

Testing community structure for hypergraphs

The Annals of Statistics Pub Date : 2022-02-01 DOI: 10.1214/21-aos2099

Mingao Yuan, Ruiqi Liu, Yang Feng, Zuofeng Shang

Abstract: Many complex networks in the real world can be formulated as hypergraphs where community detection has been widely used. However, the fundamental question of whether communities exist or not in an observed hypergraph remains unclear. This work aims to tackle this important problem. Specifically, we systematically study when a hypergraph with community structure can be successfully distinguished from its Erdős–Rényi counterpart, and propose concrete test statistics when the models are distinguishable. The main contribution of this paper is threefold. First, we discover a phase transition in the hyperedge probability for distinguishability. Second, in the bounded-degree regime, we derive a sharp signal-to-noise ratio (SNR) threshold for distinguishability in the special two-community 3-uniform hypergraphs, and derive nearly tight SNR thresholds in the general two-community m-uniform hypergraphs. Third, in the dense regime, we propose a computationally feasible test based on sub-hypergraph counts, obtain its asymptotic distribution, and analyze its power. Our results are further extended to nonuniform hypergraphs in which a new test involving both edge and hyperedge information is proposed. The proofs rely on Janson’s contiguity theory (Combin. Probab. Comput. 4 (1995) 369–405), a high-moments driven asymptotic normality result by Gao and Wormald (Probab. Theory Related Fields 130 (2004) 368–376), and a truncation technique for analyzing the likelihood ratio.

Citations: 11

Dimension reduction for functional data based on weak conditional moments

The Annals of Statistics Pub Date : 2022-02-01 DOI: 10.1214/21-aos2091

Bing Li, Jun Song

Citations: 10

Half-trek criterion for identifiability of latent variable models

The Annals of Statistics Pub Date : 2022-01-12 DOI: 10.1214/22-aos2221

R. Barber, M. Drton, Nils Sturma, Luca Weihs

Abstract: We consider linear structural equation models with latent variables and develop a criterion to certify whether the direct causal effects between the observable variables are identifiable based on the observed covariance matrix. Linear structural equation models assume that both observed and latent variables solve a linear equation system featuring stochastic noise terms. Each model corresponds to a directed graph whose edges represent the direct effects that appear as coefficients in the equation system. Prior research has developed a variety of methods to decide identifiability of direct effects in a latent projection framework, in which the confounding effects of the latent variables are represented by correlation among noise terms. This approach is effective when the confounding is sparse and affects only small subsets of the observed variables. In contrast, the new latent-factor half-trek criterion (LF-HTC) we develop in this paper operates on the original unprojected latent variable model and is able to certify identifiability in settings where some latent variables may also have dense effects on many or even all of the observables. Our LF-HTC is an effective sufficient criterion for rational identifiability, under which the direct effects can be uniquely recovered as rational functions of the joint covariance matrix of the observed random variables. When restricting the search steps in LF-HTC to consider subsets of latent variables of bounded size, the criterion can be verified in time that is polynomial in the size of the graph.

Citations: 3

On robustness and local differential privacy

The Annals of Statistics Pub Date : 2022-01-03 DOI: 10.1214/23-aos2267

Mengchu Li, Thomas B. Berrett, Yi Yu

Abstract: It is of soaring demand to develop statistical analysis tools that are robust against contamination as well as preserving individual data owners' privacy. In spite of the fact that both topics host a rich body of literature, to the best of our knowledge, we are the first to systematically study the connections between the optimality under Huber's contamination model and the local differential privacy (LDP) constraints. In this paper, we start with a general minimax lower bound result, which disentangles the costs of being robust against Huber's contamination and preserving LDP. We further study four concrete examples: a two-point testing problem, a potentially-diverging mean estimation problem, a nonparametric density estimation problem and a univariate median estimation problem. For each problem, we demonstrate procedures that are optimal in the presence of both contamination and LDP constraints, comment on the connections with the state-of-the-art methods that are only studied under either contamination or privacy constraints, and unveil the connections between robustness and LDP via partially answering whether LDP procedures are robust and whether robust procedures can be efficiently privatised. Overall, our work showcases a promising prospect of joint study for robustness and local differential privacy.

Citations: 12

Variable selection, monotone likelihood ratio and group sparsity

The Annals of Statistics Pub Date : 2021-12-30 DOI: 10.1214/22-aos2251

C. Butucea, E. Mammen, M. Ndaoud, A. Tsybakov

Citations: 0

General and feasible tests with multiply-imputed datasets

The Annals of Statistics Pub Date : 2021-12-30 DOI: 10.1214/21-aos2132

Kin Wai Chan

Abstract: Multiple imputation (MI) is a technique especially designed for handling missing data in public-use datasets. It allows analysts to perform incomplete-data inference straightforwardly by using several already imputed datasets released by the dataset owners. However, the existing MI tests require either a restrictive assumption on the missing-data mechanism, known as equal odds of missing information (EOMI), or an infinite number of imputations. Some of them also require analysts to have access to restrictive or nonstandard computer subroutines. Besides, the existing MI testing procedures cover only Wald’s tests and likelihood ratio tests but not Rao’s score tests, therefore, these MI testing procedures are not general enough. In addition, the MI Wald’s tests and MI likelihood ratio tests are not procedurally identical, so analysts need to resort to distinct algorithms for implementation. In this paper, we propose a general MI procedure, called stacked multiple imputation (SMI), for performing Wald’s tests, likelihood ratio tests and Rao’s score tests by a unified algorithm. SMI requires neither EOMI nor an infinite number of imputations. It is particularly feasible for analysts as they just need to use a complete-data testing device for performing the corresponding incomplete-data test.

Citations: 1