{"title":"A limit formula and a series expansion for the bivariate Normal tail probability","authors":"Siu-Kui Au","doi":"10.1007/s11222-024-10466-w","DOIUrl":"https://doi.org/10.1007/s11222-024-10466-w","url":null,"abstract":"<p>This work presents a limit formula for the bivariate Normal tail probability. It only requires the larger threshold to grow indefinitely, but otherwise has no restrictions on how the thresholds grow. The correlation parameter can change and possibly depend on the thresholds. The formula is applicable regardless of Savage’s condition. Asymptotically, it reduces to Ruben’s formula and Hashorva’s formula under the corresponding conditions, and therefore can be considered a generalisation. Under a mild condition, it satisfies Plackett’s identity on the derivative with respect to the correlation parameter. Motivated by the limit formula, a series expansion is also obtained for the exact tail probability using derivatives of the univariate Mills ratio. Under similar conditions for the limit formula, the series converges and its truncated approximation has a small remainder term for large thresholds. To take advantage of this, a simple procedure is developed for the general case by remapping the parameters so that they satisfy the conditions. Examples are presented to illustrate the theoretical findings.</p>","PeriodicalId":22058,"journal":{"name":"Statistics and Computing","volume":null,"pages":null},"PeriodicalIF":2.2,"publicationDate":"2024-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141575837","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Classifier-dependent feature selection via greedy methods","authors":"Fabiana Camattari, Sabrina Guastavino, Francesco Marchetti, Michele Piana, Emma Perracchione","doi":"10.1007/s11222-024-10460-2","DOIUrl":"https://doi.org/10.1007/s11222-024-10460-2","url":null,"abstract":"<p>The purpose of this study is to introduce a new approach to feature ranking for classification tasks, called greedy feature selection in what follows. In statistical learning, feature selection is usually realized by means of methods that are independent of the classifier applied to perform the prediction using that reduced number of features. Instead, greedy feature selection identifies the most important feature at each step according to the selected classifier. The benefits of such a scheme are investigated in terms of model capacity indicators, such as the Vapnik-Chervonenkis dimension or kernel alignment. This theoretical study proves that the iterative greedy algorithm is able to construct classifiers whose capacity grows at each step. The proposed method is then tested numerically on various datasets and compared to state-of-the-art techniques. The results show that our iterative scheme is able to capture only a few relevant features, and may improve, especially for real and noisy data, the accuracy scores of other techniques. The greedy scheme is also applied to the challenging application of predicting geo-effective manifestations of the active Sun.</p>","PeriodicalId":22058,"journal":{"name":"Statistics and Computing","volume":null,"pages":null},"PeriodicalIF":2.2,"publicationDate":"2024-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141575838","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Locally sparse and robust partial least squares in scalar-on-function regression","authors":"Sude Gurer, Han Lin Shang, Abhijit Mandal, Ufuk Beyaztas","doi":"10.1007/s11222-024-10464-y","DOIUrl":"https://doi.org/10.1007/s11222-024-10464-y","url":null,"abstract":"<p>We present a novel approach for estimating a scalar-on-function regression model, leveraging a functional partial least squares methodology. Our proposed method involves computing the functional partial least squares components through sparse partial robust M regression, facilitating robust and locally sparse estimation of the regression coefficient function. This strategy delivers a robust decomposition for the functional predictor and regression coefficient functions. After the decomposition, model parameters are estimated using a weighted loss function, incorporating robustness through iterative reweighting of the partial least squares components. The robust decomposition feature of our proposed method enables the robust estimation of model parameters in the scalar-on-function regression model, ensuring reliable predictions in the presence of outliers and leverage points. Moreover, it accurately identifies zero and nonzero sub-regions where the slope function is estimated, even in the presence of outliers and leverage points. We assess our proposed method’s estimation and predictive performance through a series of Monte Carlo experiments and an empirical dataset—that is, data collected in relation to oriented strand board. Compared to existing methods, our proposed method performs favorably. Notably, our robust procedure exhibits superior performance in the presence of outliers while maintaining competitiveness in their absence. Our method has been implemented in the <span>robsfplsr</span> package in R.</p>","PeriodicalId":22058,"journal":{"name":"Statistics and Computing","volume":null,"pages":null},"PeriodicalIF":2.2,"publicationDate":"2024-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141575839","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Efficient Shapley performance attribution for least-squares regression","authors":"Logan Bell, Nikhil Devanathan, Stephen Boyd","doi":"10.1007/s11222-024-10459-9","DOIUrl":"https://doi.org/10.1007/s11222-024-10459-9","url":null,"abstract":"<p>We consider the performance of a least-squares regression model, as judged by out-of-sample <span>(R^2)</span>. Shapley values give a fair attribution of the performance of a model to its input features, taking into account interdependencies between features. Evaluating the Shapley values exactly requires solving a number of regression problems that is exponential in the number of features, so a Monte Carlo-type approximation is typically used. We focus on the special case of least-squares regression models, where several tricks can be used to compute and evaluate regression models efficiently. These tricks give a substantial speed up, allowing many more Monte Carlo samples to be evaluated, achieving better accuracy. We refer to our method as least-squares Shapley performance attribution (LS-SPA), and describe our open-source implementation.</p>","PeriodicalId":22058,"journal":{"name":"Statistics and Computing","volume":null,"pages":null},"PeriodicalIF":2.2,"publicationDate":"2024-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141546756","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"On weak convergence of quantile-based empirical likelihood process for ROC curves","authors":"Hu Jiang, Liu Yiming, Zhou Wang","doi":"10.1007/s11222-024-10457-x","DOIUrl":"https://doi.org/10.1007/s11222-024-10457-x","url":null,"abstract":"<p>The empirical likelihood (EL) method possesses desirable qualities such as automatically determining confidence regions and circumventing the need for variance estimation. As an extension, a quantile-based EL (QEL) method is considered, which results in a simpler form. In this paper, we explore the framework of the QEL method. Firstly, we explore the weak convergence of the −2 log empirical likelihood ratio for ROC curves. We also introduce a novel statistic for testing the entire ROC curve and the equality of two distributions. To validate our approach, we conduct simulation studies and analyze real data from hepatitis C patients, comparing our method with existing ones.</p>","PeriodicalId":22058,"journal":{"name":"Statistics and Computing","volume":null,"pages":null},"PeriodicalIF":2.2,"publicationDate":"2024-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141546755","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Systemic infinitesimal over-dispersion on graphical dynamic models","authors":"Ning Ning, Edward Ionides","doi":"10.1007/s11222-024-10443-3","DOIUrl":"https://doi.org/10.1007/s11222-024-10443-3","url":null,"abstract":"<p>Stochastic models for collections of interacting populations have crucial roles in many scientific fields such as epidemiology, ecology, performance engineering, and queueing theory, to name a few. However, the standard approach to extending an ordinary differential equation model to a Markov chain does not have sufficient flexibility in the mean-variance relationship to match data. To handle that, we develop new approaches using Dirichlet noise to construct collections of independent or dependent noise processes. This permits the modeling of high-frequency variation in transition rates both within and between the populations under study. Our theory is developed in a general framework of time-inhomogeneous Markov processes equipped with a general graphical structure. We demonstrate our approach on a widely analyzed measles dataset, adding Dirichlet noise to a classical Susceptible–Exposed–Infected–Recovered model. Our methodology shows improved statistical fit measured by log-likelihood and provides new insights into the dynamics of this biological system.</p>","PeriodicalId":22058,"journal":{"name":"Statistics and Computing","volume":null,"pages":null},"PeriodicalIF":2.2,"publicationDate":"2024-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141531285","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Greedy recursive spectral bisection for modularity-bound hierarchical divisive community detection","authors":"Douglas O. Cardoso, João Domingos Gomes da Silva Junior, Carla Silva Oliveira, Celso Marques, Laura Silva de Assis","doi":"10.1007/s11222-024-10451-3","DOIUrl":"https://doi.org/10.1007/s11222-024-10451-3","url":null,"abstract":"<p>Spectral clustering techniques depend on the eigenstructure of a similarity matrix to assign data points to clusters, so that points within the same cluster exhibit high similarity compared to those in different clusters. This work aimed to develop a spectral method that could be compared to clustering algorithms that represent the current state of the art. This investigation conceived a novel spectral clustering method, as well as five policies that guide its execution, based on spectral graph theory and embodying hierarchical clustering principles. Computational experiments were undertaken to compare the proposed method with six state-of-the-art algorithms. The assessment was performed using two evaluation metrics: the adjusted Rand index and modularity. The results furnish compelling evidence that the proposed method is competitive and possesses distinctive properties compared to those elucidated in the existing literature. This suggests that our approach stands as a viable alternative, offering a robust choice within the spectrum of available same-purpose tools.</p>","PeriodicalId":22058,"journal":{"name":"Statistics and Computing","volume":null,"pages":null},"PeriodicalIF":2.2,"publicationDate":"2024-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141517773","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Flexible Bayesian quantile regression based on the generalized asymmetric Huberised-type distribution","authors":"Weitao Hu, Weiping Zhang","doi":"10.1007/s11222-024-10453-1","DOIUrl":"https://doi.org/10.1007/s11222-024-10453-1","url":null,"abstract":"<p>To enhance the robustness and flexibility of Bayesian quantile regression models using the asymmetric Laplace or asymmetric Huberised-type (AH) distribution, which lacks changeable mode, diminishing influence of outliers, and asymmetry under median regression, we propose a new generalized AH distribution which is achieved through a hierarchical mixture representation, thus leading to a flexible Bayesian Huberised quantile regression framework. With many parameters in the model, we develop an efficient Markov chain Monte Carlo procedure based on the Metropolis-within-Gibbs sampling algorithm. The robustness and flexibility of the new distribution are examined through intensive simulation studies and application to two real data sets.</p>","PeriodicalId":22058,"journal":{"name":"Statistics and Computing","volume":null,"pages":null},"PeriodicalIF":2.2,"publicationDate":"2024-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141529333","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Structured prior distributions for the covariance matrix in latent factor models","authors":"Sarah Elizabeth Heaps, Ian Hyla Jermyn","doi":"10.1007/s11222-024-10454-0","DOIUrl":"https://doi.org/10.1007/s11222-024-10454-0","url":null,"abstract":"<p>Factor models are widely used for dimension reduction in the analysis of multivariate data. This is achieved through decomposition of a <span>(p times p)</span> covariance matrix into the sum of two components. Through a latent factor representation, they can be interpreted as a diagonal matrix of idiosyncratic variances and a shared variation matrix, that is, the product of a <span>(p times k)</span> factor loadings matrix and its transpose. If <span>(k ll p)</span>, this defines a parsimonious factorisation of the covariance matrix. Historically, little attention has been paid to incorporating prior information in Bayesian analyses using factor models where, at best, the prior for the factor loadings is order invariant. In this work, a class of structured priors is developed that can encode ideas of dependence structure about the shared variation matrix. The construction allows data-informed shrinkage towards sensible parametric structures while also facilitating inference over the number of factors. Using an unconstrained reparameterisation of stationary vector autoregressions, the methodology is extended to stationary dynamic factor models. For computational inference, parameter-expanded Markov chain Monte Carlo samplers are proposed, including an efficient adaptive Gibbs sampler. Two substantive applications showcase the scope of the methodology and its inferential benefits.</p>","PeriodicalId":22058,"journal":{"name":"Statistics and Computing","volume":null,"pages":null},"PeriodicalIF":2.2,"publicationDate":"2024-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141506393","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}