Bao Anh Vu , David Gunawan , Andrew Zammit-Mangion
{"title":"Recursive variational Gaussian approximation with the Whittle likelihood for linear non-Gaussian state space models","authors":"Bao Anh Vu , David Gunawan , Andrew Zammit-Mangion","doi":"10.1016/j.csda.2025.108324","DOIUrl":null,"url":null,"abstract":"<div><div>Parameter inference for linear and non-Gaussian state space models is challenging because the likelihood function contains an intractable integral over the latent state variables. While Markov chain Monte Carlo (MCMC) methods provide exact samples from the posterior distribution as the number of samples goes to infinity, they tend to have high computational cost, particularly for observations of a long time series. When inference with MCMC methods is computationally expensive, variational Bayes (VB) methods are a useful alternative. VB methods approximate the posterior density of the parameters with a simple and tractable distribution found through optimisation. A novel sequential VB algorithm that makes use of the Whittle likelihood is proposed for computationally efficient parameter inference in linear, non-Gaussian state space models. The algorithm, called Recursive Variational Gaussian Approximation with the Whittle Likelihood (R-VGA-Whittle), updates the variational parameters by processing data in the frequency domain. At each iteration, R-VGA-Whittle requires the gradient and Hessian of the Whittle log-likelihood, which are available in closed form. Through several examples involving a linear Gaussian state space model; a univariate/bivariate stochastic volatility model; and a state space model with Student’s t measurement error, where the latent states follow an autoregressive fractionally integrated moving average (ARFIMA) model, R-VGA-Whittle is shown to provide good approximations to posterior distributions of the parameters, and it is very computationally efficient when compared to asymptotically exact methods such as Hamiltonian Monte Carlo.</div></div>","PeriodicalId":55225,"journal":{"name":"Computational Statistics & Data Analysis","volume":"218 ","pages":"Article 108324"},"PeriodicalIF":1.6000,"publicationDate":"2026-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computational Statistics & Data Analysis","FirstCategoryId":"100","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0167947325002002","RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/12/27 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
Parameter inference for linear and non-Gaussian state space models is challenging because the likelihood function contains an intractable integral over the latent state variables. While Markov chain Monte Carlo (MCMC) methods provide exact samples from the posterior distribution as the number of samples goes to infinity, they tend to have high computational cost, particularly for observations of a long time series. When inference with MCMC methods is computationally expensive, variational Bayes (VB) methods are a useful alternative. VB methods approximate the posterior density of the parameters with a simple and tractable distribution found through optimisation. A novel sequential VB algorithm that makes use of the Whittle likelihood is proposed for computationally efficient parameter inference in linear, non-Gaussian state space models. The algorithm, called Recursive Variational Gaussian Approximation with the Whittle Likelihood (R-VGA-Whittle), updates the variational parameters by processing data in the frequency domain. At each iteration, R-VGA-Whittle requires the gradient and Hessian of the Whittle log-likelihood, which are available in closed form. Through several examples involving a linear Gaussian state space model; a univariate/bivariate stochastic volatility model; and a state space model with Student’s t measurement error, where the latent states follow an autoregressive fractionally integrated moving average (ARFIMA) model, R-VGA-Whittle is shown to provide good approximations to posterior distributions of the parameters, and it is very computationally efficient when compared to asymptotically exact methods such as Hamiltonian Monte Carlo.
线性和非高斯状态空间模型的参数推理具有挑战性,因为似然函数包含对潜在状态变量的难以处理的积分。当样本数量趋于无穷大时,马尔可夫链蒙特卡罗(MCMC)方法提供来自后验分布的精确样本,但它们往往具有很高的计算成本,特别是对于长时间序列的观测。当使用MCMC方法进行推理的计算成本很高时,变分贝叶斯(VB)方法是一种有用的替代方法。VB方法近似参数的后验密度,通过优化找到一个简单而易于处理的分布。提出了一种利用Whittle似然的序列VB算法,用于线性非高斯状态空间模型的高效参数推理。该算法被称为递归变分高斯近似与惠特尔似然(R-VGA-Whittle),通过在频域处理数据来更新变分参数。在每次迭代中,r - ga -Whittle需要Whittle对数似然的梯度和Hessian,它们以封闭形式可用。通过几个涉及线性高斯状态空间模型的例子;单变量/双变量随机波动模型;以及具有Student’s t测量误差的状态空间模型,其中潜在状态遵循自回归分数积分移动平均(ARFIMA)模型,R-VGA-Whittle被证明可以很好地近似参数的后验分布,并且与渐近精确方法(如hamilton - Monte Carlo)相比,它的计算效率非常高。
期刊介绍:
Computational Statistics and Data Analysis (CSDA), an Official Publication of the network Computational and Methodological Statistics (CMStatistics) and of the International Association for Statistical Computing (IASC), is an international journal dedicated to the dissemination of methodological research and applications in the areas of computational statistics and data analysis. The journal consists of four refereed sections which are divided into the following subject areas:
I) Computational Statistics - Manuscripts dealing with: 1) the explicit impact of computers on statistical methodology (e.g., Bayesian computing, bioinformatics,computer graphics, computer intensive inferential methods, data exploration, data mining, expert systems, heuristics, knowledge based systems, machine learning, neural networks, numerical and optimization methods, parallel computing, statistical databases, statistical systems), and 2) the development, evaluation and validation of statistical software and algorithms. Software and algorithms can be submitted with manuscripts and will be stored together with the online article.
II) Statistical Methodology for Data Analysis - Manuscripts dealing with novel and original data analytical strategies and methodologies applied in biostatistics (design and analytic methods for clinical trials, epidemiological studies, statistical genetics, or genetic/environmental interactions), chemometrics, classification, data exploration, density estimation, design of experiments, environmetrics, education, image analysis, marketing, model free data exploration, pattern recognition, psychometrics, statistical physics, image processing, robust procedures.
[...]
III) Special Applications - [...]
IV) Annals of Statistical Data Science [...]