Kevin Rupp, Rudolf Schill, Jonas Süskind, Peter Georg, Maren Klever, Andreas Lösch, Lars Grasedyck, Tilo Wettig, Rainer Spang
{"title":"Differentiated uniformization: a new method for inferring Markov chains on combinatorial state spaces including stochastic epidemic models","authors":"Kevin Rupp, Rudolf Schill, Jonas Süskind, Peter Georg, Maren Klever, Andreas Lösch, Lars Grasedyck, Tilo Wettig, Rainer Spang","doi":"10.1007/s00180-024-01454-9","DOIUrl":null,"url":null,"abstract":"<p>We consider continuous-time Markov chains that describe the stochastic evolution of a dynamical system by a transition-rate matrix <i>Q</i> which depends on a parameter <span>\\(\\theta \\)</span>. Computing the probability distribution over states at time <i>t</i> requires the matrix exponential <span>\\(\\exp \\,\\left( tQ\\right) \\,\\)</span>, and inferring <span>\\(\\theta \\)</span> from data requires its derivative <span>\\(\\partial \\exp \\,\\left( tQ\\right) \\,/\\partial \\theta \\)</span>. Both are challenging to compute when the state space and hence the size of <i>Q</i> is huge. This can happen when the state space consists of all combinations of the values of several interacting discrete variables. Often it is even impossible to store <i>Q</i>. However, when <i>Q</i> can be written as a sum of tensor products, computing <span>\\(\\exp \\,\\left( tQ\\right) \\,\\)</span> becomes feasible by the uniformization method, which does not require explicit storage of <i>Q</i>. Here we provide an analogous algorithm for computing <span>\\(\\partial \\exp \\,\\left( tQ\\right) \\,/\\partial \\theta \\)</span>, the <i>differentiated uniformization method</i>. We demonstrate our algorithm for the stochastic SIR model of epidemic spread, for which we show that <i>Q</i> can be written as a sum of tensor products. We estimate monthly infection and recovery rates during the first wave of the COVID-19 pandemic in Austria and quantify their uncertainty in a full Bayesian analysis. Implementation and data are available at https://github.com/spang-lab/TenSIR.</p>","PeriodicalId":55223,"journal":{"name":"Computational Statistics","volume":"74 1","pages":""},"PeriodicalIF":1.0000,"publicationDate":"2024-01-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computational Statistics","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1007/s00180-024-01454-9","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}
引用次数: 0
Abstract
We consider continuous-time Markov chains that describe the stochastic evolution of a dynamical system by a transition-rate matrix Q which depends on a parameter \(\theta \). Computing the probability distribution over states at time t requires the matrix exponential \(\exp \,\left( tQ\right) \,\), and inferring \(\theta \) from data requires its derivative \(\partial \exp \,\left( tQ\right) \,/\partial \theta \). Both are challenging to compute when the state space and hence the size of Q is huge. This can happen when the state space consists of all combinations of the values of several interacting discrete variables. Often it is even impossible to store Q. However, when Q can be written as a sum of tensor products, computing \(\exp \,\left( tQ\right) \,\) becomes feasible by the uniformization method, which does not require explicit storage of Q. Here we provide an analogous algorithm for computing \(\partial \exp \,\left( tQ\right) \,/\partial \theta \), the differentiated uniformization method. We demonstrate our algorithm for the stochastic SIR model of epidemic spread, for which we show that Q can be written as a sum of tensor products. We estimate monthly infection and recovery rates during the first wave of the COVID-19 pandemic in Austria and quantify their uncertainty in a full Bayesian analysis. Implementation and data are available at https://github.com/spang-lab/TenSIR.
期刊介绍:
Computational Statistics (CompStat) is an international journal which promotes the publication of applications and methodological research in the field of Computational Statistics. The focus of papers in CompStat is on the contribution to and influence of computing on statistics and vice versa. The journal provides a forum for computer scientists, mathematicians, and statisticians in a variety of fields of statistics such as biometrics, econometrics, data analysis, graphics, simulation, algorithms, knowledge based systems, and Bayesian computing. CompStat publishes hardware, software plus package reports.