{"title":"Composite method for fast computation of individual level spatial epidemic models","authors":"Yirao Zhang , Rob Deardon , Lorna Deeth","doi":"10.1016/j.spasta.2026.100957","DOIUrl":null,"url":null,"abstract":"<div><div>Individual-level models, also known as ILMs, are commonly used in epidemics modelling, as they can flexibly incorporate individual-level covariates that influence susceptibility and transmissibility upon infection. However, inference for ILMs is computationally intensive, especially as the total population size increases and additional covariates are incorporated. We propose a composite method, the composite ILM (C-ILM), that clusters the population into minimally-interfered subpopulations, with between-cluster infections enabled through a “spark function.” This approach allows for parallel computation of subsets before aggregation. Focusing on C-ILM, we consider four “spark functions”, and introduce a Dirichlet process mixture modelling (DPMM) algorithm for clustering. Simulation results indicate that, in addition to faster computation, C-ILM performs well in parameter estimation and posterior predictions. Furthermore, within C-ILM framework, DPMM algorithm demonstrates superior performance compared to the conventional <span><math><mi>K</mi></math></span>-means algorithm. We apply the methods to data from the 2001 UK foot-and-mouth disease outbreak. The results provide evidence that C-ILM is not only computationally efficient but also achieves a better model fit compared to the basic spatial ILM.</div></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":"72 ","pages":"Article 100957"},"PeriodicalIF":2.5000,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Spatial Statistics","FirstCategoryId":"100","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2211675326000059","RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2026/1/21 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"GEOSCIENCES, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
Individual-level models, also known as ILMs, are commonly used in epidemics modelling, as they can flexibly incorporate individual-level covariates that influence susceptibility and transmissibility upon infection. However, inference for ILMs is computationally intensive, especially as the total population size increases and additional covariates are incorporated. We propose a composite method, the composite ILM (C-ILM), that clusters the population into minimally-interfered subpopulations, with between-cluster infections enabled through a “spark function.” This approach allows for parallel computation of subsets before aggregation. Focusing on C-ILM, we consider four “spark functions”, and introduce a Dirichlet process mixture modelling (DPMM) algorithm for clustering. Simulation results indicate that, in addition to faster computation, C-ILM performs well in parameter estimation and posterior predictions. Furthermore, within C-ILM framework, DPMM algorithm demonstrates superior performance compared to the conventional -means algorithm. We apply the methods to data from the 2001 UK foot-and-mouth disease outbreak. The results provide evidence that C-ILM is not only computationally efficient but also achieves a better model fit compared to the basic spatial ILM.
期刊介绍:
Spatial Statistics publishes articles on the theory and application of spatial and spatio-temporal statistics. It favours manuscripts that present theory generated by new applications, or in which new theory is applied to an important practical case. A purely theoretical study will only rarely be accepted. Pure case studies without methodological development are not acceptable for publication.
Spatial statistics concerns the quantitative analysis of spatial and spatio-temporal data, including their statistical dependencies, accuracy and uncertainties. Methodology for spatial statistics is typically found in probability theory, stochastic modelling and mathematical statistics as well as in information science. Spatial statistics is used in mapping, assessing spatial data quality, sampling design optimisation, modelling of dependence structures, and drawing of valid inference from a limited set of spatio-temporal data.