{"title":"Fast fitting of phylogenetic mixed effects models","authors":"Bert van der Veen, Robert Brian O'Hara","doi":"arxiv-2408.05333","DOIUrl":null,"url":null,"abstract":"Mixed effects models are among the most commonly used statistical methods for\nthe exploration of multispecies data. In recent years, also Joint Species\nDistribution Models and Generalized Linear Latent Variale Models have gained in\npopularity when the goal is to incorporate residual covariation between species\nthat cannot be explained due to measured environmental covariates. Few software\nimplementations of such models exist that can additionally incorporate\nphylogenetic information, and those that exist tend to utilize Markov chain\nMonte Carlo methods for estimation, so that model fitting takes a long time. In\nthis article we develop new methods for quickly and flexibly fitting\nphylogenetic mixed models, potentially incorporating residual covariation\nbetween species using latent variables, with the possibility to estimate the\nstrength of phylogenetic structuring in species responses per environmental\ncovariate, and while incorporating correlation between different covariate\neffects. By combining Variational approximations with a reduced rank matrix\nnormal covariance structure, Nearest Neighbours Gaussian Processes, and\nparallel computation, phylogenetic mixed models can be fitted much more quickly\nthan the current state-of-the-art. Two simulation studies demonstrate that the\nproposed combination of approximations is not only fast, but also enjoys high\naccuracy. Finally, we demonstrate use of the method with a real world dataset\nof wood-decaying fungi.","PeriodicalId":501215,"journal":{"name":"arXiv - STAT - Computation","volume":"73 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - STAT - Computation","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2408.05333","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Mixed effects models are among the most commonly used statistical methods for
the exploration of multispecies data. In recent years, also Joint Species
Distribution Models and Generalized Linear Latent Variale Models have gained in
popularity when the goal is to incorporate residual covariation between species
that cannot be explained due to measured environmental covariates. Few software
implementations of such models exist that can additionally incorporate
phylogenetic information, and those that exist tend to utilize Markov chain
Monte Carlo methods for estimation, so that model fitting takes a long time. In
this article we develop new methods for quickly and flexibly fitting
phylogenetic mixed models, potentially incorporating residual covariation
between species using latent variables, with the possibility to estimate the
strength of phylogenetic structuring in species responses per environmental
covariate, and while incorporating correlation between different covariate
effects. By combining Variational approximations with a reduced rank matrix
normal covariance structure, Nearest Neighbours Gaussian Processes, and
parallel computation, phylogenetic mixed models can be fitted much more quickly
than the current state-of-the-art. Two simulation studies demonstrate that the
proposed combination of approximations is not only fast, but also enjoys high
accuracy. Finally, we demonstrate use of the method with a real world dataset
of wood-decaying fungi.