Mika Sipilä, Claudia Cappello, Sandra De Iaco, Klaus Nordhausen, Sara Taskinen
{"title":"Modelling multivariate spatio-temporal data with identifiable variational autoencoders","authors":"Mika Sipilä, Claudia Cappello, Sandra De Iaco, Klaus Nordhausen, Sara Taskinen","doi":"arxiv-2409.04162","DOIUrl":null,"url":null,"abstract":"Modelling multivariate spatio-temporal data with complex dependency\nstructures is a challenging task but can be simplified by assuming that the\noriginal variables are generated from independent latent components. If these\ncomponents are found, they can be modelled univariately. Blind source\nseparation aims to recover the latent components by estimating the unmixing\ntransformation based on the observed data only. The current methods for\nspatio-temporal blind source separation are restricted to linear unmixing, and\nnonlinear variants have not been implemented. In this paper, we extend\nidentifiable variational autoencoder to the nonlinear nonstationary\nspatio-temporal blind source separation setting and demonstrate its performance\nusing comprehensive simulation studies. Additionally, we introduce two\nalternative methods for the latent dimension estimation, which is a crucial\ntask in order to obtain the correct latent representation. Finally, we\nillustrate the proposed methods using a meteorological application, where we\nestimate the latent dimension and the latent components, interpret the\ncomponents, and show how nonstationarity can be accounted and prediction\naccuracy can be improved by using the proposed nonlinear blind source\nseparation method as a preprocessing method.","PeriodicalId":501425,"journal":{"name":"arXiv - STAT - Methodology","volume":"47 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - STAT - Methodology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.04162","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Modelling multivariate spatio-temporal data with complex dependency
structures is a challenging task but can be simplified by assuming that the
original variables are generated from independent latent components. If these
components are found, they can be modelled univariately. Blind source
separation aims to recover the latent components by estimating the unmixing
transformation based on the observed data only. The current methods for
spatio-temporal blind source separation are restricted to linear unmixing, and
nonlinear variants have not been implemented. In this paper, we extend
identifiable variational autoencoder to the nonlinear nonstationary
spatio-temporal blind source separation setting and demonstrate its performance
using comprehensive simulation studies. Additionally, we introduce two
alternative methods for the latent dimension estimation, which is a crucial
task in order to obtain the correct latent representation. Finally, we
illustrate the proposed methods using a meteorological application, where we
estimate the latent dimension and the latent components, interpret the
components, and show how nonstationarity can be accounted and prediction
accuracy can be improved by using the proposed nonlinear blind source
separation method as a preprocessing method.