{"title":"A tensor decomposition model for longitudinal microbiome studies","authors":"Siyuan Ma, Hongzhe Li","doi":"10.1214/22-aoas1661","DOIUrl":null,"url":null,"abstract":"Longitudinal microbiome studies can help delineate true biological signals from the high interindividual variability that is common in microbiome data. However, there are few methods available for unsupervised dimension reduction of time course microbial abundance observations. Existing methods do not fully observe the distribution characteristics of such data types, namely, zero-inflation, compositionality, and overdispersion. We present a tensor decomposition model and a semiparametric quasi-likelihood estimation method for the decomposition of longitudinal microbiome data, by gen-eralizing existing approaches in tensor decomposition of Gaussian data. Optimization is performed through projected gradient descent additionally allowing interpretability constraints. We show through simulation studies our method is able to recover low rank structures from microbiome time course data, better than existing approaches. Lastly, we apply our method to two existing longitudinal microbiome studies, to detect global microbial changes associated with dietary and pharmaceutical effects, as well as infant birth modes.","PeriodicalId":188068,"journal":{"name":"The Annals of Applied Statistics","volume":"13 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"The Annals of Applied Statistics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1214/22-aoas1661","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Longitudinal microbiome studies can help delineate true biological signals from the high interindividual variability that is common in microbiome data. However, there are few methods available for unsupervised dimension reduction of time course microbial abundance observations. Existing methods do not fully observe the distribution characteristics of such data types, namely, zero-inflation, compositionality, and overdispersion. We present a tensor decomposition model and a semiparametric quasi-likelihood estimation method for the decomposition of longitudinal microbiome data, by gen-eralizing existing approaches in tensor decomposition of Gaussian data. Optimization is performed through projected gradient descent additionally allowing interpretability constraints. We show through simulation studies our method is able to recover low rank structures from microbiome time course data, better than existing approaches. Lastly, we apply our method to two existing longitudinal microbiome studies, to detect global microbial changes associated with dietary and pharmaceutical effects, as well as infant birth modes.