{"title":"使用半参数贝叶斯和机器学习组件对具有不可忽略的非单调缺失性的纵向数据进行统计建模。","authors":"Yu Cao, Nitai D Mukhopadhyay","doi":"10.1007/s13571-019-00222-w","DOIUrl":null,"url":null,"abstract":"<p><p>In longitudinal studies, outcomes are measured repeatedly over time and it is common that not all the patients will be measured throughout the study. For example patients can be lost to follow-up (monotone missingness) or miss one or more visits (non-monotone missingness); hence there are missing outcomes. In the longitudinal setting, we often assume the missingness is related to the unobserved data, which is non-ignorable. Pattern-mixture models (PMM) analyze the joint distribution of outcome and patterns of missingness in longitudinal data with non-ignorable nonmonotone missingness. Existing methods employ PMM and impute the unobserved outcomes using the distribution of observed outcomes, conditioned on missing patterns. We extend the existing methods using latent class analysis (LCA) and a shared-parameter PMM. The LCA groups patterns of missingness with similar features and the shared-parameter PMM allows a subset of parameters to be different between latent classes when fitting a model. We also propose a method for imputation using distribution of observed data conditioning on latent class. Our model improves existing methods by accommodating data with small sample size. In a simulation study our estimator had smaller mean squared error than existing methods. Our methodology is applied to data from a phase II clinical trial that studies quality of life of patients with prostate cancer receiving radiation therapy.</p>","PeriodicalId":74754,"journal":{"name":"Sankhya. Series B (2008)","volume":"83 1","pages":"152-169"},"PeriodicalIF":0.0000,"publicationDate":"2021-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8209781/pdf/nihms-1574489.pdf","citationCount":"0","resultStr":"{\"title\":\"Statistical Modeling of Longitudinal Data with Non-ignorable Non-monotone Missingness with Semiparametric Bayesian and Machine Learning Components.\",\"authors\":\"Yu Cao, Nitai D Mukhopadhyay\",\"doi\":\"10.1007/s13571-019-00222-w\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>In longitudinal studies, outcomes are measured repeatedly over time and it is common that not all the patients will be measured throughout the study. For example patients can be lost to follow-up (monotone missingness) or miss one or more visits (non-monotone missingness); hence there are missing outcomes. In the longitudinal setting, we often assume the missingness is related to the unobserved data, which is non-ignorable. Pattern-mixture models (PMM) analyze the joint distribution of outcome and patterns of missingness in longitudinal data with non-ignorable nonmonotone missingness. Existing methods employ PMM and impute the unobserved outcomes using the distribution of observed outcomes, conditioned on missing patterns. We extend the existing methods using latent class analysis (LCA) and a shared-parameter PMM. The LCA groups patterns of missingness with similar features and the shared-parameter PMM allows a subset of parameters to be different between latent classes when fitting a model. We also propose a method for imputation using distribution of observed data conditioning on latent class. Our model improves existing methods by accommodating data with small sample size. In a simulation study our estimator had smaller mean squared error than existing methods. Our methodology is applied to data from a phase II clinical trial that studies quality of life of patients with prostate cancer receiving radiation therapy.</p>\",\"PeriodicalId\":74754,\"journal\":{\"name\":\"Sankhya. Series B (2008)\",\"volume\":\"83 1\",\"pages\":\"152-169\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8209781/pdf/nihms-1574489.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Sankhya. Series B (2008)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1007/s13571-019-00222-w\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2020/3/9 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Sankhya. Series B (2008)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/s13571-019-00222-w","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2020/3/9 0:00:00","PubModel":"Epub","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
在纵向研究中,结果是随着时间的推移反复测量的,而并非所有患者都会在整个研究期间接受测量,这种情况很常见。例如,患者可能失去随访(单调遗漏)或错过一次或多次就诊(非单调遗漏);因此会出现结果遗漏。在纵向研究中,我们通常假定缺失与未观察到的数据有关,而这些数据是不可忽略的。模式混杂模型(PMM)分析了具有不可忽略的非单调缺失的纵向数据中结果和缺失模式的联合分布。现有方法采用 PMM,利用观察到的结果分布,以缺失模式为条件,对未观察到的结果进行估算。我们利用潜类分析(LCA)和共享参数 PMM 扩展了现有方法。LCA 将具有相似特征的缺失模式分组,而共享参数 PMM 则允许在拟合模型时不同潜类之间的参数子集有所不同。我们还提出了一种利用潜类条件下的观测数据分布进行估算的方法。我们的模型改进了现有的方法,适用于样本量较小的数据。在一项模拟研究中,我们的估计器比现有方法的均方误差更小。我们的方法被应用于一项研究接受放射治疗的前列腺癌患者生活质量的 II 期临床试验数据。
Statistical Modeling of Longitudinal Data with Non-ignorable Non-monotone Missingness with Semiparametric Bayesian and Machine Learning Components.
In longitudinal studies, outcomes are measured repeatedly over time and it is common that not all the patients will be measured throughout the study. For example patients can be lost to follow-up (monotone missingness) or miss one or more visits (non-monotone missingness); hence there are missing outcomes. In the longitudinal setting, we often assume the missingness is related to the unobserved data, which is non-ignorable. Pattern-mixture models (PMM) analyze the joint distribution of outcome and patterns of missingness in longitudinal data with non-ignorable nonmonotone missingness. Existing methods employ PMM and impute the unobserved outcomes using the distribution of observed outcomes, conditioned on missing patterns. We extend the existing methods using latent class analysis (LCA) and a shared-parameter PMM. The LCA groups patterns of missingness with similar features and the shared-parameter PMM allows a subset of parameters to be different between latent classes when fitting a model. We also propose a method for imputation using distribution of observed data conditioning on latent class. Our model improves existing methods by accommodating data with small sample size. In a simulation study our estimator had smaller mean squared error than existing methods. Our methodology is applied to data from a phase II clinical trial that studies quality of life of patients with prostate cancer receiving radiation therapy.