Minqian Yan, Lizhi Zhou, Chongye Zhao, Chen Shi, Chunquan Ou
{"title":"Comparison of different approaches in handling missing data in longitudinal multiple-item patient-reported outcomes: a simulation study.","authors":"Minqian Yan, Lizhi Zhou, Chongye Zhao, Chen Shi, Chunquan Ou","doi":"10.1186/s12955-025-02364-0","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Patient-reported outcomes (PROs) are important clinical outcomes widely used as primary and secondary endpoints in clinical studies. However, PRO data often suffers from missing values for various reasons, which pose challenges to data analysis. This simulation study aimed to compare the performance of existing state-of-the-art approaches in handling missing PRO data.</p><p><strong>Methods: </strong>Using a real and complete multiple-item PRO dataset, we generated various missing scenarios with different missing rates, mechanisms, and patterns. The performances of eight methods were compared, including a mixed model for repeated measures (MMRM) with and without imputation at the item level, multiple imputation by chained equations (MICE) at the composite score and item levels, and three control-based pattern mixture models (PPMs) and the last observation carried forward (LOCF) imputation at the item level.</p><p><strong>Results: </strong>We found that the bias (i.e., deviation of the estimated from the true value) in the treatment effect estimates increased, and the statistical power diminished as the missing rate increased, especially for monotonic missing data. Item-level imputation led to a smaller bias and less reduction in power than composite score-level imputation. Except for cases under missing-not-at-random mechanisms (MNAR) and with a high proportion of patients' entire questionnaire missing, MMRM imputation at the item level demonstrated the lowest bias and highest power, followed by MICE imputation at the item level. The PPM methods were superior to the other methods under MNAR mechanisms.</p><p><strong>Conclusions: </strong>PPMs imputation at the item level was preferable for MNAR, whereas MMRM and MICE imputation at the item level were better for other scenarios. These findings provide valuable insight for selecting appropriate methods for handling missing PRO data.</p>","PeriodicalId":12980,"journal":{"name":"Health and Quality of Life Outcomes","volume":"23 1","pages":"34"},"PeriodicalIF":3.2000,"publicationDate":"2025-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11972534/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Health and Quality of Life Outcomes","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12955-025-02364-0","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Patient-reported outcomes (PROs) are important clinical outcomes widely used as primary and secondary endpoints in clinical studies. However, PRO data often suffers from missing values for various reasons, which pose challenges to data analysis. This simulation study aimed to compare the performance of existing state-of-the-art approaches in handling missing PRO data.
Methods: Using a real and complete multiple-item PRO dataset, we generated various missing scenarios with different missing rates, mechanisms, and patterns. The performances of eight methods were compared, including a mixed model for repeated measures (MMRM) with and without imputation at the item level, multiple imputation by chained equations (MICE) at the composite score and item levels, and three control-based pattern mixture models (PPMs) and the last observation carried forward (LOCF) imputation at the item level.
Results: We found that the bias (i.e., deviation of the estimated from the true value) in the treatment effect estimates increased, and the statistical power diminished as the missing rate increased, especially for monotonic missing data. Item-level imputation led to a smaller bias and less reduction in power than composite score-level imputation. Except for cases under missing-not-at-random mechanisms (MNAR) and with a high proportion of patients' entire questionnaire missing, MMRM imputation at the item level demonstrated the lowest bias and highest power, followed by MICE imputation at the item level. The PPM methods were superior to the other methods under MNAR mechanisms.
Conclusions: PPMs imputation at the item level was preferable for MNAR, whereas MMRM and MICE imputation at the item level were better for other scenarios. These findings provide valuable insight for selecting appropriate methods for handling missing PRO data.
期刊介绍:
Health and Quality of Life Outcomes is an open access, peer-reviewed, journal offering high quality articles, rapid publication and wide diffusion in the public domain.
Health and Quality of Life Outcomes considers original manuscripts on the Health-Related Quality of Life (HRQOL) assessment for evaluation of medical and psychosocial interventions. It also considers approaches and studies on psychometric properties of HRQOL and patient reported outcome measures, including cultural validation of instruments if they provide information about the impact of interventions. The journal publishes study protocols and reviews summarising the present state of knowledge concerning a particular aspect of HRQOL and patient reported outcome measures. Reviews should generally follow systematic review methodology. Comments on articles and letters to the editor are welcome.