Comparison of different approaches in handling missing data in longitudinal multiple-item patient-reported outcomes: a simulation study.

IF 3.4 2区医学 Q1 HEALTH CARE SCIENCES & SERVICES

Health and Quality of Life Outcomes Pub Date : 2025-04-05 DOI:10.1186/s12955-025-02364-0

Minqian Yan, Lizhi Zhou, Chongye Zhao, Chen Shi, Chunquan Ou

{"title":"Comparison of different approaches in handling missing data in longitudinal multiple-item patient-reported outcomes: a simulation study.","authors":"Minqian Yan, Lizhi Zhou, Chongye Zhao, Chen Shi, Chunquan Ou","doi":"10.1186/s12955-025-02364-0","DOIUrl":null,"url":null,"abstract":"Background: Patient-reported outcomes (PROs) are important clinical outcomes widely used as primary and secondary endpoints in clinical studies. However, PRO data often suffers from missing values for various reasons, which pose challenges to data analysis. This simulation study aimed to compare the performance of existing state-of-the-art approaches in handling missing PRO data.Methods: Using a real and complete multiple-item PRO dataset, we generated various missing scenarios with different missing rates, mechanisms, and patterns. The performances of eight methods were compared, including a mixed model for repeated measures (MMRM) with and without imputation at the item level, multiple imputation by chained equations (MICE) at the composite score and item levels, and three control-based pattern mixture models (PPMs) and the last observation carried forward (LOCF) imputation at the item level.Results: We found that the bias (i.e., deviation of the estimated from the true value) in the treatment effect estimates increased, and the statistical power diminished as the missing rate increased, especially for monotonic missing data. Item-level imputation led to a smaller bias and less reduction in power than composite score-level imputation. Except for cases under missing-not-at-random mechanisms (MNAR) and with a high proportion of patients' entire questionnaire missing, MMRM imputation at the item level demonstrated the lowest bias and highest power, followed by MICE imputation at the item level. The PPM methods were superior to the other methods under MNAR mechanisms.Conclusions: PPMs imputation at the item level was preferable for MNAR, whereas MMRM and MICE imputation at the item level were better for other scenarios. These findings provide valuable insight for selecting appropriate methods for handling missing PRO data.","PeriodicalId":12980,"journal":{"name":"Health and Quality of Life Outcomes","volume":"23 1","pages":"34"},"PeriodicalIF":3.4000,"publicationDate":"2025-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11972534/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Health and Quality of Life Outcomes","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12955-025-02364-0","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}

引用次数: 0

Abstract

Background: Patient-reported outcomes (PROs) are important clinical outcomes widely used as primary and secondary endpoints in clinical studies. However, PRO data often suffers from missing values for various reasons, which pose challenges to data analysis. This simulation study aimed to compare the performance of existing state-of-the-art approaches in handling missing PRO data.

Methods: Using a real and complete multiple-item PRO dataset, we generated various missing scenarios with different missing rates, mechanisms, and patterns. The performances of eight methods were compared, including a mixed model for repeated measures (MMRM) with and without imputation at the item level, multiple imputation by chained equations (MICE) at the composite score and item levels, and three control-based pattern mixture models (PPMs) and the last observation carried forward (LOCF) imputation at the item level.

Results: We found that the bias (i.e., deviation of the estimated from the true value) in the treatment effect estimates increased, and the statistical power diminished as the missing rate increased, especially for monotonic missing data. Item-level imputation led to a smaller bias and less reduction in power than composite score-level imputation. Except for cases under missing-not-at-random mechanisms (MNAR) and with a high proportion of patients' entire questionnaire missing, MMRM imputation at the item level demonstrated the lowest bias and highest power, followed by MICE imputation at the item level. The PPM methods were superior to the other methods under MNAR mechanisms.

Conclusions: PPMs imputation at the item level was preferable for MNAR, whereas MMRM and MICE imputation at the item level were better for other scenarios. These findings provide valuable insight for selecting appropriate methods for handling missing PRO data.

查看原文本刊更多论文

在纵向多项病人报告结果中处理缺失数据的不同方法的比较：一项模拟研究。

背景：患者报告结局（pro）是重要的临床结局，在临床研究中被广泛用作主要和次要终点。然而，由于各种原因，PRO数据经常存在缺失值，这给数据分析带来了挑战。本模拟研究旨在比较现有的最先进的方法在处理丢失的PRO数据方面的性能。方法：使用真实完整的多项目PRO数据集，我们生成了具有不同缺失率、机制和模式的各种缺失场景。比较了8种方法的性能，包括在项目水平上采用重复测量混合模型（MMRM）、在综合得分和项目水平上采用链式方程混合模型（MICE）、在项目水平上采用基于控制的模式混合模型（PPMs）和最后观察结转模型（LOCF）。结果：我们发现治疗效果估计的偏倚（即估计值与真实值的偏差）随着缺失率的增加而增加，统计效力随着缺失率的增加而降低，特别是单调缺失数据。项目水平的归算比综合得分水平的归算导致更小的偏差和更小的功率降低。除了缺失非随机机制（MNAR）和缺失全部问卷比例较高的情况外，MMRM在项目水平上的归咎表现出最低的偏倚和最高的有效性，其次是MICE在项目水平上的归咎。在MNAR机制下，PPM方法优于其他方法。结论：在MNAR情景中，项目水平的PPMs归算更有效，而在其他情景中，项目水平的MMRM和MICE归算更有效。这些发现为选择合适的方法来处理缺失的PRO数据提供了有价值的见解。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Health and Quality of Life Outcomes 医学-卫生保健

CiteScore

7.30

自引率

2.80%

发文量

154

审稿时长

3-8 weeks

期刊介绍： Health and Quality of Life Outcomes is an open access, peer-reviewed, journal offering high quality articles, rapid publication and wide diffusion in the public domain. Health and Quality of Life Outcomes considers original manuscripts on the Health-Related Quality of Life (HRQOL) assessment for evaluation of medical and psychosocial interventions. It also considers approaches and studies on psychometric properties of HRQOL and patient reported outcome measures, including cultural validation of instruments if they provide information about the impact of interventions. The journal publishes study protocols and reviews summarising the present state of knowledge concerning a particular aspect of HRQOL and patient reported outcome measures. Reviews should generally follow systematic review methodology. Comments on articles and letters to the editor are welcome.