Melissa P Wilson, Kristine M Erlandson, Camille M Moore, Samantha MaWhinney
{"title":"The effects of missing data due to study dropout on longitudinal analysis inference using outcome-dependent sampling.","authors":"Melissa P Wilson, Kristine M Erlandson, Camille M Moore, Samantha MaWhinney","doi":"10.1093/ije/dyaf150","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Existing longitudinal cohort study data and associated biospecimen libraries provide abundant opportunities to efficiently examine new hypotheses through retrospective specimen testing. Outcome-dependent sampling (ODS) methods offer a powerful alternative to random sampling when testing all available specimens is not feasible or biospecimen preservation is desired. For repeated binary outcomes, a common ODS approach is to extend the case-control framework to the longitudinal setting.For ODS designs, we consider the impact of incomplete follow-up when missingness is completely at random (MCAR), missing at random (MAR), and missing not at random (MNAR). We further consider sampling from (i) complete cases, in which, in an attempt to maximize power, participants who dropped out before study completion were excluded; and (ii) all individuals, including those with incomplete follow-up.</p><p><strong>Methods: </strong>Simulation studies based on the Advancing Clinical Therapeutics Globally HIV Infection, Aging, and Immune Function Long-Term Observational Study cohort were used to examine the impact of MCAR, MAR, and MNAR missingness, assuming specimens were sampled from either (i) complete cases; or (ii) all individuals. Three ODS analytical methods were considered.</p><p><strong>Results: </strong>When longitudinal data are MNAR, ODS methods exhibit bias similar to that seen in random sampling. MNAR and MAR bias is exacerbated when sampling only participants with complete follow-up. Simulations indicate that ODS analyses that include participants with incomplete follow-up are robust to MCAR and less biased by MAR missingness.</p><p><strong>Conclusion: </strong>Dropout is common in longitudinal cohort studies. Investigators utilizing ODS methods must consider the effect of dropout in both the retrospective sampling design and analysis.</p>","PeriodicalId":14147,"journal":{"name":"International journal of epidemiology","volume":"54 5","pages":""},"PeriodicalIF":5.9000,"publicationDate":"2025-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International journal of epidemiology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1093/ije/dyaf150","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Existing longitudinal cohort study data and associated biospecimen libraries provide abundant opportunities to efficiently examine new hypotheses through retrospective specimen testing. Outcome-dependent sampling (ODS) methods offer a powerful alternative to random sampling when testing all available specimens is not feasible or biospecimen preservation is desired. For repeated binary outcomes, a common ODS approach is to extend the case-control framework to the longitudinal setting.For ODS designs, we consider the impact of incomplete follow-up when missingness is completely at random (MCAR), missing at random (MAR), and missing not at random (MNAR). We further consider sampling from (i) complete cases, in which, in an attempt to maximize power, participants who dropped out before study completion were excluded; and (ii) all individuals, including those with incomplete follow-up.
Methods: Simulation studies based on the Advancing Clinical Therapeutics Globally HIV Infection, Aging, and Immune Function Long-Term Observational Study cohort were used to examine the impact of MCAR, MAR, and MNAR missingness, assuming specimens were sampled from either (i) complete cases; or (ii) all individuals. Three ODS analytical methods were considered.
Results: When longitudinal data are MNAR, ODS methods exhibit bias similar to that seen in random sampling. MNAR and MAR bias is exacerbated when sampling only participants with complete follow-up. Simulations indicate that ODS analyses that include participants with incomplete follow-up are robust to MCAR and less biased by MAR missingness.
Conclusion: Dropout is common in longitudinal cohort studies. Investigators utilizing ODS methods must consider the effect of dropout in both the retrospective sampling design and analysis.
期刊介绍:
The International Journal of Epidemiology is a vital resource for individuals seeking to stay updated on the latest advancements and emerging trends in the field of epidemiology worldwide.
The journal fosters communication among researchers, educators, and practitioners involved in the study, teaching, and application of epidemiology pertaining to both communicable and non-communicable diseases. It also includes research on health services and medical care.
Furthermore, the journal presents new methodologies in epidemiology and statistics, catering to professionals working in social and preventive medicine. Published six times a year, the International Journal of Epidemiology provides a comprehensive platform for the analysis of data.
Overall, this journal is an indispensable tool for staying informed and connected within the dynamic realm of epidemiology.