Longitudinal Methods Versus Multiple Imputation to Infer Missing Maternal Data in Registry-Based Pregnancy Studies.

IF 2.5 3区医学 Q2 OBSTETRICS & GYNECOLOGY

Paediatric and perinatal epidemiology Pub Date : 2025-06-27 DOI:10.1111/ppe.70011

Takamasa Sakai, Hedvig Nordeng, Marleen M H J van Gelder

{"title":"Longitudinal Methods Versus Multiple Imputation to Infer Missing Maternal Data in Registry-Based Pregnancy Studies.","authors":"Takamasa Sakai, Hedvig Nordeng, Marleen M H J van Gelder","doi":"10.1111/ppe.70011","DOIUrl":null,"url":null,"abstract":"Background: In birth registries, incomplete recording of information leads to missing values. Multiple imputation (MI) by chained equations is a widely used method for analysing datasets with missing data. It is unknown whether using registry records from multiple pregnancies contributed by the same woman could potentially give more accurate values when resolving missing data.Objectives: To investigate the relative performance of five methods to infer missing data on maternal characteristics using data from a medical birth registry, comparing longitudinal methods and MI with data from previous and future pregnancies.Methods: We used data from the Medical Birth Registry of Norway (MBRN), selecting records among mothers with more than one pregnancy between 2004 and 2018. Longitudinal methods used reference pregnancies in three time directions: past, future and closest pregnancy record. MI was conducted with only index pregnancy records (single-pregnancy MI) and with both index and closest reference pregnancy records (multiple-pregnancy MI). Validity was assessed by comparing the actual values with inferred/imputed values. For continuous variables, we calculated the proportion of inferred values within predefined increments. For binary variables, we calculated five parameters: agreement rate, sensitivity, specificity, positive predictive value and negative predictive value.Results: We included 578,670 pregnancies among 256,658 women. For continuous variables, the longitudinal methods showed the highest proportion within predefined increments, followed by multiple-pregnancy MI, and single-pregnancy MI showed the lowest value. For binary variables, longitudinal methods generally showed higher values among the five validity parameters than MI. Single-pregnancy MI had substantially lower agreement, while multiple-pregnancy MI performed similarly to longitudinal methods.Conclusions: The longitudinal method outperformed MI in inferring missing data on maternal characteristics in a medical birth registry.","PeriodicalId":19698,"journal":{"name":"Paediatric and perinatal epidemiology","volume":" ","pages":""},"PeriodicalIF":2.5000,"publicationDate":"2025-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Paediatric and perinatal epidemiology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1111/ppe.70011","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"OBSTETRICS & GYNECOLOGY","Score":null,"Total":0}

引用次数: 0

Abstract

Background: In birth registries, incomplete recording of information leads to missing values. Multiple imputation (MI) by chained equations is a widely used method for analysing datasets with missing data. It is unknown whether using registry records from multiple pregnancies contributed by the same woman could potentially give more accurate values when resolving missing data.

Objectives: To investigate the relative performance of five methods to infer missing data on maternal characteristics using data from a medical birth registry, comparing longitudinal methods and MI with data from previous and future pregnancies.

Methods: We used data from the Medical Birth Registry of Norway (MBRN), selecting records among mothers with more than one pregnancy between 2004 and 2018. Longitudinal methods used reference pregnancies in three time directions: past, future and closest pregnancy record. MI was conducted with only index pregnancy records (single-pregnancy MI) and with both index and closest reference pregnancy records (multiple-pregnancy MI). Validity was assessed by comparing the actual values with inferred/imputed values. For continuous variables, we calculated the proportion of inferred values within predefined increments. For binary variables, we calculated five parameters: agreement rate, sensitivity, specificity, positive predictive value and negative predictive value.

Results: We included 578,670 pregnancies among 256,658 women. For continuous variables, the longitudinal methods showed the highest proportion within predefined increments, followed by multiple-pregnancy MI, and single-pregnancy MI showed the lowest value. For binary variables, longitudinal methods generally showed higher values among the five validity parameters than MI. Single-pregnancy MI had substantially lower agreement, while multiple-pregnancy MI performed similarly to longitudinal methods.

Conclusions: The longitudinal method outperformed MI in inferring missing data on maternal characteristics in a medical birth registry.

查看原文本刊更多论文

在基于登记的妊娠研究中，纵向方法与多重归算推断缺失的孕产妇数据。

背景：在出生登记中，信息记录的不完整导致值的缺失。链式方程的多重插值（MI）是一种广泛应用于缺失数据集分析的方法。目前尚不清楚使用同一名妇女多胎妊娠的登记记录是否可能在解决缺失数据时提供更准确的值。目的：利用医学出生登记处的数据，比较纵向方法和MI与以往和未来妊娠的数据，研究五种推断产妇特征缺失数据的方法的相对性能。方法：我们使用挪威医学出生登记处（MBRN）的数据，选择2004年至2018年间怀孕一次以上的母亲的记录。纵向方法采用参照妊娠三个时间方向：过去、未来和最近妊娠记录。仅使用指数妊娠记录（单次妊娠MI）和同时使用指数和最接近的参考妊娠记录（多次妊娠MI）进行MI。通过比较实际值与推断/估算值来评估有效性。对于连续变量，我们计算了预定义增量内推断值的比例。对于二元变量，我们计算了五个参数：符合率、敏感性、特异性、阳性预测值和阴性预测值。结果：我们纳入了256,658名妇女中578,670名孕妇。对于连续变量，纵向方法在预定义增量内的比例最高，其次是多胎MI，单胎MI最低。对于二元变量，纵向方法在五个效度参数中显示的值普遍高于MI。单胎MI的一致性明显较低，而多胎MI的结果与纵向方法相似。结论：纵向方法在推断医学出生登记中缺失的产妇特征数据方面优于MI。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Paediatric and perinatal epidemiology 医学-妇产科学

CiteScore

5.40

自引率

7.10%

发文量

审稿时长

1 months

期刊介绍： Paediatric and Perinatal Epidemiology crosses the boundaries between the epidemiologist and the paediatrician, obstetrician or specialist in child health, ensuring that important paediatric and perinatal studies reach those clinicians for whom the results are especially relevant. In addition to original research articles, the Journal also includes commentaries, book reviews and annotations.